publication . Article . 2019

Detecting Similarity in Paraphrased Persian Texts using Semantic and Probabilistic Methods

Nasrollah Pakniat; Azadeh Mohebi;
Open Access Persian
  • Published: 01 Sep 2019 Journal: Iranian Journal of Information Processing & Management, volume 34, issue 4, pages 1823-1848 (ISSN: 2251-8223, eISSN: 2251-8231)
  • Publisher: Iranian Research Institute for Information and Technology
Abstract
Plagiarism detection is the process of locating instances of plagiarism within a work or document. The main component of a plagiarism detection system is its text alignment algorithm, which aims to detect paraphrased passages in a suspicious document using a small set of candidate source documents. Because text alignment algorithms are highly language-dependent, the numerous existing algorithms for languages other than Persian cannot be employed for Persian plagiarism detection purposes. Several text alignment algorithms exist for Persian text, but most of them can detect only exactly identical passages shared between texts. However,...
Subjects
ACM Computing Classification System: ComputingMethodologies_DOCUMENTANDTEXTPROCESSING
free text keywords: plagiarism, semantic text alignment, probabilistic text alignment, paraphrased texts, lcsh:Bibliography. Library science. Information resources, lcsh:Z