• shareshare
  • link
  • cite
  • add
Publication . Article . 2019

Semantic measure of plagiarism using a hierarchical graph model

Tingting Zhang; Baozhen Lee; Qinghua Zhu;
Closed Access
Published: 19 Aug 2019 Journal: Scientometrics, volume 121, pages 209-239 (issn: 0138-9130, eissn: 1588-2861, Copyright policy )
Publisher: Springer Science and Business Media LLC

Traditional plagiarism detection is based primarily on methods of character matching or topic similarity. Another promising methodology remains largely unexplored: employing deep mining to establish a contextual hierarchy among themes. This paper proposes a semantic approach to measuring the extent of plagiarism, based on a hierarchical graph model. The main innovations are as follows: (1) hierarchical extraction of topic feature terms and elucidation of a corresponding graph structure; (2) graph similarity calculation based on the maximum common subgraph. This semantic-measure method goes beyond semantic detection of topics to take into account the context of topic feature terms, as well as the hierarchical structure by which those topics are related. This contextual-hierarchical perspective should, in turn, improve the accuracy of plagiarism detection. In addition, by mining the implicit relationships between hierarchical feature terms, our method can detect plagiarized documents with similar themes but using different topic words: a potential boon to plagiarism detection recall. In an experiment conducted on a dataset from Chinese paper database CNKI, the semantic-measure method indeed demonstrates accuracy and recall superior to those achieved with current state-of-the-art methods.

Subjects by Vocabulary

Microsoft Academic Graph classification: Computer science Natural language processing computer.software_genre computer Recall Deep mining Graph similarity Graph model Graph (abstract data type) Plagiarism detection Artificial intelligence business.industry business


Library and Information Sciences, Computer Science Applications, General Social Sciences

Related to Research communities
Social Science and Humanities
Download from