Conceptual similarity and graph-based method for plagiarism detection

Plagiarism is a form of academic misconduct. It has increased rapidly because it is now quick and easy to reach data and information through electronic documents and the Internet. The problem occurs when found documents content is illegal and without permission or citation, this problem is known as...

Full description

Bibliographic Details
Main Authors: Osman, Ahmed Hamza, Salim, Naomie, Binwahlan, Mohammed Salem, Hentably, Hamza, Ali, Albaraa M.
Format: Article
Published: Asian Research Publishing Network (A R P N) 2011
Subjects:
_version_ 1796858624178388992
author Osman, Ahmed Hamza
Salim, Naomie
Binwahlan, Mohammed Salem
Hentably, Hamza
Ali, Albaraa M.
author_facet Osman, Ahmed Hamza
Salim, Naomie
Binwahlan, Mohammed Salem
Hentably, Hamza
Ali, Albaraa M.
author_sort Osman, Ahmed Hamza
collection ePrints
description Plagiarism is a form of academic misconduct. It has increased rapidly because it is now quick and easy to reach data and information through electronic documents and the Internet. The problem occurs when found documents content is illegal and without permission or citation, this problem is known as plagiarism. One of the major challenges is to detect the plagiarism and illegal copy. This paper discusses a new representation method for text documents called text graph-based representation. The proposed method does not represent the content of a text document as a graph only, but also captures the underlying semantic meaning in terms of the relationships among its concepts in order to defeat the difficulty which the traditional plagiarism detection systems face with some kinds of plagiarism such as complicated plagiarism in which users can reword the plagiarized part or replace some words by their synonyms. The experiments have been carried out using PAN-PC-09 standardization of plagiarism detection corpus. The results showed that our method remarkably outperforms the modern methods for plagiarism detection.
first_indexed 2024-03-05T19:15:13Z
format Article
id utm.eprints-44810
institution Universiti Teknologi Malaysia - ePrints
last_indexed 2024-03-05T19:15:13Z
publishDate 2011
publisher Asian Research Publishing Network (A R P N)
record_format dspace
spelling utm.eprints-448102017-09-14T01:54:43Z http://eprints.utm.my/44810/ Conceptual similarity and graph-based method for plagiarism detection Osman, Ahmed Hamza Salim, Naomie Binwahlan, Mohammed Salem Hentably, Hamza Ali, Albaraa M. PN Literature (General) Plagiarism is a form of academic misconduct. It has increased rapidly because it is now quick and easy to reach data and information through electronic documents and the Internet. The problem occurs when found documents content is illegal and without permission or citation, this problem is known as plagiarism. One of the major challenges is to detect the plagiarism and illegal copy. This paper discusses a new representation method for text documents called text graph-based representation. The proposed method does not represent the content of a text document as a graph only, but also captures the underlying semantic meaning in terms of the relationships among its concepts in order to defeat the difficulty which the traditional plagiarism detection systems face with some kinds of plagiarism such as complicated plagiarism in which users can reword the plagiarized part or replace some words by their synonyms. The experiments have been carried out using PAN-PC-09 standardization of plagiarism detection corpus. The results showed that our method remarkably outperforms the modern methods for plagiarism detection. Asian Research Publishing Network (A R P N) 2011 Article PeerReviewed Osman, Ahmed Hamza and Salim, Naomie and Binwahlan, Mohammed Salem and Hentably, Hamza and Ali, Albaraa M. (2011) Conceptual similarity and graph-based method for plagiarism detection. Journal of Theoretical and Applied Information Technology, 32 (2). pp. 135-145. ISSN 1992-8645
spellingShingle PN Literature (General)
Osman, Ahmed Hamza
Salim, Naomie
Binwahlan, Mohammed Salem
Hentably, Hamza
Ali, Albaraa M.
Conceptual similarity and graph-based method for plagiarism detection
title Conceptual similarity and graph-based method for plagiarism detection
title_full Conceptual similarity and graph-based method for plagiarism detection
title_fullStr Conceptual similarity and graph-based method for plagiarism detection
title_full_unstemmed Conceptual similarity and graph-based method for plagiarism detection
title_short Conceptual similarity and graph-based method for plagiarism detection
title_sort conceptual similarity and graph based method for plagiarism detection
topic PN Literature (General)
work_keys_str_mv AT osmanahmedhamza conceptualsimilarityandgraphbasedmethodforplagiarismdetection
AT salimnaomie conceptualsimilarityandgraphbasedmethodforplagiarismdetection
AT binwahlanmohammedsalem conceptualsimilarityandgraphbasedmethodforplagiarismdetection
AT hentablyhamza conceptualsimilarityandgraphbasedmethodforplagiarismdetection
AT alialbaraam conceptualsimilarityandgraphbasedmethodforplagiarismdetection