Digital Repository

Plagiarism detection educational tool: A student’s assessments similarity checker

Show simple item record

dc.contributor.author Jayakody, J.R.K.C.
dc.date.accessioned 2017-01-04T09:29:43Z
dc.date.available 2017-01-04T09:29:43Z
dc.date.issued 2016
dc.identifier.citation Jayakody, J.R.K.C. 2016. Plagiarism detection educational tool: A student’s assessments similarity checker. In Proceedings of the International Research Symposium on Pure and Applied Sciences (IRSPAS 2016), Faculty of Science, University of Kelaniya, Sri Lanka. p 70. en_US
dc.identifier.isbn 978-955-704-008-0
dc.identifier.uri http://repository.kln.ac.lk/handle/123456789/15724
dc.description.abstract Plagiarism is very common among students in higher education institutes due to many reasons such as lack of knowledge about the subject, poor academic writing skills or difficulty in meeting a given deadline. The most popular method of plagiarism is to use the online web pages or e-books as it is an easy effort to get the contents from internet, change it and to submit as an original work. Hence, there are bunch of online software tools as well as offline tools exists to detect the plagiarism. However, there are less software tools to identify the copied works among students. Therefore, in this research I developed a plagiarism detection tool to identify the plagiarized assignments or tutorial submitted. Individual assignments and tutorials which had been given to software engineering courses of the Department of Computing and Information System of Wayamba University were used as the dataset. Natural language processing algorithms were developed to derive the statistical features from the assignments such as bag of words, most frequent words, number of words, name entities and paragraphs etc. Moreover, Term Frequency and Inverted Document Frequency (TF-IDF) module was developed to generate a similarity index value among assignments. In addition, Latent semantic analysis module was developed with the word dictionary and vector corpus. Features that were generated and extracted from every module were used to identify the clusters of similar assignments. K-mean clustering algorithms in rapid minor were used to identify the clusters. Most of the submitted assignments were identified with number of clusters. Once the clustering results were verified with the students, it was evident that fairly good results were the given by the automatic cluster classification. en_US
dc.language.iso en en_US
dc.publisher Faculty of Science, University of Kelaniya, Sri Lanka en_US
dc.subject Plagiarism en_US
dc.subject Natural Language Processing en_US
dc.subject Term frequency en_US
dc.subject Inverted Document Frequency en_US
dc.title Plagiarism detection educational tool: A student’s assessments similarity checker en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Digital Repository


Browse

My Account