Comparative analysis of text-based plagiarism detection techniques.

PLoS One

Centre for Research in Data Science, Computer and Information Sciences Department, Universiti Teknologi Petronas, Perak, Malaysia.

Published: April 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

In text analysis, identifying plagiarism is a crucial area of study that looks for copied information in a document and determines whether or not the same author writes portions of the text. With the emergence of publicly available tools for content generation based on large language models, the problem of inherent plagiarism has grown in importance across various industries. Students are increasingly committing plagiarism as a result of the availability and use of computers in the classroom and the generally extensive accessibility of electronic information found on the internet. As a result, there is a rising need for reliable and precise detection techniques to deal with this changing environment. This paper compares several plagiarism detection techniques and looks into how well different detection systems can distinguish between content created by humans and content created by Artificial Intelligence (AI). This article systematically evaluates 189 research papers published between 2019 and 2024 to provide an overview of the research on computational approaches for plagiarism detection (PD). We suggest a new technically focused structure for efforts to prevent and identify plagiarism, types of plagiarism, and computational techniques for detecting plagiarism to organize the way the research contributions are presented. We demonstrated that the field of plagiarism detection is rife with ongoing research. Significant progress has been made in the field throughout the time we reviewed in terms of automatically identifying plagiarism that is highly obscured and hence difficult to recognize. The exploration of nontextual contents, the use of machine learning, and improved semantic text analysis techniques are the key sources of these advancements. Based on our analysis, we concluded that the combination of several analytical methodologies for textual and nontextual content features is the most promising subject for future research contributions to further improve the detection of plagiarism.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11977957PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0319551PLOS

Publication Analysis

Top Keywords

plagiarism detection
16
plagiarism
12
detection techniques
12
text analysis
8
identifying plagiarism
8
content created
8
detection
7
techniques
5
comparative analysis
4
analysis text-based
4

Similar Publications

Objectives: Generative Artificial Intelligence (AI) could transform how science is conducted, supporting researchers with writing, coding, peer review, and evidence synthesis. However, it is not yet known how eating disorder researchers utilize generative AI, and uncertainty remains regarding its safe, ethical, and transparent use. The Executive Committee of the International Journal of Eating Disorders disseminated a survey for eating disorder researchers investigating their practices and perspectives on generative AI, with the goal of informing guidelines on appropriate AI use for authors, reviewers, and editors.

View Article and Find Full Text PDF

The JsSAMDCs promotes the expression of polyamine synthesis genes and regulates the expression of flowering genes which in turn promotes the differentiation of female flower buds in Juglans sigillata Dode. Juglans sigillata is a typical dioecious plant, and its low female-to-male ratio has been a significant factor limiting J. sigillata yield.

View Article and Find Full Text PDF

Can we trust academic AI detective? Accuracy and limitations of AI-output detectors.

Acta Neurochir (Wien)

August 2025

Department of Neurosurgery, Istinye University Faculty of Medicine, Maltepe, İstinye Üniversitesi Topkapı Kampüsü, Teyyareci Sami Sk. No.3, 34010, Zeytinburnu/İstanbul, Türkiye.

Objective: This study evaluates the reliability and accuracy of AI-generated text detection tools in distinguishing human-authored academic content from AI-generated texts, highlighting potential challenges and ethical considerations in their application within the scientific community.

Methods: This study analyzed the detectability of AI-generated academic content using abstracts and introductions created by ChatGPT versions 3.5, 4, and 4o, alongside human-written originals from the pre-ChatGPT era.

View Article and Find Full Text PDF

Mechanism analysis of Salvia miltiorrhiza Bunge (Danshen) on circadian rhythm for treating myocardial injury by mathematic model.

Fitoterapia

August 2025

Key Laboratory of TCM-information Engineer of State Administration of TCM, School of Chinese Materia Medica, Beijing University of Chinese Medicine, Beijing 102488, China. Electronic address:

Aim Of The Study: Salvia miltiorrhiza Bunge (Danshen, SMB) is commonly used in the treatment of myocardial injury in cardiovascular and cerebrovascular diseases. In recent research, circadian rhythm disruption was identified as a potential reason of myocardial injury. In this study, we investigated the mechanism of SMB and its components exerting protective effect on myocardial injury through circadian rhythms pathway.

View Article and Find Full Text PDF