Distinguishing Human From Machine: A Review of Advances and Challenges in AI-Generated Text Detection

Author
Keywords
Abstract
The rise of Large Language Models (LLMs) has dramatically altered the generation and spreading of textual content. This advancement offers benefits in various domains, including medicine, education, law, coding, and journalism, but also has negative implications, mainly related to ethical concerns. Preventing measures to mitigate negative implications pass through solutions that distinguish machine-generated text from humanwritten text. This study aims to provide a comprehensive review of existing literature for detecting LLMgenerated texts. Emerging techniques are categorized into five categories: watermarking, feature-based, neural-based, hybrid, and human-aided methods. For each introduced category, strengths and limitations are discussed, providing insights into their effectiveness and potential for future improvements. Moreover, available datasets and tools are introduced. Results demonstrate that, despite the good delimited performance, the multitude of languages to recognize, hybrid texts, the continuous improvement of algorithms for text generation and the lack of regulation require additional efforts for efficient detection.
Year of Publication
2025
Journal
International Journal of Interactive Multimedia and Artificial Intelligence
Volume
9
Start Page
6
Issue
Regular issue
Number
3
Number of Pages
6-18
Date Published
06/2025
ISSN Number
1989-1660
URL
DOI
Attachment
Acknowledgment
This work was partially supported by project SERICS (PE00000014) under the MUR National Recovery and Resilience Plan funded by the European Union - NextGenerationEU.