Detailed Information on Publication Record
2023
Can we detect ChatGPT-generated texts in Czech and Slovak languages?
ŠIGUT, Petr and Tomáš FOLTÝNEK
Basic information
Original name
Can we detect ChatGPT-generated texts in Czech and Slovak languages?
Authors
ŠIGUT, Petr (703 Slovakia, belonging to the institution) and Tomáš FOLTÝNEK (203 Czech Republic, guarantor, belonging to the institution)
Edition
Brno, Proceedings of the Sixteenth Workshop on Recent Advances in Slavonic Natural Language Processing, RASLAN 2023, p. 35-43, 9 pp. 2023
Publisher
Tribun EU
Other information
Language
English
Type of outcome
Proceedings paper
Field of Study
10200 1.2 Computer and information sciences
Country of publisher
Czech Republic
Confidentiality degree
is not subject to a state or trade secret
Publication form
printed version "print"
References:
RIV identification code
RIV/00216224:14330/23:00132775
Organization unit
Faculty of Informatics
ISBN
978-80-263-1793-7
ISSN
Keywords in English
ChatGPT; AI-detection; Czech; Slovak
Tags
Reviewed
Changed: 15/8/2024 09:39, Mgr. Tomáš Foltýnek, Ph.D.
Abstract
In the original
The wide availability of generative AI exacerbates existing threats to society. It would not be easy even for linguists to tell whether the text we are reading was generated by a Large Language Model (LLM) or written by a human. Researchers have started developing tools that detect AI-generated content. This paper tested how two of these tools, Compilatio and GPT-2 Output Detector, performed with Czech, Slovak and English texts. There was only one tool somewhat capable of detecting AI-generated texts: Compilatio. Other tools were designed to work only with English texts. Hence, we also tested whether automatically translating the Czech and Slovak texts to English before uploading them to the detectors would have given any promising results. Ultimately, we showed that the texts generated by ChatGPT4 were less detectable than the texts generated by ChatGPT3.5.