ŠIGUT, Petr and Tomáš FOLTÝNEK. Can we detect ChatGPT-generated texts in Czech and Slovak languages? In Aleš Horák, Pavel Rychlý, Adam Rambousek. Proceedings of the Sixteenth Workshop on Recent Advances in Slavonic Natural Languages Processing, RASLAN 2023. Brno: Tribun EU, 2023, p. 35-43. ISBN 978-80-263-1793-7.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name Can we detect ChatGPT-generated texts in Czech and Slovak languages?
Authors ŠIGUT, Petr (703 Slovakia, belonging to the institution) and Tomáš FOLTÝNEK (203 Czech Republic, guarantor, belonging to the institution).
Edition Brno, Proceedings of the Sixteenth Workshop on Recent Advances in Slavonic Natural Languages Processing, RASLAN 2023, p. 35-43, 9 pp. 2023.
Publisher Tribun EU
Other information
Original language English
Type of outcome Proceedings paper
Field of Study 10200 1.2 Computer and information sciences
Country of publisher Czech Republic
Confidentiality degree is not subject to a state or trade secret
Publication form printed version "print"
WWW URL
RIV identification code RIV/00216224:14330/23:00132775
Organization unit Faculty of Informatics
ISBN 978-80-263-1793-7
ISSN 2336-4289
Keywords in English ChatGPT; AI-detection; Czech; Slovak
Tags Reviewed
Changed by Changed by: Mgr. Tomáš Foltýnek, Ph.D., učo 4374. Changed: 15/8/2024 09:39.
Abstract
The wide availability of generative AI exacerbates existing threats to society. It would not be easy even for linguists to tell whether the text we are reading was generated by a Large Language Model (LLM) or written by a human. Researchers have started developing tools that detect AI-generated content. This paper tested how two of these tools, Compilatio and GPT-2 Output Detector, performed with Czech, Slovak and English texts. There was only one tool somewhat capable of detecting AI-generated texts: Compilatio. Other tools were designed to work only with English texts. Hence, we also tested whether automatically translating the Czech and Slovak texts to English before uploading them to the detectors would have given any promising results. Ultimately, we showed that the texts generated by ChatGPT4 were less detectable than the texts generated by ChatGPT3.5.
PrintDisplayed: 10/10/2024 18:33