Can we detect ChatGPT-generated texts in Czech and Slovak languages?

D 2023

ŠIGUT, Petr and Tomáš FOLTÝNEK

Basic information

Original title

Can we detect ChatGPT-generated texts in Czech and Slovak languages?

Authors

ŠIGUT, Petr (703 Slovakia, belonging to the institution) and Tomáš FOLTÝNEK (203 Czech Republic, guarantor, belonging to the institution)

Edition

Brno, Proceedings of the Sixteenth Workshop on Recent Advances in Slavonic Natural Languages Processing, RASLAN 2023, pp. 35-43, 9 pp. 2023

Publisher

Tribun EU

Other information

Language

English

Type of outcome

Proceedings paper

Field of study

10200 1.2 Computer and information sciences

Country of publisher

Czech Republic

Confidentiality

is not subject to a state or trade secret

Publication form

printed version "print"

Links

URL

RIV identification code

RIV/00216224:14330/23:00132775

Organization unit

Faculty of Informatics

ISBN

978-80-263-1793-7

ISSN

Keywords in English

ChatGPT; AI-detection; Czech; Slovak

Tags

Reviewed

Changed: 15 August 2024 09:39, Mgr. Tomáš Foltýnek, Ph.D.

Abstract

In the original

The wide availability of generative AI exacerbates existing threats to society. It would not be easy even for linguists to tell whether the text we are reading was generated by a Large Language Model (LLM) or written by a human. Researchers have started developing tools that detect AI-generated content. This paper tested how two of these tools, Compilatio and GPT-2 Output Detector, performed with Czech, Slovak and English texts. There was only one tool somewhat capable of detecting AI-generated texts: Compilatio. Other tools were designed to work only with English texts. Hence, we also tested whether automatically translating the Czech and Slovak texts to English before uploading them to the detectors would have given any promising results. Ultimately, we showed that the texts generated by ChatGPT4 were less detectable than the texts generated by ChatGPT3.5.

Cite

ŠIGUT, Petr and Tomáš FOLTÝNEK. Can we detect ChatGPT-generated texts in Czech and Slovak languages? In Aleš Horák, Pavel Rychlý, Adam Rambousek. Proceedings of the Sixteenth Workshop on Recent Advances in Slavonic Natural Languages Processing, RASLAN 2023. Brno: Tribun EU, 2023, pp. 35-43. ISBN 978-80-263-1793-7.

@inproceedings{2355998,
   author = {Šigut, Petr and Foltýnek, Tomáš},
   address = {Brno},
   booktitle = {Proceedings of the Sixteenth Workshop on Recent Advances in Slavonic Natural Languages Processing, RASLAN 2023},
   editor = {Horák, Aleš and Rychlý, Pavel and Rambousek, Adam},
   keywords = {ChatGPT; AI-detection; Czech; Slovak},
   howpublished = {printed version "print"},
   language = {eng},
   location = {Brno},
   isbn = {978-80-263-1793-7},
   pages = {35-43},
   publisher = {Tribun EU},
   title = {Can we detect ChatGPT-generated texts in Czech and Slovak languages?},
   url = {http://nlp.fi.muni.cz/raslan/2023/paper10.pdf},
   year = {2023}
}

TY  - CONF
ID  - 2355998
AU  - Šigut, Petr
AU  - Foltýnek, Tomáš
PY  - 2023
TI  - Can we detect ChatGPT-generated texts in Czech and Slovak languages?
PB  - Tribun EU
CY  - Brno
SN  - 9788026317937
KW  - ChatGPT
KW  - AI-detection
KW  - Czech
KW  - Slovak
UR  - http://nlp.fi.muni.cz/raslan/2023/paper10.pdf
N2  - The wide availability of generative AI exacerbates existing threats to society. It would not be easy even for linguists to tell whether the text we are reading was generated by a Large Language Model (LLM) or written by a human. Researchers have started developing tools that detect AI-generated content. This paper tested how two of these tools, Compilatio and GPT-2 Output Detector, performed with Czech, Slovak and English texts. There was only one tool somewhat capable of detecting AI-generated texts: Compilatio. Other tools were designed to work only with English texts. Hence, we also tested whether automatically translating the Czech and Slovak texts to English before uploading them to the detectors would have given any promising results. Ultimately, we showed that the texts generated by ChatGPT4 were less detectable than the texts generated by ChatGPT3.5.
ER  -

ŠIGUT, Petr and Tomáš FOLTÝNEK. Can we detect ChatGPT-generated texts in Czech and Slovak languages? In Aleš Horák, Pavel Rychlý, Adam Rambousek. \textit{Proceedings of the Sixteenth Workshop on Recent Advances in Slavonic Natural Languages Processing, RASLAN 2023}. Brno: Tribun EU, 2023, pp.~35--43. ISBN~978-80-263-1793-7.
