LLOD schema for Simplified Offensive Language Taxonomy in
multilingual detection and applications

LEWANDOWSKA-TOMASZCZYK, Barbara, Anna BĄCZKOWSKA, Olga DONTCHEVA-NAVRÁTILOVÁ, Chaya LIEBESKIND, Giedrė VALŪNAITĖ OLEŠKEVIČIENĖ, Slavko ŽITNIK, Marvin TROJSZCZAK, Renata POVOLNÁ, Linas SELMISTRAITIS, Andrius UTKA and Dangis GUDELIS. LLOD schema for Simplified Offensive Language Taxonomy in multilingual detection and applications. Lodz Papers in Pragmatics. Germany: De Gruyter, 2023, vol. 19, No 2, p. 301-324. ISSN 1895-6106. Available from: https://dx.doi.org/10.1515/lpp-2023-0016.

Basic information
Original name	LLOD schema for Simplified Offensive Language Taxonomy in multilingual detection and applications
Authors	LEWANDOWSKA-TOMASZCZYK, Barbara (616 Poland), Anna BĄCZKOWSKA (616 Poland), Olga DONTCHEVA-NAVRÁTILOVÁ (100 Bulgaria, guarantor, belonging to the institution), Chaya LIEBESKIND, Giedrė VALŪNAITĖ OLEŠKEVIČIENĖ (440 Lithuania), Slavko ŽITNIK (705 Slovenia), Marvin TROJSZCZAK (616 Poland), Renata POVOLNÁ (203 Czech Republic, belonging to the institution), Linas SELMISTRAITIS (440 Lithuania), Andrius UTKA (440 Lithuania) and Dangis GUDELIS (440 Lithuania).
Edition	Lodz Papers in Pragmatics, Germany, De Gruyter, 2023, 1895-6106.

Other information
Original language	English
Type of outcome	Article in a journal
Field of Study	60203 Linguistics
Country of publisher	Germany
Confidentiality degree	is not subject to a state or trade secret
RIV identification code	RIV/00216224:14410/23:00133087
Organization unit	Faculty of Education
Doi	http://dx.doi.org/10.1515/lpp-2023-0016
Keywords in English	offensive language; offensive language taxonomy; annotation; LLOD; linguistic linked open data; hate speech
Tags	International impact, Reviewed
Changed by	Mgr. Daniela Marcollová, učo 111148. Changed: 25/1/2024 09:27.

Abstract

This paper presents the Simplified Offensive Language (SOL) Taxonomy and its application and testing in the Second Annotation Campaign, conducted between March and May 2023 on four languages (English, Czech, Lithuanian, and Polish), with a view to verifying the taxonomy and locating it in LLOD. The number and variety of the categories were revised definitionally with reference to earlier Offensive Language taxonomic models, most of them proposed by the same COST Action Nexus Linguarum WG 4.1.1 team, and the resulting typology was tested by annotating publicly available offensive language datasets in each of the four languages. The annotation results are presented; because inter-annotator agreement on the SOL categories and their aspects falls within accepted statistical values, we propose this taxonomy as a core ontology that represents the encoding of the supported offensive languages, and we justify its use on new data in terms of a more universal Linguistic Linked Open Data (LLOD) schema.
