LLOD schema for Simplified Offensive Language Taxonomy in
multilingual detection and applications

J 2023

LLOD schema for Simplified Offensive Language Taxonomy in multilingual detection and applications

LEWANDOWSKA-TOMASZCZYK, Barbara; Anna BĄCZKOWSKA; Olga DONTCHEVA-NAVRÁTILOVÁ; Chaya LIEBESKIND; Giedrė VALŪNAITĖ OLEŠKEVIČIENĖ et al.

Základní údaje

Originální název

LLOD schema for Simplified Offensive Language Taxonomy in multilingual detection and applications

Autoři

Vydání

Lodz Papers in Pragmatics, Německo, De Gruyter, 2023, 1895-6106

Další údaje

Jazyk

angličtina

Typ výsledku

Článek v odborném periodiku

Obor

60203 Linguistics

Stát vydavatele

Německo

Utajení

není předmětem státního či obchodního tajemství

Odkazy

URL

Označené pro přenos do RIV

Ano

Kód RIV

RIV/00216224:14410/23:00133087

Organizační jednotka

Pedagogická fakulta

DOI

https://doi.org/10.1515/lpp-2023-0016

EID Scopus

2-s2.0-85180448082

Klíčová slova anglicky

offensive language; offensive language taxonomy; annotation; LLOD; linguistic linked open data; hate speech

Příznaky

Mezinárodní význam, Recenzováno

Změněno: 5. 3. 2026 11:55, doc. PhDr. Renata Povolná, Ph.D.

Anotace

V originále

The goal of the paper is to present a Simplified Offensive Language (SOL) Taxonomy, its application and testing in the Second Annotation Campaign conducted between March-May 2023 on four languages: English, Czech, Lithuanian, and Polish to be verified and located in LLOD. Making reference to the previous Offensive Language taxonomic models proposed mostly by the same COST Action Nexus Linguarum WG 4.1.1 team, the number and variety of the categories underwent the definitional revision, and the present typology was tested in the annotation on the publicly available offensive language datasets of each of the four languages. The results of the annotation are presented and as they are contained within the accepted statistical values on the inter-annotator agreement in the SOL categories and their aspects, we propose this taxonomy as a core ontology which represents the encoding of the supported offensive languages and justify its use on new data in terms of a more universal Linguistic Linked Open Data (LLOD) schema.

Citovat

LEWANDOWSKA-TOMASZCZYK, Barbara; Anna BĄCZKOWSKA; Olga DONTCHEVA-NAVRÁTILOVÁ; Chaya LIEBESKIND; Giedrė VALŪNAITĖ OLEŠKEVIČIENĖ; Slavko ŽITNIK; Marvin TROJSZCZAK; Renata POVOLNÁ; Linas SELMISTRAITIS; Andrius UTKA a Dangis GUDELIS. LLOD schema for Simplified Offensive Language Taxonomy in multilingual detection and applications. Lodz Papers in Pragmatics. Německo: De Gruyter, 2023, roč. 19, č. 2, s. 301-324. ISSN 1895-6106. Dostupné z: https://doi.org/10.1515/lpp-2023-0016.

@article{2362562,
   author = {LewandowskaandTomaszczyk, Barbara and Bączkowska, Anna and DontchevaandNavrátilová, Olga and Liebeskind, Chaya and Valūnaitė Oleškevičienė, Giedrė and Žitnik, Slavko and Trojszczak, Marvin and Povolná, Renata and Selmistraitis, Linas and Utka, Andrius and Gudelis, Dangis},
   article_location = {Německo},
   article_number = {2},
   doi = {https://doi.org/10.1515/lpp-2023-0016},
   keywords = {offensive language; offensive language taxonomy; annotation; LLOD; linguistic linked open data; hate speech},
   language = {eng},
   issn = {1895-6106},
   journal = {Lodz Papers in Pragmatics},
   title = {LLOD schema for Simplified Offensive Language Taxonomy in multilingual detection and applications},
   url = {https://www.degruyter.com/document/doi/10.1515/lpp-2023-0016/html},
   volume = {19},
   year = {2023}
}

TY  - JOUR
ID  - 2362562
AU  - Lewandowska-Tomaszczyk, Barbara - Bączkowska, Anna - Dontcheva-Navrátilová, Olga - Liebeskind, Chaya - Valūnaitė Oleškevičienė, Giedrė - Žitnik, Slavko - Trojszczak, Marvin - Povolná, Renata - Selmistraitis, Linas - Utka, Andrius - Gudelis, Dangis
PY  - 2023
TI  - LLOD schema for Simplified Offensive Language Taxonomy in multilingual detection and applications
JF  - Lodz Papers in Pragmatics
VL  - 19
IS  - 2
SP  - 301-324
EP  - 301-324
PB  - De Gruyter
SN  - 18956106
KW  - offensive language
KW  - offensive language taxonomy
KW  - annotation
KW  - LLOD
KW  - linguistic linked open data
KW  - hate speech
UR  - https://www.degruyter.com/document/doi/10.1515/lpp-2023-0016/html
N2  - The goal of the paper is to present a Simplified Offensive Language (SOL) Taxonomy, its application and testing in the Second Annotation Campaign conducted between March-May 2023 on four languages: English, Czech, Lithuanian, and Polish to be verified and located in LLOD. Making reference to the previous Offensive Language taxonomic models proposed mostly by the same COST Action Nexus Linguarum WG 4.1.1 team, the number and variety of the categories underwent the definitional revision, and the present typology was tested in the annotation on the publicly available offensive language datasets of each of the four languages. The results of the annotation are presented and as they are contained within the accepted statistical values on the inter-annotator agreement in the SOL categories and their aspects, we propose this taxonomy as a core ontology which represents the encoding of the supported offensive languages and justify its use on new data in terms of a more universal Linguistic Linked Open Data (LLOD) schema.
ER  -

LEWANDOWSKA-TOMASZCZYK, Barbara; Anna BĄCZKOWSKA; Olga DONTCHEVA-NAVRÁTILOVÁ; Chaya LIEBESKIND; Giedrė VALŪNAITĖ OLEŠKEVIČIENĖ; Slavko ŽITNIK; Marvin TROJSZCZAK; Renata POVOLNÁ; Linas SELMISTRAITIS; Andrius UTKA a Dangis GUDELIS. LLOD schema for Simplified Offensive Language Taxonomy in multilingual detection and applications. \textit{Lodz Papers in Pragmatics}. Německo: De Gruyter, 2023, roč.~19, č.~2, s.~301-324. ISSN~1895-6106. Dostupné z: https://doi.org/10.1515/lpp-2023-0016.

Přehled o publikaci