Graded and Word-Sense-Disambiguation Decisions in Corpus
Pattern Analysis: a Pilot Study

CINKOVA, Silvie, Ema KREJČOVÁ, Anna VERNEROVÁ a Vít BAISA. Graded and Word-Sense-Disambiguation Decisions in Corpus Pattern Analysis: a Pilot Study. Online. In Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Marko Grobelnik and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis. Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016). Portorož, Slovenia: European Language Resources Association (ELRA), 2016, s. 848-854. ISBN 978-2-9517408-9-1.

Další formáty: BibTeX LaTeX RIS

Základní údaje
Originální název	Graded and Word-Sense-Disambiguation Decisions in Corpus Pattern Analysis: a Pilot Study
Autoři	CINKOVA, Silvie (203 Česká republika), Ema KREJČOVÁ (203 Česká republika), Anna VERNEROVÁ (203 Česká republika) a Vít BAISA (203 Česká republika, domácí).
Vydání	Portorož, Slovenia, Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), od s. 848-854, 7 s. 2016.
Nakladatel	European Language Resources Association (ELRA)

Další údaje
Originální jazyk	angličtina
Typ výsledku	Stať ve sborníku
Obor	10201 Computer sciences, information science, bioinformatics
Stát vydavatele	Slovinsko
Utajení	není předmětem státního či obchodního tajemství
Forma vydání	elektronická verze "online"
Kód RIV	RIV/00216224:14330/16:00090038
Organizační jednotka	Fakulta informatiky
ISBN	978-2-9517408-9-1
Klíčová slova anglicky	CPA; graded decisions; English; verbs; usage patterns; annotation; Likert scales
Štítky	firank_B
Příznaky	Mezinárodní význam, Recenzováno
Změnil	Změnil: Mgr. et Mgr. Vít Baisa, Ph.D., učo 139654. Změněno: 27. 5. 2016 13:35.

Anotace

We present a pilot analysis of a new linguistic resource, VPS-GradeUp (available at http://hdl.handle.net/11234/1-1585 ). The resource contains 11,400 graded human decisions on usage patterns of 29 English lexical verbs, randomly selected from the Pattern Dictionary of English Verbs (Hanks, 2000 2014). The selection was random and based on their frequency and the number of senses their lemmas have in PDEV. This data set has been created to observe the interannotator agreement on PDEV patterns produced using the Corpus Pattern Analysis (Hanks, 2013). Apart from the graded decisions, the data set also contains traditional Word-Sense-Disambiguation (WSD) labels. We analyze the associations between the graded annotation and WSD annotation. The results of the respective annotations do not correlate with the size of the usage pattern inventory for the respective verbs lemmas, which makes the data set worth further linguistic analysis.

Návaznosti
LM2015071, projekt VaV	Název: Jazyková výzkumná infrastruktura v České republice (Akronym: LINDAT-Clarin)
LM2015071, projekt VaV	Investor: Ministerstvo školství, mládeže a tělovýchovy ČR, Projekt LINDAT-Clarin - Vybudování a provoz českého uzlu pan-evropské infrastruktury pro výzkum
7F14047, projekt VaV	Název: Harvesting big text data for under-resourced languages (Akronym: HaBiT)
7F14047, projekt VaV	Investor: Ministerstvo školství, mládeže a tělovýchovy ČR, Harvesting big text data for under-resourced languages

VytisknoutZobrazeno: 17. 5. 2024 09:06

Graded and Word-Sense-Disambiguation Decisions in Corpus Pattern Analysis: a Pilot Study

Další aplikace