Multilingual CPA: Linking Verb Patterns across Languages

BAISA, Vít, Sara MOŽE a Irene RENAU. Multilingual CPA: Linking Verb Patterns across Languages. Online. In Tinatin Margalitadze, George Meladze. Proceedings of the XVII EURALEX International congress. Tbilisi: Ivane Javakhishvili Tbilisi State University, 2016, s. 410-417. ISBN 978-9941-13-542-2.

Další formáty: BibTeX LaTeX RIS

Základní údaje
Originální název	Multilingual CPA: Linking Verb Patterns across Languages
Autoři	BAISA, Vít (203 Česká republika, garant, domácí), Sara MOŽE (705 Slovinsko) a Irene RENAU (724 Španělsko).
Vydání	Tbilisi, Proceedings of the XVII EURALEX International congress, od s. 410-417, 8 s. 2016.
Nakladatel	Ivane Javakhishvili Tbilisi State University

Další údaje
Originální jazyk	angličtina
Typ výsledku	Stať ve sborníku
Obor	10201 Computer sciences, information science, bioinformatics
Stát vydavatele	Gruzie
Utajení	není předmětem státního či obchodního tajemství
Forma vydání	elektronická verze "online"
Kód RIV	RIV/00216224:14330/16:00090692
Organizační jednotka	Fakulta informatiky
ISBN	978-9941-13-542-2
UT WoS	000392695200044
Klíčová slova anglicky	Corpus Pattern Analysis; corpus lexicography; multilingual resources; verb patterns
Příznaky	Mezinárodní význam, Recenzováno
Změnil	Změnil: Mgr. Michal Petr, učo 65024. Změněno: 20. 7. 2018 14:41.

Anotace

This paper presents the results of a pilot study in linking corresponding English and Spanish verb patterns using both automatic and manual procedures. Our work is rooted in Corpus Pattern Analysis (CPA) (Hanks 2004, 2013), a corpus-driven technique that was used in the creation of existing monolingual pattern dictionaries of English and Spanish verbs, which were used in our experiment to design a gold standard of manually annotated verb pattern pairs. Research in CPA has inspired parallel projects in English, Spanish, Italian and German. Our study represents the first attempt to build a multilingual lexical resource by linking verb patterns in these languages. Verb have special difficulties related to grammar and argument structure that we do not find in other parts-of-speech, and for that reason we think that it is necessary to create a specific resource for them. After applying the automatic matching to a set of 87 Spanish verbs linked to 176 English verbs, an evaluation of a random selection of 50 of these pairs show 80% precision

Návaznosti
7F14047, projekt VaV	Název: Harvesting big text data for under-resourced languages (Akronym: HaBiT)
7F14047, projekt VaV	Investor: Ministerstvo školství, mládeže a tělovýchovy ČR, Harvesting big text data for under-resourced languages

VytisknoutZobrazeno: 5. 5. 2024 18:25

Multilingual CPA: Linking Verb Patterns across Languages

Další aplikace