NEVĚŘILOVÁ, Zuzana. Idiomatic Expressions in VerbaLex. In Horák A., Rychlý P., Rambousek, A. Proceedings of the Eleventh Workshop on Recent Advances in Slavonic Natural Language Processing, RASLAN 2017. Brno: Tribun EU, 2017, p. 59-67. ISBN 978-80-263-1340-3.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name Idiomatic Expressions in VerbaLex
Authors NEVĚŘILOVÁ, Zuzana (203 Czech Republic, guarantor, belonging to the institution).
Edition Brno, Proceedings of the Eleventh Workshop on Recent Advances in Slavonic Natural Language Processing, RASLAN 2017, p. 59-67, 9 pp. 2017.
Publisher Tribun EU
Other information
Original language English
Type of outcome Proceedings paper
Field of Study 10201 Computer sciences, information science, bioinformatics
Country of publisher Czech Republic
Confidentiality degree is not subject to a state or trade secret
Publication form printed version "print"
WWW URL
RIV identification code RIV/00216224:14330/17:00099004
Organization unit Faculty of Informatics
ISBN 978-80-263-1340-3
ISSN 2336-4289
UT WoS 000426613500007
Keywords (in Czech) idiomy; slovesné fráze; slovesné valence; valenční slovník; korpus
Keywords in English idioms; verb phrases; verb frames; valency lexicon; corpus
Tags International impact
Changed by Changed by: RNDr. Zuzana Nevěřilová, Ph.D., učo 3839. Changed: 26/4/2018 08:08.
Abstract
Idiomatic expressions are part of everyday language, therefore NLP applications that can ``understand'' idioms are desirable. The nature of idioms is somewhat heterogenous - idioms form classes differing in many aspects (e.g. syntactic structure, lexical and syntactic fixedness). Although dictionaries of idioms exist, they usually do not contain information about fixedness or frequency since they are intended to be used by humans, not computer programs. In this work, we propose how to deal with idioms in the valency lexicon VerbaLex using automatically extracted information from the largest dictionary Czech idioms and a web corpus.
Links
LM2015071, research and development projectName: Jazyková výzkumná infrastruktura v České republice (Acronym: LINDAT-Clarin)
Investor: Ministry of Education, Youth and Sports of the CR
PrintDisplayed: 26/7/2024 07:30