BUŠTA, Jan. Type-based Search of Idiomatic Expression. In Aleš Horák, Pavel Rychlý. Seventh Workshop on Recent Advances in Slavonic Natural Language Processing, RASLAN 2013. Brno: Tribun EU, 2013, p. 93-96. ISBN 978-80-263-0520-0.
Basic information
Original name Type-based Search of Idiomatic Expression
Authors BUŠTA, Jan (203 Czech Republic, guarantor, belonging to the institution).
Edition Brno, Seventh Workshop on Recent Advances in Slavonic Natural Language Processing, RASLAN 2013, p. 93-96, 4 pp. 2013.
Publisher Tribun EU
Other information
Original language English
Type of outcome Proceedings paper
Field of Study 10201 Computer sciences, information science, bioinformatics
Country of publisher Czech Republic
Confidentiality degree is not subject to a state or trade secret
Publication form printed version "print"
RIV identification code RIV/00216224:14330/13:00070342
Organization unit Faculty of Informatics
ISBN 978-80-263-0520-0
Keywords in English idioms; idiomatic candidates; syntactic fixedness; lexical fixedness; transitive verbs; thesaurus
Tags International impact, Reviewed
Changed by: Mgr. Jan Bušta, učo 172959. Changed: 1/6/2021 07:46.
Abstract
This paper presents an evaluation of different approaches to extracting verb-noun idiomatic expressions in Czech. These approaches are based on the structure of the idiom and its behavior in the language. PMI, together with syntactic and lexical fixedness measures modified using VerbaLex and a generated thesaurus, provides a useful tool for choosing the best idiomatic candidates for manual annotation and evaluation. Moreover, we focused on adapting the algorithms to Czech in general.
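As an illustration of the PMI measure named in the abstract, the following is a minimal Python sketch of pointwise mutual information over verb-noun pairs. It is not the paper's implementation: the function name `pmi_scores`, the toy pair list, and the use of plain relative frequencies as probability estimates are all assumptions made for this example, and the fixedness measures, VerbaLex, and the thesaurus are not modeled here.

```python
import math
from collections import Counter


def pmi_scores(pairs):
    """Return PMI (log base 2) for each distinct (verb, noun) pair.

    `pairs` is a list of (verb, noun) tuples, one per observed co-occurrence.
    Probabilities are estimated as relative frequencies (an assumption of
    this sketch, not something stated in the abstract).
    """
    pair_counts = Counter(pairs)
    verb_counts = Counter(verb for verb, _ in pairs)
    noun_counts = Counter(noun for _, noun in pairs)
    total = len(pairs)

    scores = {}
    for (verb, noun), count in pair_counts.items():
        p_pair = count / total
        p_verb = verb_counts[verb] / total
        p_noun = noun_counts[noun] / total
        # PMI(v, n) = log2( P(v, n) / (P(v) * P(n)) )
        scores[(verb, noun)] = math.log2(p_pair / (p_verb * p_noun))
    return scores


# Hypothetical toy data, not taken from any corpus used in the paper.
toy_pairs = [
    ("hodit", "flinta"),
    ("hodit", "flinta"),
    ("hodit", "kámen"),
    ("koupit", "kámen"),
]

# Rank candidates from highest to lowest PMI.
ranked = sorted(pmi_scores(toy_pairs).items(), key=lambda item: item[1], reverse=True)
for (verb, noun), score in ranked:
    print(f"{verb} {noun}: {score:.3f}")
```

Pairs that co-occur more often than their individual frequencies would predict receive a positive score, which is why PMI can serve as a first filter for idiom candidates before manual annotation.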
Links
LM2010013, research and development project
Name: LINDAT-CLARIN: Institut pro analýzu, zpracování a distribuci lingvistických dat (Acronym: LINDAT-Clarin)
Investor: Ministry of Education, Youth and Sports of the CR