NEVĚŘILOVÁ, Zuzana. Syntactic Patterns of Czech Multiword Expressions. In Aleš Horák, Klára Osolsobě, Adam Rambousek, Pavel Rychlý. Slavonic Natural Language Processing in the 21st Century. Brno: Tribun EU, 2019, p. 174-184. ISBN 978-80-263-1545-2.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name Syntactic Patterns of Czech Multiword Expressions
Authors NEVĚŘILOVÁ, Zuzana.
Edition Brno, Slavonic Natural Language Processing in the 21st Century, p. 174-184, 11 pp. 2019.
Publisher Tribun EU
Other information
Type of outcome Proceedings paper
Confidentiality degree is not subject to a state or trade secret
ISBN 978-80-263-1545-2
Changed by Changed by: RNDr. Zuzana Nevěřilová, Ph.D., učo 3839. Changed: 24/5/2020 16:55.
Abstract
We focus on a MWE collection that we created in past works. We analyze the collection using K-means clustering of the MWE tags as they occur in a web corpus. Afterwards, we compare the collection with another Czech MWE collection, the SemLex. The comparison shows how different the data are. Our collection created from web corpus contains less formal language and exemplifies the use of noun phrases with noun modifiers, mainly in English borrowings. On the other hand, the SemLex collection is extracted from dataset containing mostly formal Czech and noun phrase with adjective modifier is the prevalent syntactic pattern.
PrintDisplayed: 24/8/2024 12:21