JAKUBÍČEK, Miloš, Vojtěch KOVÁŘ and Marek GRÁC. Through Low-Cost Annotation to Reliable Parsing Evaluation. In PACLIC 24 Proceedings of the 24th Pacific Asia Conference on Language, Information and Computation. Tokyo: Waseda University, 2010, p. 555-562. ISBN 978-4-905166-00-9.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name Through Low-Cost Annotation to Reliable Parsing Evaluation
Authors JAKUBÍČEK, Miloš (203 Czech Republic, belonging to the institution), Vojtěch KOVÁŘ (203 Czech Republic, guarantor, belonging to the institution) and Marek GRÁC (703 Slovakia, belonging to the institution).
Edition Tokyo, PACLIC 24 Proceedings of the 24th Pacific Asia Conference on Language, Information and Computation, p. 555-562, 8 pp. 2010.
Publisher Waseda University
Other information
Original language English
Type of outcome Proceedings paper
Field of Study 60200 6.2 Languages and Literature
Country of publisher Japan
Confidentiality degree is not subject to a state or trade secret
Publication form printed version "print"
WWW URL
RIV identification code RIV/00216224:14330/10:00065887
Organization unit Faculty of Informatics
ISBN 978-4-905166-00-9
Keywords in English noun phrases;parsing;parser evaluation;annotation;inter-annotator agreement
Tags best
Tags International impact, Reviewed
Changed by Changed by: RNDr. Pavel Šmerk, Ph.D., učo 3880. Changed: 30/4/2014 10:04.
Abstract
In this paper, we present an~application-driven low-cost concept of building a~multi-purpose language resource for Czech which is based on currently available results of previous work by various research teams active in the area of natural language processing. We particularly focus on the first phase which consists in extracting noun phrases from a~morphologically annotated corpus and providing a~simple and easy-to-use application for verifying them. For the extraction task, three Czech parsers have been accommodated and evaluated. Finally we discuss the currently achieved results in the context of ongoing work and show that they lead to consistent and reliable results.
Links
GAP401/10/0792, research and development projectName: Temporální aspekty znalostí a informací
Investor: Czech Science Foundation
LC536, research and development projectName: Centrum komputační lingvistiky
Investor: Ministry of Education, Youth and Sports of the CR, Centrum komputační lingvistiky
248307, interní kód MUName: Pattern Recognition-based Statistically Enhanced MT (Acronym: PRESEMT)
Investor: European Union, Pattern Recognition-based Statistically Enhanced MT, Cooperation
PrintDisplayed: 21/7/2024 21:24