BAISA, Vít and Vojtěch KOVÁŘ. Information Extraction for Czech Based on Syntactic Analysis. In Zygmunt Vetulani, Joseph Mariani. Human Language Technology Challenges for Computer Science and Linguistics. Cham: Springer, 2014, p. 155-165. ISBN 978-3-319-08957-7. Available from: https://dx.doi.org/10.1007/978-3-319-08958-4_13.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name Information Extraction for Czech Based on Syntactic Analysis
Authors BAISA, Vít (203 Czech Republic, belonging to the institution) and Vojtěch KOVÁŘ (203 Czech Republic, guarantor, belonging to the institution).
Edition Cham, Human Language Technology Challenges for Computer Science and Linguistics, p. 155-165, 11 pp. 2014.
Publisher Springer
Other information
Original language English
Type of outcome Proceedings paper
Field of Study 10201 Computer sciences, information science, bioinformatics
Country of publisher Switzerland
Confidentiality degree is not subject to a state or trade secret
Publication form printed version "print"
Impact factor Impact factor: 0.402 in 2005
RIV identification code RIV/00216224:14330/14:00073242
Organization unit Faculty of Informatics
ISBN 978-3-319-08957-7
ISSN 0302-9743
Doi http://dx.doi.org/10.1007/978-3-319-08958-4_13
UT WoS 000345651500013
Keywords (in Czech) extrakce informací, čeština, syntaktická analýza
Keywords in English information extraction; Czech language; syntactic analysis
Tags International impact, Reviewed
Changed by Changed by: RNDr. Pavel Šmerk, Ph.D., učo 3880. Changed: 1/4/2015 09:06.
Abstract
We present a complex pipeline of natural language processing tools for Czech that performs extraction of basic facts presented in a text. The input for the tool is a plain text, the output contains verb and noun phrases with basic semantic classification. Automatic syntactic analysis of Czech plays a crucial role in the pipeline. In this paper, we describe the particular tools used in the system, then we give an example of its usage and conclude with a basic evaluation of the overall system accuracy.
Links
GAP401/10/0792, research and development projectName: Temporální aspekty znalostí a informací
Investor: Czech Science Foundation
GA407/07/0679, research and development projectName: Právní e-slovník - PES
Investor: Czech Science Foundation, Legal e-dictionary - PES
VF20102014003, research and development projectName: Analýza přirozeného jazyka v prostředí internetu (Acronym: APJI)
Investor: Ministry of the Interior of the CR
PrintDisplayed: 25/4/2024 10:26