Information Extraction for Czech Based on Syntactic Analysis

D 2014

Information Extraction for Czech Based on Syntactic Analysis

BAISA, Vít and Vojtěch KOVÁŘ

Basic information

Original name

Information Extraction for Czech Based on Syntactic Analysis

Authors

BAISA, Vít (203 Czech Republic, belonging to the institution) and Vojtěch KOVÁŘ (203 Czech Republic, guarantor, belonging to the institution)

Edition

Cham, Human Language Technology Challenges for Computer Science and Linguistics, p. 155-165, 11 pp. 2014

Publisher

Springer

Other information

Language

English

Type of outcome

Stať ve sborníku

Field of Study

10201 Computer sciences, information science, bioinformatics

Country of publisher

Switzerland

Confidentiality degree

není předmětem státního či obchodního tajemství

Publication form

printed version "print"

Impact factor

Impact factor: 0.402 in 2005

RIV identification code

RIV/00216224:14330/14:00073242

Organization unit

Faculty of Informatics

ISBN

978-3-319-08957-7

ISSN

DOI

http://dx.doi.org/10.1007/978-3-319-08958-4_13

UT WoS

000345651500013

Keywords (in Czech)

extrakce informací, čeština, syntaktická analýza

Keywords in English

information extraction; Czech language; syntactic analysis

Abstract

V originále

We present a complex pipeline of natural language processing tools for Czech that performs extraction of basic facts presented in a text. The input for the tool is a plain text, the output contains verb and noun phrases with basic semantic classification. Automatic syntactic analysis of Czech plays a crucial role in the pipeline. In this paper, we describe the particular tools used in the system, then we give an example of its usage and conclude with a basic evaluation of the overall system accuracy.

Links

GAP401/10/0792, research and development project

Name: Temporální aspekty znalostí a informací

Investor: Czech Science Foundation

GA407/07/0679, research and development project

Name: Právní e-slovník - PES

Investor: Czech Science Foundation, Legal e-dictionary - PES

VF20102014003, research and development project

Name: Analýza přirozeného jazyka v prostředí internetu (Acronym: APJI)

Investor: Ministry of the Interior of the CR

Detailed Information on Publication Record