The Saara Framework: An Anaphora Resolution System for Czech

D 2009

NĚMČÍK, Václav

Basic information

Original name

The Saara Framework: An Anaphora Resolution System for Czech

Name in Czech

Saara: Systém pro automatickou analýzu anafor pro češtinu

Authors

NĚMČÍK, Václav (203 Czech Republic, guarantor, belonging to the institution)

Edition

1st ed. Brno: RASLAN 2009: Recent Advances in Slavonic Natural Language Processing, pp. 49-54, 6 pp., 2009

Publisher

Masaryk University

Other information

Language

English

Type of outcome

Proceedings paper

Field of Study

10201 Computer sciences, information science, bioinformatics

Country of publisher

Czech Republic

Confidentiality degree

is not subject to a state or trade secret

Publication form

printed version "print"

RIV identification code

RIV/00216224:14330/09:00038427

Organization unit

Faculty of Informatics

ISBN

978-80-210-5048-8

Keywords (in Czech)

anafora; čeština; syntaktická analýza

Keywords in English

anaphora; anaphora resolution; Czech; system; framework; syntactic analysis

Abstract

In the original

Determining reference and referential links in discourse is one of the biggest and most important challenges in natural language understanding. In particular, computing coreference classes over the set of referring expressions in text is crucial for its further syntactic and semantic processing. We present a system for automatic anaphora resolution that can be used on arbitrary texts in Czech. The article describes the individual phases of processing the input text and mentions selected issues that need to be addressed by the system.

In Czech

Určení reference a referenčních vztahů v diskursu je jedním z největších a nejdůležitějších úkolů v oblasti porozumění přirozenému jazyku. Zejména získání koreferenčních tříd nad referenčními výrazy v textu je klíčové pro jeho další syntaktické a sémantické zpracování. V tomto článku je představen systém pro automatické určování anaforických vztahů, který je použitelný na libovolné české texty. Článek popisuje jednotlivé fáze zpracování vstupního textu a věnuje se vybraným problémům, které se během zpracování musí řešit.

Links

LC536, research and development project

Name: Centrum komputační lingvistiky

Investor: Ministry of Education, Youth and Sports of the CR, Centrum komputační lingvistiky