Enhancing Anaphora Resolution for Czech

D 2007

Enhancing Anaphora Resolution for Czech

NĚMČÍK, Václav

Basic information

Original name

Enhancing Anaphora Resolution for Czech

Name in Czech

Jak lépe rozpoznávat anaforické vztahy v češtině

Authors

NĚMČÍK, Václav (203 Czech Republic, guarantor, belonging to the institution)

Edition

1. vyd. Brno, RASLAN 2007, p. 57-62, 6 pp. 2007

Publisher

Masarykova Univerzita

Other information

Language

English

Type of outcome

Proceedings paper

Field of Study

20200 2.2 Electrical engineering, Electronic engineering, Information engineering

Country of publisher

Czech Republic

Confidentiality degree

is not subject to a state or trade secret

Publication form

printed version "print"

References:

URL

RIV identification code

RIV/00216224:14330/07:00023097

Organization unit

Faculty of Informatics

ISBN

978-80-210-4471-5

UT WoS

000268015500007

Keywords in English

anaphora resolution; linguistic resources; verb valency; semantic plausibility; WordNet; Czech

Abstract

ORIG CZ

V originále

Resolution of anaphoric reference is one of the most important challenges in natural language processing (NLP). Functionality of most NLP systems crucially relies on an accurate mechanism for determining which expressions in the input refer to the same entity in the real world. The immense complexity of this issue has led the research community to adopt predominantly knowledge-poor methods, despite the fact that these are known to be incapable of solving this task reliably. This paper suggests several ways of extending such methods by further resources and mechanisms in order to arrive at a more adequate anaphora resolution procedure. First, the paper sketches how to exploit information about verb valencies or co-occurrence statistics to account for semantic plausibility of individual antecedent candidates. Further, several ways of adapting ML-based AR methods are suggested, so that they account for the structure of the AR task more closely.

In Czech

Rozpoznávání anaforických vztahů je jedním z nejdůležitějších úkolů v automatickém zpracování přirozeného jazyka (ZPJ). Funkčnost většiny systémů ZPJ závisí na vhodnm mechanismu, jenž spolehlivě určí, které výrazy ve vstupním textu odkazují ke stejným entitám ve světě. Obtížnost této problematiky má za následek, že se v praxi používají metody zanedbávající sémantické znalosti, přestože jsou k řešení této úlohy potřeba. Tento článek předkládá několik způsobů, jak tyto metody rozšířit a zvýšit tak jejich úspěšnost. Je nastíněno, jak lze využít informace o slovesných valencích nebo společných výskytech slov k posouzení sémantické vhodnosti kandidátů na antecedent. Dále jsou v článku navrženy možné úpravy metod založených na strojovém učení, aby lépe odrážely strukturu úlohy rozpoznávání anaforických vztahů.

Links

LC536, research and development project

Name: Centrum komputační lingvistiky

Investor: Ministry of Education, Youth and Sports of the CR, Centrum komputační lingvistiky

Přehled o publikaci