D 2000

Competing Patterns for Language Engineering

SOJKA, Petr

Basic information

Original name

Competing Patterns for Language Engineering

Authors

SOJKA, Petr (203 Czech Republic, guarantor)

Edition

Heidelberg, Proceedings of Third International Workshop on Text, Speech and Dialogue, TSD 2000, p. 157-162, 2000

Publisher

Springer-Verlag

Other information

Language

English

Type of outcome

Proceedings paper

Field of Study

20200 2.2 Electrical engineering, Electronic engineering, Information engineering

Country of publisher

Germany

Confidentiality degree

is not subject to a state or trade secret

References:

RIV identification code

RIV/00216224:14330/00:00000123

Organization unit

Faculty of Informatics

ISBN

3-540-41042-2

UT WoS

000170595900027

Keywords in English

patterns;finite automata;natural language processing;language engineering

Tags

International impact, Reviewed
Changed: 15/6/2009 21:26, doc. RNDr. Petr Sojka, Ph.D.

Abstract

In the original

In this paper we describe a method for handling linguistic data effectively by means of covering and inhibiting patterns, that is, patterns that "compete" with each other. A methodology for developing such patterns is outlined. Applications in the areas of morphology, hyphenation, and part-of-speech tagging are shown. This pattern-driven approach to language engineering allows a linguist's expertise to be combined with data learned from corpora (layering of knowledge). Searching for information in a pattern database (the dictionary problem) is blindingly fast: linear in the length of the word being looked up, as with other finite-state approaches.
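
To illustrate how covering and inhibiting patterns can compete, here is a minimal sketch for the hyphenation case. It is not taken from the paper. It assumes the Liang-style convention used for TeX hyphenation patterns, where a digit inside a pattern marks a position, odd digits propose a break (covering), even digits forbid one (inhibiting), and the highest digit at each position wins. The function names and the two toy patterns are hypothetical. A plain dictionary of substrings stands in for the finite-state structure, so lookup is linear in word length only because the pattern length is bounded.

```python
def parse_pattern(pattern):
    """Split a pattern such as '1na' into its letters 'na' and levels [1, 0, 0]."""
    letters = []
    levels = [0]  # levels[i] is the level of the position before letter i
    for ch in pattern:
        if ch.isdigit():
            levels[-1] = int(ch)
        else:
            letters.append(ch)
            levels.append(0)
    return "".join(letters), levels


def build_table(patterns):
    """Map each pattern's letter string to its list of levels."""
    return dict(parse_pattern(p) for p in patterns)


def hyphenate(word, table):
    """Insert '-' at positions where the winning (maximum) level is odd."""
    padded = "." + word.lower() + "."  # '.' marks the word boundaries
    scores = [0] * (len(padded) + 1)   # scores[k]: position before padded[k]
    max_len = max((len(key) for key in table), default=0)

    # Try every substring no longer than the longest pattern.
    for start in range(len(padded)):
        stop = min(len(padded), start + max_len)
        for end in range(start + 1, stop + 1):
            levels = table.get(padded[start:end])
            if levels is None:
                continue
            # Patterns compete: the highest level at each position wins.
            for offset, level in enumerate(levels):
                pos = start + offset
                scores[pos] = max(scores[pos], level)

    pieces = []
    last = 0
    for i in range(1, len(word)):
        # The position before word[i] is the position before padded[i + 1].
        if scores[i + 1] % 2 == 1:
            pieces.append(word[last:i])
            last = i
    pieces.append(word[last:])
    return "-".join(pieces)


# Toy patterns (assumptions for illustration only):
# '1na' proposes a break before "na"; '2na.' forbids it at the end of a word.
covering_only = build_table(["1na"])
with_inhibitor = build_table(["1na", "2na."])
print(hyphenate("banana", covering_only))
print(hyphenate("banana", with_inhibitor))
```

With only the covering pattern, both occurrences of "na" receive a break; adding the higher-level inhibiting pattern overrides the proposal at the end of the word. Further levels can be layered in the same way, for example to combine hand-written rules with patterns learned from a corpus.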

Links

MSM 143300003, plan (intention)
Name: Interakce člověka s počítačem, dialogové systémy a asistivní technologie
Investor: Ministry of Education, Youth and Sports of the CR, Human-computer interaction, dialog systems and assistive technologies
VS97028, research and development project
Name: Laboratoř zpracování přirozeného jazyka (s aplikacemi pro podporu výuky zrakově postižených)
Investor: Ministry of Education, Youth and Sports of the CR, Natural Language Processing Laboratory (with applications supporting education of people with limited sight)