SOJKA, Petr. Competing Patterns for Language Engineering. In Proceedings of Third International Workshop on Text, Speech and Dialogue, TSD 2000. Heidelberg: Springer-Verlag, 2000, p. 157-162. ISBN 3-540-41042-2.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name Competing Patterns for Language Engineering
Authors SOJKA, Petr (203 Czech Republic, guarantor).
Edition Heidelberg, Proceedings of Third International Workshop on Text, Speech and Dialogue, TSD 2000, p. 157-162, 2000.
Publisher Springer-Verlag
Other information
Original language English
Type of outcome Proceedings paper
Field of Study 20200 2.2 Electrical engineering, Electronic engineering, Information engineering
Country of publisher Germany
Confidentiality degree is not subject to a state or trade secret
WWW URL
RIV identification code RIV/00216224:14330/00:00000123
Organization unit Faculty of Informatics
ISBN 3-540-41042-2
UT WoS 000170595900027
Keywords in English patterns;finite automata;natural language processing;language engineering
Tags Finite Automata, language engineering, natural language processing, patterns
Tags International impact, Reviewed
Changed by Changed by: doc. RNDr. Petr Sojka, Ph.D., učo 2378. Changed: 15/6/2009 21:26.
Abstract
In this paper we describe a method of effective handling of linguistic data by means of \emph{covering and inhibiting patterns}---patterns that ``compete'' each other. A methodology of developing such patterns is outlined. Applications in the areas of morphology, hyphenation and part-of-speech tagging are shown. This pattern-driven approach to language engineering allows the combination of linguist expertise with the data learned from corpora---layering of knowledge. Searching for information in pattern database (dictionary problem) is blindingly fast---linear with respect to the length of searching word as with other finite-state approaches.
Links
MSM 143300003, plan (intention)Name: Interakce člověka s počítačem, dialogové systémy a asistivní technologie
Investor: Ministry of Education, Youth and Sports of the CR, Human-computer interaction, dialog systems and assistive technologies
VS97028, research and development projectName: Laboratoř zpracování přirozeného jazyka (s aplikacemi pro podporu výuky zrakově postižených)
Investor: Ministry of Education, Youth and Sports of the CR, Natural Language Processing Laboratory (with applications supporting education of people with limited sight)
PrintDisplayed: 7/6/2024 08:36