Language Resources for Intelligent Processing of Dialogues
about Electrical Networks

HORÁK, Aleš, Lukáš SVOBODA, Vladimír KADLEC and Pavel CENEK. Language Resources for Intelligent Processing of Dialogues about Electrical Networks. In Proceedings of ElNet 2005. Ostrava: VŠB TU Ostrava, 2006, pp. 42-49, 7 pp. ISBN 80-248-0975-3.


Basic information
Original title	Language Resources for Intelligent Processing of Dialogues about Electrical Networks
Title in Czech	Jazykové zdroje pro inteligentní zpracování dialogů o elektrických sítích
Authors	HORÁK, Aleš (203 Czech Republic, guarantor), Lukáš SVOBODA (203 Czech Republic), Vladimír KADLEC (203 Czech Republic) and Pavel CENEK (203 Czech Republic).
Edition	Ostrava, Proceedings of ElNet 2005, pp. 42-49, 7 pp., 2006.
Publisher	VŠB TU Ostrava

Other information
Original language	English
Type of outcome	Proceedings paper
Field of study	10201 Computer sciences, information science, bioinformatics
Country of publisher	Czech Republic
Confidentiality	not subject to state or trade secrecy
RIV code	RIV/00216224:14330/06:00015281
Organizational unit	Faculty of Informatics
ISBN	80-248-0975-3
Keywords in English	corpora; question answering; disambiguation; electrical networks
Tags	corpora, disambiguation, electrical networks, question answering
Flags	Peer-reviewed
Changed by	doc. RNDr. Aleš Horák, Ph.D., učo 1648. Changed: 9. 1. 2007 11:21.

Abstract

The paper describes the design of a natural language dialogue interface for querying large databases of temporal data about electrical power network failures. The first stage of implementing such a dialogue interface is to create and prepare several auxiliary resources required for natural language processing of texts in this specific domain. All modern methods for automatic analysis of texts from a domain with special terminology rely on a large collection of texts from the field, a so-called text corpus. We describe the creation of a corpus of electrical power network texts comprising more than 100,000 positions (words and marks), together with the resulting statistics. We also present preliminary results of the syntactic analysis of these texts. In the last part of the paper, we present the design of a dialogue system, based on analysis techniques that use the corpus data, which will allow natural language queries (in Czech) over the database of power network failures.
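To illustrate what counting corpus "positions (words and marks)" can mean, the following is a minimal sketch and not the authors' tooling. It assumes that a position is either a run of word characters or a single punctuation mark; the tokenizer, the function names and the sample Czech sentence are all hypothetical.

```python
import re
from collections import Counter

# Assumption: one corpus position = one word (run of word characters)
# or one punctuation mark. Real corpus tokenizers are more elaborate.
TOKEN_RE = re.compile(r"\w+|[^\w\s]")
WORD_RE = re.compile(r"\w+")


def tokenize(text):
    """Split text into positions (words and marks)."""
    return TOKEN_RE.findall(text)


def corpus_stats(text):
    """Return basic size statistics for a piece of corpus text."""
    tokens = tokenize(text)
    words = [t for t in tokens if WORD_RE.fullmatch(t)]
    return {
        "positions": len(tokens),
        "words": len(words),
        "marks": len(tokens) - len(words),
        "frequencies": Counter(w.lower() for w in words),
    }


# Hypothetical sample text, invented for this illustration.
sample = "Porucha vedení 22 kV byla odstraněna. Kdy došlo k výpadku?"
stats = corpus_stats(sample)
print(stats["positions"], stats["words"], stats["marks"])
```

Under these assumptions the sample sentence pair yields 12 positions: 10 words and 2 marks. A corpus size such as the one reported in the abstract would be the same count taken over all collected texts.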

Links
1ET100300414, R&D project	Name: Inteligentní metody pro zvýšení spolehlivosti elektrických sítí (Intelligent methods for increasing the reliability of electrical networks)
1ET100300414, R&D project	Investor: Academy of Sciences of the Czech Republic, Inteligentní metody pro zvýšení spolehlivosti elektrických sítí
