Language Resources for Intelligent Processing of Dialogues about Electrical Networks

HORÁK, Aleš, Lukáš SVOBODA, Vladimír KADLEC and Pavel CENEK. Language Resources for Intelligent Processing of Dialogues about Electrical Networks. In Proceedings of ElNet 2005. Ostrava: VŠB TU Ostrava, 2006, p. 42-49, 7 pp. ISBN 80-248-0975-3.

Basic information
Original name	Language Resources for Intelligent Processing of Dialogues about Electrical Networks
Name in Czech	Jazykové zdroje pro inteligentní zpracování dialogů o elektrických sítích
Authors	HORÁK, Aleš (203 Czech Republic, guarantor), Lukáš SVOBODA (203 Czech Republic), Vladimír KADLEC (203 Czech Republic) and Pavel CENEK (203 Czech Republic).
Edition	Ostrava, Proceedings of ElNet 2005, p. 42-49, 7 pp. 2006.
Publisher	VŠB TU Ostrava

Other information
Original language	English
Type of outcome	Proceedings paper
Field of Study	10201 Computer sciences, information science, bioinformatics
Country of publisher	Czech Republic
Confidentiality degree	is not subject to a state or trade secret
RIV identification code	RIV/00216224:14330/06:00015281
Organization unit	Faculty of Informatics
ISBN	80-248-0975-3
Keywords in English	corpora; question answering; disambiguation; electrical networks
Tags	corpora, disambiguation, electrical networks, question answering
Tags	Reviewed
Changed by	doc. RNDr. Aleš Horák, Ph.D., učo 1648. Changed: 9/1/2007 11:21.

Abstract

The paper describes the design of a natural language dialogue interface for querying large databases of temporal data about electrical power network failures. The first stage of implementing such an interface is the creation and preparation of several auxiliary resources required for natural language processing of texts in this specific domain. Modern methods for automatic analysis of texts in a domain with specialized terminology rely on a large collection of texts from the field, a so-called text corpus. We describe the process and statistical results of creating a corpus of electrical power network texts containing more than 100,000 positions (words and marks). We also present preliminary results of syntactic analysis of these texts. In the last part of the paper, we present the design of a dialogue system, based on analysis techniques that use the corpus data, which will allow natural language queries (in Czech) over the database of power network failures.

Links
1ET100300414, research and development project	Name: Intelligent methods for increasing the reliability of electrical networks (Inteligentní metody pro zvýšení spolehlivosti elektrických sítí)
1ET100300414, research and development project	Investor: Academy of Sciences of the Czech Republic, Intelligent methods for increasing the reliability of electrical networks
