KOVÁŘ, Vojtěch and Aleš HORÁK. Power Networks Dialogues - Automatic Analysis and Evaluation of a Domain-Specific Text Corpus. In Proceedings of ELNET 2007. Ostrava: Faculty of Electrical Engineering and Computer Science, VŠB - Technical University of Ostrava, 2007, p. 30-37. ISBN 978-80-248-1681-4.
Basic information
Original name Power Networks Dialogues - Automatic Analysis and Evaluation of a Domain-Specific Text Corpus
Name in Czech Dialogy o elektrorozvodných sítích - automatická analýza a vyhodnocení doménově specifického korpusu textů
Authors KOVÁŘ, Vojtěch (203 Czech Republic) and Aleš HORÁK (203 Czech Republic, guarantor).
Edition Ostrava, Proceedings of ELNET 2007, p. 30-37, 2007.
Publisher Faculty of Electrical Engineering and Computer Science, VŠB - Technical University of Ostrava
Other information
Original language English
Type of outcome Proceedings paper
Field of Study 10201 Computer sciences, information science, bioinformatics
Country of publisher Czech Republic
Confidentiality degree is not subject to a state or trade secret
RIV identification code RIV/00216224:14330/07:00019534
Organization unit Faculty of Informatics
ISBN 978-80-248-1681-4
Keywords in English power network; corpus; dialogue; domain-specific; synt; syntactic analysis
Tags corpus, Dialogue, domain-specific, power network, synt, syntactic analysis
Tags International impact, Reviewed
Changed by RNDr. Vojtěch Kovář, Ph.D., učo 139915. Changed: 20/10/2010 14:34.
Abstract
Automatic analysis of domain-specific dialogues is a special case of the general analysis of natural language texts. In this paper, we describe the creation of a fundamental resource for working with dialogues about electrical power networks: a corpus of 1 million tokens specialized in power network topics. We present the details of building such a corpus and the results of automatic analysis of its content, including term extraction, morphological disambiguation, and syntactic analysis of the domain-specific texts.
Abstract (in Czech)
Článek popisuje sestavení milionového specializovaného korpusu textů o elektrorozvodných sítích. Jsou popsány výsledky automatické analýzy obsahu korpusu jako extrakce termínů, morfologická desambiguace a syntaktická analýza doménově specifických textů.
Links
1ET100300414, research and development project
Name: Inteligentní metody pro zvýšení spolehlivosti elektrických sítí
Investor: Academy of Sciences of the Czech Republic, Intelligent methods for increasing the reliability of electrical networks
2C06009, research and development project
Name: Prostředky tvorby komplexní báze znalostí pro komunikaci se sémantickým webem v přirozeném jazyce (Acronym: COT-SEWing)
Investor: Ministry of Education, Youth and Sports of the CR