D 2002

Automated Selection of Interesting Medical Text Documents by the TEA Text Analyzer

ŽIŽKA, Jan and Aleš BOUREK

Basic information

Original name

Automated Selection of Interesting Medical Text Documents by the TEA Text Analyzer

Authors

ŽIŽKA, Jan (203 Czech Republic, guarantor) and Aleš BOUREK (203 Czech Republic)

Edition

Berlin, Heidelberg, Germany, Third International Conference on Intelligent Text Processing and Computational Linguistics CICLing-2002 Proceedings, Mexico City, February 2002. p. 402-404, 2002

Publisher

Springer-Verlag

Other information

Language

English

Type of outcome

Stať ve sborníku

Field of Study

20200 2.2 Electrical engineering, Electronic engineering, Information engineering

Country of publisher

Czech Republic

Confidentiality degree

není předmětem státního či obchodního tajemství

RIV identification code

RIV/00216224:14330/02:00004820

Organization unit

Faculty of Informatics

ISBN

3-540-43219-1

Keywords in English

machine learnig; text-document classification; automated selection; unstructured text; Bayes classification; dictionary modification
Změněno: 15/5/2003 11:58, doc. Ing. Jan Žižka, CSc.

Abstract

V originále

The paper briefly describes the experience in the automated selection of interesting medical text documents by the TEA text analyzer based on the naive Bayes classifier. Even if the used type of the classifier provides generally good results, physicians needed certain supporting functions to obtain really interesting medical text documents, for example, from resources like the Internet. The influence of the functions is summarized and discussed. In addition, some remaining problems are mentioned.

Links

MSM 143300003, plan (intention)
Name: Interakce člověka s počítačem, dialogové systémy a asistivní technologie
Investor: Ministry of Education, Youth and Sports of the CR, Human-computer interaction, dialog systems and assistive technologies