BURŠA, M., L. LHOTSKÁ, V. CHUDÁČEK, J. SPILKA, Petr JANKŮ and Martin HUSER. Effective free-text medical record processing and information retrieval. In Mian Long. IFMBE Proceedings: World Congress on Medical Physics and Biomedical Engineering. Berlin: Springer, 2012, p. 1305-1308. ISBN 978-3-642-29304-7.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name Effective free-text medical record processing and information retrieval.
Authors BURŠA, M. (203 Czech Republic, guarantor), L. LHOTSKÁ (203 Czech Republic), V. CHUDÁČEK (203 Czech Republic), J. SPILKA (203 Czech Republic), Petr JANKŮ (203 Czech Republic, belonging to the institution) and Martin HUSER (203 Czech Republic, belonging to the institution).
Edition Berlin, IFMBE Proceedings: World Congress on Medical Physics and Biomedical Engineering, p. 1305-1308, 4 pp. 2012.
Publisher Springer
Other information
Original language English
Type of outcome Proceedings paper
Field of Study 30214 Obstetrics and gynaecology
Country of publisher Czech Republic
Confidentiality degree is not subject to a state or trade secret
Publication form printed version "print"
RIV identification code RIV/00216224:14110/12:00064004
Organization unit Faculty of Medicine
ISBN 978-3-642-29304-7
Keywords in English Swarm Intelligence; Ant Colony; Textual Data Mining; Medical Record Processing; Hospital Information System
Changed by Changed by: prof. MUDr. Martin Huser, Ph.D., MBA, učo 185124. Changed: 13/10/2013 13:24.
Abstract
Information mining from textual data becomes a very challenging task when the structure of the text record is very loose without any rules. The task becomes even more difficult when natural language is used and no apriori knowledge is available. The medical environment itself is also very specific: the natural language used in textual description varies with the personality creating the record (there are many personalized approaches), however it is restricted by terminology (i.e. medical terms, medical standards, etc.). Moreover, the typical patient record is filled with typographical errors, duplicates, ambiguities, syntax errors and many (nonstandard) abbreviations. This paper describes the process of mining information from loosely structured medical textual records with no apriori knowledge. In the paper we depict the process of mining a large dataset of 50,000–120,000 records 20 attributes in database tables, originating from the hospital information system (thanks go to the University Hospital in Brno, Czech Republic) recording over 11 years. This paper concerns only textual attributes with free text input, that means 620,000 text fields in 16 attributes. Each attribute item contains ~800–1,500 characters (diagnoses, medications, etc.). The output of this task is a set of ordered/nominal attributes suitable for rule discovery mining and automated processing that can help in asphyxia prediction during delivery. The proposed technique has an important impact on reduction of the processing time of loosely structured textual records for experts. Note that this project is an ongoing process (and research) and new data are irregularly received from the medical facility, justifying the need for robust and fool-proof algorithms
PrintDisplayed: 27/5/2024 16:00