Detailed Information on Publication Record
1999
Automatic Structuring of Written Texts
VEBER, Marek, Aleš HORÁK, Rostislav JULINEK and Pavel SMRŽBasic information
Original name
Automatic Structuring of Written Texts
Name in Czech
Automatické strukturování psaných textů
Authors
VEBER, Marek (203 Czech Republic), Aleš HORÁK (203 Czech Republic, guarantor), Rostislav JULINEK (203 Czech Republic) and Pavel SMRŽ (203 Czech Republic)
Edition
Berlin, Proceedings of 2nd International Conference on Text, Speech, and Dialogue (TSD 1999), p. 101-104, 4 pp. 1999
Publisher
Springer-Verlag
Other information
Language
English
Type of outcome
Stať ve sborníku
Field of Study
20200 2.2 Electrical engineering, Electronic engineering, Information engineering
Country of publisher
Germany
Confidentiality degree
není předmětem státního či obchodního tajemství
References:
Impact factor
Impact factor: 0.872
RIV identification code
RIV/00216224:14330/99:00001590
Organization unit
Faculty of Informatics
ISBN
3-540-66494-7
ISSN
UT WoS
000089259200018
Keywords (in Czech)
struktura textu
Keywords in English
text structure
Tags
Tags
International impact, Reviewed
Změněno: 25/3/2010 11:53, doc. RNDr. Aleš Horák, Ph.D.
Abstract
V originále
This paper deals with automatic structuring and sentence boundary labelling in natural language texts. We describe the implemented structure tagging algorithm and heuristic rules that are used for automatic or semiautomatic labelling. Inside the detected sentence the algorithm performs a decomposition to clauses and then marks the parts of text which do not form a sentence, i.e. headings, signatures, tables and other structured data. We also pay attention to the processing of matched symbols in the text, especially to the analysis of direct speech notation.
Links
VS97028, research and development project |
|