SMRŽ, Pavel and Pavel RYCHLÝ. Finding Semantically Related Words in Large Corpora. In Text, Speech and Dialogue, 4th International Conference, TSD 2001. Berlin: Springer-Verlag, 2001, p. 108-115. LNAI 2166. ISBN 3-540-42557-8.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name Finding Semantically Related Words in Large Corpora
Authors SMRŽ, Pavel and Pavel RYCHLÝ.
Edition Berlin, Text, Speech and Dialogue, 4th International Conference, TSD 2001, p. 108-115, LNAI 2166, 2001.
Publisher Springer-Verlag
Other information
Original language English
Type of outcome Proceedings paper
Field of Study 20206 Computer hardware and architecture
Country of publisher Czech Republic
Confidentiality degree is not subject to a state or trade secret
WWW URL
RIV identification code RIV/00216224:14330/01:00004534
Organization unit Faculty of Informatics
ISBN 3-540-42557-8
Keywords in English natural language processing; large corpus; semantically related words
Tags large corpus, natural language processing, semantically related words
Changed by Changed by: doc. RNDr. Pavel Smrž, Ph.D., učo 1297. Changed: 2/11/2001 12:11.
Abstract
The paper deals with the linguistic problem of fully automatic grouping of semantically related words. We discuss the measures of semantic relatedness of basic word forms and describe the treatment of collocations. Next we present the procedure of hierarchical clustering of a very large number of semantically related words and give examples of the resulting partitioning of data in the form of dendrogram. Finally we show a form of the output presentation that facilitates the inspection of the resulting word clusters.
Links
MSM 143300003, plan (intention)Name: Interakce člověka s počítačem, dialogové systémy a asistivní technologie
Investor: Ministry of Education, Youth and Sports of the CR, Human-computer interaction, dialog systems and assistive technologies
PrintDisplayed: 10/6/2024 01:22