Detailed Information on Publication Record
2001
Finding Semantically Related Words in Large Corpora
SMRŽ, Pavel and Pavel RYCHLÝ
Basic information
Original name
Finding Semantically Related Words in Large Corpora
Authors
SMRŽ, Pavel and Pavel RYCHLÝ
Edition
FIMU-RS-2001-02, 2001
Other information
Language
English
Type of outcome
Audiovisual works
Country of publisher
Czech Republic
Confidentiality degree
is not subject to a state or trade secret
Organization unit
Faculty of Informatics
Keywords in English
natural language; large corpus; semantically related words
Changed: 2/11/2001 13:04, doc. RNDr. Pavel Smrž, Ph.D.
Abstract
In the original
The paper deals with the linguistic problem of fully automatic grouping of semantically related words. We discuss measures of semantic relatedness of basic word forms and describe the treatment of collocations. Next, we present a procedure for hierarchical clustering of a very large number of semantically related words and give examples of the resulting partitioning of the data in the form of a dendrogram. Finally, we show a form of output presentation that facilitates the inspection of the resulting word clusters.