D 2001

Finding Semantically Related Words in Large Corpora

SMRŽ, Pavel and Pavel RYCHLÝ

Basic information

Original name

Finding Semantically Related Words in Large Corpora

Authors

SMRŽ, Pavel and Pavel RYCHLÝ

Edition

Berlin, Text, Speech and Dialogue, 4th International Conference, TSD 2001, p. 108-115, LNAI 2166, 2001

Publisher

Springer-Verlag

Other information

Language

English

Type of outcome

Stať ve sborníku

Field of Study

20206 Computer hardware and architecture

Country of publisher

Czech Republic

Confidentiality degree

není předmětem státního či obchodního tajemství

References:

RIV identification code

RIV/00216224:14330/01:00004534

Organization unit

Faculty of Informatics

ISBN

3-540-42557-8

Keywords in English

natural language processing; large corpus; semantically related words
Změněno: 2/11/2001 12:11, doc. RNDr. Pavel Smrž, Ph.D.

Abstract

V originále

The paper deals with the linguistic problem of fully automatic grouping of semantically related words. We discuss the measures of semantic relatedness of basic word forms and describe the treatment of collocations. Next we present the procedure of hierarchical clustering of a very large number of semantically related words and give examples of the resulting partitioning of data in the form of dendrogram. Finally we show a form of the output presentation that facilitates the inspection of the resulting word clusters.

Links

MSM 143300003, plan (intention)
Name: Interakce člověka s počítačem, dialogové systémy a asistivní technologie
Investor: Ministry of Education, Youth and Sports of the CR, Human-computer interaction, dialog systems and assistive technologies