D 2007

Derivational Relations in Czech WordNet

PALA, Karel and Dana HLAVÁČKOVÁ

Basic information

Original name

Derivational Relations in Czech WordNet

Name in Czech

Derivační vztahy v českém WordNetu

Authors

PALA, Karel (203 Czech Republic, guarantor) and Dana HLAVÁČKOVÁ (203 Czech Republic)

Edition

1. vyd. Praha, Proceedings of the Workshop on Balto-Slavonic Natural Language Processing, p. 75-81, 6 pp. 2007

Publisher

Universita Karlova, ÚFAL MFF UK

Other information

Language

English

Type of outcome

Stať ve sborníku

Field of Study

10201 Computer sciences, information science, bioinformatics

Country of publisher

Czech Republic

Confidentiality degree

není předmětem státního či obchodního tajemství

RIV identification code

RIV/00216224:14330/07:00019440

Organization unit

Faculty of Informatics

ISBN

978-1-932432-88-6

Keywords in English

derivational relations; WordNet; semantic networks; computer processing

Tags

International impact, Reviewed
Změněno: 17/12/2007 19:52, Mgr. Dana Hlaváčková, Ph.D.

Abstract

V originále

In the paper we describe enriching Czech WordNet with the derivational relations that in highly inflectional languages like Czech form typical derivational nests (or subnets). Derivational relations are mostly of semantic nature and their regularity in Czech allows us to add them to the WordNet almost automatically. For this purpose we have used the derivational version of morphological analyzer Ajka that is able to handle the basic and most productive derivational relations in Czech. Using a special derivational interface developed in our NLP Lab we have explored the semantic nature of the selected noun derivational suffixes and established a set of the semantically labeled derivational relations, presently 14. We have added them to the Czech WordNet and in this way enriched it with approx. 30 000 new Czech synsets. A similar enrichment for Princeton WordNet has been reported in its recently released version 3.0, we will comment on the partial similarities and differences.

In Czech

Formální popis slovotvorných vztahů v češtině, stanovení hlavních typů a jejich zpracování na slovníku 126 000 českých substantivních kmenů. Propojení s literály v českém Wordnetu

Links

LC536, research and development project
Name: Centrum komputační lingvistiky
Investor: Ministry of Education, Youth and Sports of the CR, Centrum komputační lingvistiky
1ET100300414, research and development project
Name: Inteligentní metody pro zvýšení spolehlivosti elektrických sítí
Investor: Academy of Sciences of the Czech Republic, Intelligentmethods for incresing of reliability of electrical networks
1ET208050401, research and development project
Name: E-learning v kontextu sémantického webu
Investor: Academy of Sciences of the Czech Republic, E-learning in the Semantic Web Context