D 2004

The Sketch Engine

KILGARRIFF, Adam, Pavel RYCHLÝ, Pavel SMRŽ and David TUGWELL

Basic information

Original name

The Sketch Engine

Name in Czech

Sketch Engine

Authors

KILGARRIFF, Adam (826 United Kingdom of Great Britain and Northern Ireland), Pavel RYCHLÝ (203 Czech Republic, guarantor), Pavel SMRŽ (203 Czech Republic) and David TUGWELL (826 United Kingdom of Great Britain and Northern Ireland)

Edition

Lorient, France, Proceedings of the Eleventh EURALEX International Congress, p. 105-116, 12 pp. 2004

Publisher

Universite de Bretagne-Sud

Other information

Language

English

Type of outcome

Stať ve sborníku

Field of Study

10201 Computer sciences, information science, bioinformatics

Country of publisher

France

Confidentiality degree

není předmětem státního či obchodního tajemství

References:

RIV identification code

RIV/00216224:14330/04:00010460

Organization unit

Faculty of Informatics

ISBN

2952245703

Keywords in English

corpora; corpus management; statistics; word sketches
Změněno: 18/1/2005 11:22, doc. RNDr. Pavel Smrž, Ph.D.

Abstract

V originále

Word sketches are one-page automatic, corpus-based summaries of a word s grammatical and collocational behaviour. They were first used in the production of the Macmillan English Dictionary and were presented at Euralex 2002. At that point, they only existed for English. Now, we have developed the Sketch Engine, a corpus tool which takes as input a corpus of any language and a corresponding grammar patterns and which generates word sketches for the words of that language. It also generates a thesaurus and sketch differences , which specify similarities and differences between near-synonyms. We briefly present a case study investigating applicability of the Sketch Engine to free wordorder languages. The results show that word sketches could facilitate lexicographic work in Czech as they have for English.

In Czech

Sketch engine je korpusový nástroj, který bere jako vstup korpus libovolného jazyka a příslušné gramatické vzory a generuje jednostránkové charakteristiky gramatických a kolokačních vlastností zadaných slov. Produkt je demonstrován na češtině a angličtině.

Links

MSM 143300003, plan (intention)
Name: Interakce člověka s počítačem, dialogové systémy a asistivní technologie
Investor: Ministry of Education, Youth and Sports of the CR, Human-computer interaction, dialog systems and assistive technologies