SRDANOVIĆ, Irena, Naomi IDA, Chikako SHIGEMORI BUČAR, Adam KILGARRIFF and Vojtěch KOVÁŘ. Japanese Word Sketches: Advances and Problems. Acta Linguistica Asiatica. Ljubljana: University of Ljubljana, 2011, 1/2011, No 2, p. 63-82. ISSN 2232-3317.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name Japanese Word Sketches: Advances and Problems
Name in Czech Word Sketches pro japonštinu: Pozitiva a problémy
Authors SRDANOVIĆ, Irena (705 Slovenia), Naomi IDA (392 Japan), Chikako SHIGEMORI BUČAR (705 Slovenia), Adam KILGARRIFF (826 United Kingdom of Great Britain and Northern Ireland) and Vojtěch KOVÁŘ (203 Czech Republic, guarantor, belonging to the institution).
Edition Acta Linguistica Asiatica, Ljubljana, University of Ljubljana, 2011, 2232-3317.
Other information
Original language English
Type of outcome Article in a journal
Field of Study 60200 6.2 Languages and Literature
Country of publisher Slovenia
Confidentiality degree is not subject to a state or trade secret
WWW URL
RIV identification code RIV/00216224:14330/11:00053629
Organization unit Faculty of Informatics
Keywords (in Czech) word sketches;vyhodnocení;japonština
Keywords in English word sketches;evaluation;japanese
Tags International impact, Reviewed
Changed by Changed by: RNDr. Vojtěch Kovář, Ph.D., učo 139915. Changed: 25/11/2011 13:10.
Abstract
In this paper, we present results of an evaluation of Japanese word sketches and address in detail issues that were observed by the evaluators. A word sketch presents a list of salient collocates of a word, organized by the grammatical relations holding between the word and its collocate. The word sketch functionality is incorporated into the Sketch Engine corpus query system and has been created for more than twenty languages so far, including Japanese. The issues that have been discovered in the evaluation of word sketches in Japanese are to be addressed for further enhancement of the word sketch functionality. Other tools and resources which are combined for use and influence the performance of the word sketches should also be looked over. We divide the issues into the following: 1) the lemmatizer and tagger in use, 2) the sketch grammar that is specifically written for Japanese, and 3) the corpus and statistical methods.
Abstract (in Czech)
Článek preentuje výsledky evaluace aplikace word sketches na japonštinu. Word sketches prezentují seznam důvěryhodných kolokací slova organizovaných podle gramatických relací, na základě korpusu japonštiny. Evaluace je rozdělena do následujících fází: 1) použitý lemmatizér a značkovač, 2) "sketch grammar" -- syntaktická pravidla pro extrakci kolokací a 3) korpus a statistické metody.
Links
LC536, research and development projectName: Centrum komputační lingvistiky
Investor: Ministry of Education, Youth and Sports of the CR, Centrum komputační lingvistiky
248307, interní kód MUName: Pattern Recognition-based Statistically Enhanced MT (Acronym: PRESEMT)
Investor: European Union, Pattern Recognition-based Statistically Enhanced MT, Cooperation
PrintDisplayed: 30/8/2024 16:24