Detailed Information on Publication Record
2021
When Word Pairs Matter - Analysis of the English-Slovak Evaluation Dataset
DENISOVÁ, Michaela and Pavel RYCHLÝ
Basic information
Original name
When Word Pairs Matter - Analysis of the English-Slovak Evaluation Dataset
Authors
DENISOVÁ, Michaela (703 Slovakia, guarantor, belonging to the institution) and Pavel RYCHLÝ (203 Czech Republic, belonging to the institution)
Edition
Brno, Proceedings of the Fifteenth Workshop on Recent Advances in Slavonic Natural Languages Processing, RASLAN 2021, p. 141-149, 9 pp. 2021
Publisher
Tribun EU
Other information
Language
English
Type of outcome
Article in proceedings
Field of Study
10200 1.2 Computer and information sciences
Country of publisher
Czech Republic
Confidentiality degree
is not subject to a state or trade secret
Publication form
printed version "print"
References:
RIV identification code
RIV/00216224:14330/21:00123252
Organization unit
Faculty of Informatics
ISBN
978-80-263-1670-1
ISSN
Keywords in English
Cross-lingual word embeddings; Ground truth dictionary; Evaluation; English; Slovak
Changed: 15/5/2024 09:28, RNDr. Pavel Šmerk, Ph.D.
Abstract
In the original
Cross-lingual word embeddings facilitate the transfer of lexical knowledge across languages, and they are mainly used for finding translation equivalents. Translation equivalents obtained in this way are usually evaluated with the help of ground truth dictionaries. However, the evaluation process, including the ground truth dictionaries, differs from model to model, impeding the correct interpretation of the results. Therefore, in this paper, we provide a thorough analysis of the English-Slovak ground truth dictionary and employ our analysis in evaluating two cross-lingual word embedding models. We show that word pairs choice is an important factor when accurately reflecting the model’s performance.
Links
LM2018101, research and development project