Improving Coverage of Translation Memories with Language
Modelling

BAISA, Vít, Josef BUŠTA a Aleš HORÁK. Improving Coverage of Translation Memories with Language Modelling. In Eighth Workshop on Recent Advances in Slavonic Natural Language Processing. Brno: Tribun EU. s. 27-34. ISSN 2336-4289. 2014.

Další formáty: BibTeX LaTeX RIS

Základní údaje
Originální název	Improving Coverage of Translation Memories with Language Modelling
Autoři	BAISA, Vít (203 Česká republika, domácí), Josef BUŠTA (203 Česká republika, domácí) a Aleš HORÁK (203 Česká republika, garant, domácí).
Vydání	Brno, Eighth Workshop on Recent Advances in Slavonic Natural Language Processing, od s. 27-34, 8 s. 2014.
Nakladatel	Tribun EU

Další údaje
Originální jazyk	angličtina
Typ výsledku	Stať ve sborníku
Obor	10201 Computer sciences, information science, bioinformatics
Stát vydavatele	Česká republika
Utajení	není předmětem státního či obchodního tajemství
Forma vydání	tištěná verze "print"
WWW	URL
Kód RIV	RIV/00216224:14330/14:00077510
Organizační jednotka	Fakulta informatiky
ISSN	2336-4289
UT WoS	000374560500004
Klíčová slova anglicky	translation memory; CAT; segment; subsegment leveraging; partial translation; Moses; GIZA++; word matrix; METEOR; MemoQ; language model
Příznaky	Mezinárodní význam, Recenzováno
Změnil	Změnil: Mgr. et Mgr. Vít Baisa, Ph.D., učo 139654. Změněno: 27. 5. 2021 09:12.

Anotace

In this paper, we describe and evaluate current improvements to methods for enlarging translation memories. In comparison with the previous results in 2013, we have achieved improvement in coverage by almost 35 percentage points on the same test data. The basic subsegment splitting of the translation pairs is done using Moses and (M)GIZA++ tools, which provide the subsegment translation probabilities. The obtained phrases are then combined with subsegment combination techniques and filtered by large target language models.

Návaznosti
ED3.1.00/10.0216, projekt VaV	Název: Transfer technologií na Masarykově univerzitě

VytisknoutZobrazeno: 20. 4. 2024 06:08

Improving Coverage of Translation Memories with Language Modelling

Další aplikace