Intelligent Search and Replace for Czech Phrases

NEVĚŘILOVÁ, Zuzana a Vít SUCHOMEL. Intelligent Search and Replace for Czech Phrases. In Eighth Workshop on Recent Advances in Slavonic Natural Language Processing. Brno: Tribun EU, 2014, s. 97-105. ISSN 2336-4289.

Další formáty: BibTeX LaTeX RIS

Základní údaje
Originální název	Intelligent Search and Replace for Czech Phrases
Autoři	NEVĚŘILOVÁ, Zuzana (203 Česká republika, garant, domácí) a Vít SUCHOMEL (203 Česká republika, domácí).
Vydání	Brno, Eighth Workshop on Recent Advances in Slavonic Natural Language Processing, od s. 97-105, 9 s. 2014.
Nakladatel	Tribun EU

Další údaje
Originální jazyk	angličtina
Typ výsledku	Stať ve sborníku
Obor	60200 6.2 Languages and Literature
Stát vydavatele	Česká republika
Utajení	není předmětem státního či obchodního tajemství
Forma vydání	tištěná verze "print"
WWW	URL
Kód RIV	RIV/00216224:14330/14:00077518
Organizační jednotka	Fakulta informatiky
ISSN	2336-4289
UT WoS	000374560500013
Klíčová slova anglicky	search and replace; detecting phrases; generating phrases; subject-predicative complement
Příznaky	Mezinárodní význam, Recenzováno
Změnil	Změnil: RNDr. Vít Suchomel, Ph.D., učo 139723. Změněno: 25. 5. 2021 19:20.

Anotace

This work proposes a new improvement of the ‘Search and Replace’ function well known from most text processing software. The standard search and replace function is used to replace exact form of words or phrases by another words or phrases in text documents. It is quite sufficient for languages with minimal inflection such as English. However, a well working word or phrase replacement function for morphologically rich languages requires much more thought. We explore the issues of implementing a useful search and replace in the Czech language and propose solutions to majority of the problems: A syntactic parser is employed to identify the phrases containing the search word or phrase. The correct word forms used as a replacement are generated by a morphological analyser. A web demonstration utilizing the proposed solution is presented. The attached examples of use reveal the cases in which the implemented method works well.

Návaznosti
LM2010013, projekt VaV	Název: LINDAT-CLARIN: Institut pro analýzu, zpracování a distribuci lingvistických dat (Akronym: LINDAT-Clarin)
LM2010013, projekt VaV	Investor: Ministerstvo školství, mládeže a tělovýchovy ČR, Projekt LINDAT-Clarin - Vybudování a provoz českého uzlu pan-evropské infrastruktury pro výzkum
7F14047, projekt VaV	Název: Harvesting big text data for under-resourced languages (Akronym: HaBiT)
7F14047, projekt VaV	Investor: Ministerstvo školství, mládeže a tělovýchovy ČR, Harvesting big text data for under-resourced languages

VytisknoutZobrazeno: 27. 7. 2024 14:18

Intelligent Search and Replace for Czech Phrases

Další aplikace