Semi-automatic Theme-Rheme Identification

D 2013

PALA, Karel and Ondřej SVOBODA

Basic information

Original title

Semi-automatic Theme-Rheme Identification

Authors

PALA, Karel and Ondřej SVOBODA

Edition

Brno, Seventh Workshop on Recent Advances in Slavonic Natural Language Processing, RASLAN 2013, pp. 39-48, 10 pp. 2013

Publisher

Tribun EU

Other information

Language

English

Type of outcome

Proceedings paper

Field of study

10201 Computer sciences, information science, bioinformatics

Country of publisher

Czech Republic

Confidentiality

is not subject to a state or trade secret

Publication form

printed version "print"

Marked for transfer to RIV

Yes

RIV code

RIV/00216224:14330/13:00070352

Organizational unit

Faculty of Informatics

ISBN

978-80-263-0520-0

Keywords in English

theme-rheme; Functional Sentence Perspective; topic-focus articulation

Tags

International impact, Reviewed

Changed: 2 December 2013 15:19, Mgr. Lucia Kocincová

Abstract

In the original

In this paper we start from the theory of Functional Sentence Perspective, developed primarily by Firbas [1], Svoboda [2], and also Sgall and Hajičová [3], and attempt to formulate a procedure for semi-automatically recognizing which sentence constituents carry information that is contextually dependent and thus known to the addressee (theme), which contain new information (rheme), and which bear information that is neither thematic nor rhematic (transition). Once themes and rhemes are recognized as reliably as possible, we also hope to investigate thematic progression (thematic line) in texts in the future. The paper describes the core of the procedure and its experimental implementation for Czech, using the bushbank corpus CBB.Blog [4] as a data source. Because the task is complicated, we offer only a basic evaluation, which, in our view, shows that the task is feasible.

Links

LM2010013, research and development project

Name: LINDAT-CLARIN: Institute for Analysis, Processing and Distribution of Linguistic Data (Acronym: LINDAT-Clarin)

Investor: Ministry of Education, Youth and Sports of the Czech Republic, Project LINDAT-Clarin - Establishment and operation of the Czech node of the pan-European infrastructure for research

Publication overview