Automatic Identification of Legal Terms in Czech Law Texts

PALA, Karel, Pavel RYCHLÝ and Pavel ŠMERK. Automatic Identification of Legal Terms in Czech Law Texts. In Semantic Processing of Legal Texts. Berlin: Springer, 2010, pp. 83-94. ISBN 978-3-642-12836-3. Available from: https://dx.doi.org/10.1007/978-3-642-12837-0_5.


Basic information
Original title	Automatic Identification of Legal Terms in Czech Law Texts
Title in Czech	Automatická identifikace právních termínů v českých právních textech
Authors	PALA, Karel (203 Czech Republic, guarantor, belonging to the institution), Pavel RYCHLÝ (203 Czech Republic, belonging to the institution) and Pavel ŠMERK (203 Czech Republic, belonging to the institution).
Edition	Berlin, Semantic Processing of Legal Texts, pp. 83-94, 12 pp., 2010.
Publisher	Springer

Other information
Original language	English
Type of outcome	Proceedings paper
Field of study	60200 6.2 Languages and Literature
Country of publisher	Czech Republic
Confidentiality	is not subject to a state or trade secret
Publication form	printed version "print"
Impact factor	0.402 in 2005
RIV code	RIV/00216224:14330/10:00065871
Organization unit	Faculty of Informatics
ISBN	978-3-642-12836-3
ISSN	0302-9743
DOI	http://dx.doi.org/10.1007/978-3-642-12837-0_5
Keywords in English	terminology extraction; natural language processing; legal language
Tags	International impact, Reviewed
Changed by	RNDr. Pavel Šmerk, Ph.D., učo 3880. Changed: 30 April 2014, 04:24.

Abstract

Law texts, including the constitution, acts, public notices and court judgements, form a huge database of texts. As in many narrow domains, the sublanguage used is partially restricted and differs from general Czech. The legal database Lexis, containing approx. 50,000 Czech law documents, was chosen as the starting data collection. Our attention is concentrated mostly on noun groups, which are the main candidates for law terms. We were able to recognize 3992 distinct noun groups of this kind in the selected text samples. The paper also presents results of the morphological analysis, lemmatization, tagging, disambiguation, and basic syntactic analysis of Czech law texts, as these tasks are crucial for any further sophisticated natural language processing. Verbs in legal texts have been explored in a preliminary way as well. In this respect, we try to explore how linguistic analysis can help in identifying the semantic nature of law terms.
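
To make the idea of noun groups as term candidates concrete, here is a minimal sketch in Python. It is not the authors' method: it assumes the text has already been lemmatized and tagged, uses a hypothetical simplified tagset ("A" adjective, "N" noun, "V" verb, "Z" punctuation), and treats any run of adjectives followed by one or more nouns as a candidate. The `Token` type, the `noun_groups` function and the sample sentences with their hand-assigned tags are illustrative assumptions.

```python
from collections import Counter
from typing import Iterable, Iterator, NamedTuple


class Token(NamedTuple):
    word: str
    lemma: str
    pos: str  # hypothetical simplified tag: "A", "N", "V" or "Z"


def noun_groups(tokens: Iterable[Token]) -> Iterator[str]:
    """Yield candidate noun groups (adjectives, then nouns) as lemma strings."""
    group: list[str] = []
    has_noun = False
    for tok in tokens:
        if tok.pos == "N":
            group.append(tok.lemma)
            has_noun = True
        elif tok.pos == "A" and not has_noun:
            group.append(tok.lemma)
        else:
            # Any other token (or an adjective after a noun) closes the group.
            if has_noun:
                yield " ".join(group)
            # An adjective that follows a noun opens the next candidate.
            group = [tok.lemma] if tok.pos == "A" else []
            has_noun = False
    if has_noun:
        yield " ".join(group)


# Hand-tagged sample (assumed tags): "Krajský soud zahájil trestní řízení.
# Trestní řízení skončilo."
sample = [
    Token("Krajský", "krajský", "A"), Token("soud", "soud", "N"),
    Token("zahájil", "zahájit", "V"),
    Token("trestní", "trestní", "A"), Token("řízení", "řízení", "N"),
    Token(".", ".", "Z"),
    Token("Trestní", "trestní", "A"), Token("řízení", "řízení", "N"),
    Token("skončilo", "skončit", "V"), Token(".", ".", "Z"),
]

# Counting by lemma merges inflected variants of the same candidate.
counts = Counter(noun_groups(sample))
print(len(counts), "distinct noun groups:", dict(counts))
```

Joining lemmas is a crude normalization: for Czech it can break adjective-noun agreement, so a real system would likely need to regenerate the agreeing base form of each group.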

Links
GA407/07/0679, research and development project	Name: Právní e-slovník - PES
GA407/07/0679, research and development project	Investor: Czech Science Foundation, Právní e-slovník - PES
LC536, research and development project	Name: Centrum komputační lingvistiky
LC536, research and development project	Investor: Ministry of Education, Youth and Sports of the Czech Republic, Centrum komputační lingvistiky
