PALA, Karel, Pavel RYCHLÝ and Pavel ŠMERK. Morphological Analysis of Law texts. In Petr Sojka, Aleš Horák. First Workshop on Recent Advances in Slavonic Natural Languages Processing, RASLAN 2007. Brno: Masaryk University, 2007, p. 21-26, 7 pp. ISBN 978-80-210-4471-5.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name Morphological Analysis of Law texts
Name in Czech Morfologická analýza právních textů
Authors PALA, Karel (203 Czech Republic, guarantor, belonging to the institution), Pavel RYCHLÝ (203 Czech Republic, belonging to the institution) and Pavel ŠMERK (203 Czech Republic, belonging to the institution).
Edition Brno, First Workshop on Recent Advances in Slavonic Natural Languages Processing, RASLAN 2007, p. 21-26, 7 pp. 2007.
Publisher Masaryk University
Other information
Original language English
Type of outcome Proceedings paper
Field of Study 10201 Computer sciences, information science, bioinformatics
Country of publisher Czech Republic
Confidentiality degree is not subject to a state or trade secret
Publication form printed version "print"
WWW URL RASLAN 2007 Workshop web page
RIV identification code RIV/00216224:14330/07:00020686
Organization unit Faculty of Informatics
ISBN 978-80-210-4471-5
UT WoS 000268015500003
Keywords in English morphological analysis; partial syntactic analysis; noun groups detection
Tags morphological analysis, noun groups detection, partial syntactic analysis
Changed by Changed by: RNDr. Pavel Šmerk, Ph.D., učo 3880. Changed: 7/1/2019 14:05.
Abstract
In the paper we explore the morphology of the Czech law texts including Constitution, acts, public notices and court judgements which form a huge textual database. As many texts from small domains, the used language is partially restricted and in relevant aspects also different from general Czech. The paper presents first results of the morphological analysis of Czech law texts and their conversion to the specific formats. Partly, the partial syntactic analysis has been performed as well.
Abstract (in Czech)
V článku jsou popsány první výsledky ze zpracování rozsáhlého korpusu právnických textů zahrnujících Ústavu, zákony, vyhlášky a judikaturu. Stejně jako u mnoha jiných domén je těmito dokumenty používaný jazyk určitým způsobem omezený a zároveň odlišný od běžné češtiny. Na vzorku dat byla provedena jak morfologická analýza a desambiguace, tak i částečná syntaktická analýza, orientovaná zejména na detekci jmenných skupin, tedy právních termínů.
Links
GA407/07/0679, research and development projectName: Právní e-slovník - PES
Investor: Czech Science Foundation, Legal e-dictionary - PES
LC536, research and development projectName: Centrum komputační lingvistiky
Investor: Ministry of Education, Youth and Sports of the CR, Centrum komputační lingvistiky
PrintDisplayed: 21/8/2024 07:52