Citation Data of Czech Apex Courts (preprint)

HARAŠTA, Jakub, Tereza NOVOTNÁ a Jaromír ŠAVELKA. Citation Data of Czech Apex Courts (preprint). arXiv. arXiv:2002.02224, 2020, 7 s.

Další formáty: BibTeX LaTeX RIS

Základní údaje
Originální název	Citation Data of Czech Apex Courts (preprint)
Autoři	HARAŠTA, Jakub, Tereza NOVOTNÁ a Jaromír ŠAVELKA.
Vydání	arXiv, arXiv:2002.02224, 2020.

Další údaje
Originální jazyk	angličtina
Typ výsledku	Článek v odborném periodiku (nerecenzovaný)
Obor	50500 5.5 Law
Stát vydavatele	Spojené státy
Utajení	není předmětem státního či obchodního tajemství
WWW	Text (arXiv.org) Dataset (GitHub)
Organizační jednotka	Právnická fakulta
Klíčová slova anglicky	reference recognition; reference extraction; document segmentation; NLP pipeline; citation data; Supreme Court; Supreme Administrative Court; Constitutional Court; Czech Republic
Příznaky	Mezinárodní význam
Změnil	Změnil: JUDr. Mgr. Jakub Harašta, Ph.D., učo 323070. Změněno: 18. 12. 2020 06:51.

Anotace

In this paper, we introduce the citation data of the Czech apex courts (Supreme Court, Supreme Administrative Court and Constitutional Court). This dataset was automatically extracted from the corpus of texts of Czech court decisions - CzCDC 1.0. We obtained the citation data by building the natural language processing pipeline for extraction of the court decision identifiers. The pipeline included the (i) document segmentation model and the (ii) reference recognition model. Furthermore, the dataset was manually processed to achieve high-quality citation data as a base for subsequent qualitative and quantitative analyses. The dataset is available to the general public at GitHub.

Návaznosti
GA17-20645S, projekt VaV	Název: Exaktní hodnocení aplikační relevance judikatury
GA17-20645S, projekt VaV	Investor: Grantová agentura ČR, Exaktní hodnocení aplikační relevance judikatury

VytisknoutZobrazeno: 13. 5. 2024 13:21

Citation Data of Czech Apex Courts (preprint)

Další aplikace