NOVOTNÁ, Tereza, Jakub HARAŠTA and Jakub KÓL. Topic Modelling of the Czech Supreme Court Decisions. In Cyberspace 2020. 2020.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name Topic Modelling of the Czech Supreme Court Decisions
Authors NOVOTNÁ, Tereza, Jakub HARAŠTA and Jakub KÓL.
Edition Cyberspace 2020, 2020.
Other information
Original language English
Type of outcome Presentations at conferences
Field of Study 50501 Law
Country of publisher Czech Republic
Confidentiality degree is not subject to a state or trade secret
Organization unit Faculty of Law
Keywords (in Czech) topic modelling; latent Dirichlet allocation; non-negative matrix factorization; court decisions; coherence score
Keywords in English modelování těmat; latentní Dirichletova alokace; nezáporná maticová faktorizace; soudní rozhodnutí; koherenční skóre
Tags International impact
Changed by Changed by: Mgr. Tereza Novotná, učo 421694. Changed: 27. 1. 2021 14:03.
Czech Supreme Court produces several thousands of court decisions per year. The Supreme court decisions are published almost unprocessed in the full-text with minimal fundamental metadata (date of the decision, docket number). This fact makes a case law research very time-consuming. Therefore, new automatic methods of processing court decisions need to be developed in order to improve ways how to retrieve more relevant case law efficiently. Topic modelling methods have the potential to cluster a large number of documents automatically or to provide new categories of relevant metadata to these documents. In this paper, two topic modelling methods - latent Dirichlet allocation and non-negative matrix factorization are applied to the corpus of Czech Supreme Court decisions. Several models for methods are trained and compared according to their coherence scores in order to find the best number of topics. Further manual qualitative analysis of the most coherent models is performed by authors.
MUNI/A/1454/2019, interní kód MUName: Automatické zpracování soudních rozhodnutí: experiment s uživateli
Investor: Masaryk University, Category A
PrintDisplayed: 27. 3. 2023 18:55