Masaryk University

Publication Records

    2023

    1. NOVOTNÝ, Vít, Kristýna LUGER, Michal ŠTEFÁNIK, Tereza VRABCOVÁ and Aleš HORÁK. People and Places of Historical Europe: Bootstrapping Annotation Pipeline and a New Corpus of Named Entities in Late Medieval Texts. Online. In Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki. Findings of the Association for Computational Linguistics: ACL 2023. Toronto, Canada: Association for Computational Linguistics, 2023, p. 14104-14113. ISBN 978-1-959429-62-3.
      Name (in English): People and Places of Historical Europe: Bootstrapping Annotation Pipeline and a New Corpus of Named Entities in Late Medieval Texts
      RIV/00216224:14330/23:00130934 Proceedings paper. English. United States of America.
      Novotný, Vít (203 Czech Republic, guarantor, belonging to the institution) -- Luger, Kristýna (203 Czech Republic, belonging to the institution) -- Štefánik, Michal (703 Slovakia, belonging to the institution) -- Vrabcová, Tereza (203 Czech Republic, belonging to the institution) -- Horák, Aleš (203 Czech Republic, belonging to the institution)
      Keywords in English: natural language processing; nlp; historical documents; optical character recognition; ocr; named entity recognition; ner; czech; german; latin
      Type of proceedings: post-proceedings
      International impact: yes
      Reviewed: yes

      Changed by: RNDr. Pavel Šmerk, Ph.D., učo 3880. Changed: 7/4/2024 23:02.

    2020

    1. NOVOTNÝ, Vít. When Tesseract Does It Alone: Optical Character Recognition of Medieval Texts. In Aleš Horák, Pavel Rychlý, Adam Rambousek. Proceedings of the Fourteenth Workshop on Recent Advances in Slavonic Natural Language Processing, RASLAN 2020. Brno: Tribun EU, 2020, p. 3-12. ISBN 978-80-263-1600-8.
      RIV/00216224:14330/20:00117104 Proceedings paper. English. Czech Republic.
      Novotný, Vít (203 Czech Republic, guarantor, belonging to the institution)
      Keywords in English: Optical character recognition; OCR; Historical texts
      Type of proceedings: pre-proceedings
      International impact: yes

      Changed by: Mgr. Michal Petr, učo 65024. Changed: 16/5/2022 15:06.

    2008

    1. SOJKA, Petr, Radovan PANÁK and Tomáš MUDRÁK. DML-CZ OCR of mathematical texts. 2008.
      Name in Czech: DML-CZ OCR matematických textů
      Name (in English): DML-CZ OCR of mathematical texts
      RIV/00216224:14330/08:00024487 Pilot plant, certified technology, variety, breed. Use of computers, robotics and its application. English. Czech Republic.
      Sojka, Petr (203 Czech Republic, guarantor) -- Panák, Radovan (703 Slovakia) -- Mudrák, Tomáš (203 Czech Republic)
      Keywords in English: OCR; Optical Character Recognition; DML-CZ; digitization; Digital mathematics library project; ABBYY FineReader; FineReader SDK; InftyReader
      International impact: yes

      Changed by: doc. RNDr. Petr Sojka, Ph.D., učo 2378. Changed: 22/6/2009 12:19.

    2006

    1. SOJKA, Petr. Towards Digital Mathematical Library: Optical Character Recognition of Mathematical Texts. In ŠTULLER, Julius and Zdenka LINKOVÁ. Inteligentní modely, algoritmy a nástroje pro vytváření sémantického webu. 1st ed. Praha: Ústav informatiky AV ČR, 2006, p. 110-113. ISBN 80-903298-7-X.
      Name in Czech: Budování digitální matematické knihovny: OCR matematických textů
      RIV/00216224:14330/06:00015510 Proceedings paper. Documentation, librarianship, work with information. English. Czech Republic.
      Sojka, Petr (203 Czech Republic, guarantor)
      Keywords in English: OCR; Optical Character Recognition; DML-CZ; digitization; Digital mathematics library project
      Type of proceedings: post-proceedings
      International impact: yes

      Changed by: doc. RNDr. Petr Sojka, Ph.D., učo 2378. Changed: 7/6/2009 01:05.
Displayed: 6/10/2024 17:18