SOJKA, Petr. Towards Digital Mathematics Libraries: Collection and Search (invited talk 7.9.2012, Coimbra University, Math Dept, PT). In 10a reuniao da Comissao Nacional de Matematica. 2012.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name Towards Digital Mathematics Libraries: Collection and Search (invited talk 7.9.2012, Coimbra University, Math Dept, PT)
Name in Czech Na cestě k digitáním matematickým knihovnám: integrace dat a vyhledávání (zvaná přednáška 7.9.2012, Coimbra University, Math Dept.,PT)
Authors SOJKA, Petr (203 Czech Republic, guarantor, belonging to the institution).
Edition 10a reuniao da Comissao Nacional de Matematica, 2012.
Other information
Original language English
Type of outcome requested lectures
Field of Study 10201 Computer sciences, information science, bioinformatics
Country of publisher Czech Republic
Confidentiality degree is not subject to a state or trade secret
WWW slides
RIV identification code RIV/00216224:14330/12:00060855
Organization unit Faculty of Informatics
Keywords (in Czech) vyhledávání matematických formulí;TeX;DML-CZ;workflow digitalizace;digitalni knihovny;pdfjbim;jbig2enc;RDF recompression
Keywords in English math-aware search;mathematics knowledge management;TeX;DML-CZ;digitization workflow;digital libraries;pdfJbim;big2enc;PDF recompression
Tags International impact
Changed by Changed by: doc. RNDr. Petr Sojka, Ph.D., učo 2378. Changed: 13/9/2012 20:22.
The talk will consist of two parts: Collection and Search. Collection: Projects DML-CZ and EuDML will be described and demoed, and the main lessons from them enumerated. Attention will be given to the process of creation and delivery of new metadata to global DML as EuDML via local repositories like DML-CZ. Search: The recent move to the semantic search and MathML has brought renewed attention to the need of unambiguous canonical math representation in texts. As part of the project of building the European Digital Mathematics Library ( we have designed and implemented a math search engine, MIaS ( It currently indexes and searches more than 160,000,000 formulae originally written by authors in TeX in their scientific papers. We will present the system and will discuss the ways towards a global math search engine based on the TeX math notation.
LA09016, research and development projectName: Účast ČR v European Research Consortium for Informatics and Mathematics (ERCIM) (Acronym: ERCIM)
Investor: Ministry of Education, Youth and Sports of the CR, INGO
250503, internal MU codeName: The European Digital Mathematics Library (Acronym: EuDML)
Investor: European Union, Competitiveness and inovation framework programme
PrintDisplayed: 29/3/2020 12:12