
Towards Digital Mathematics Libraries: Collection and Search (invited talk 7.9.2012, Coimbra University, Math Dept, PT)

SOJKA, Petr

Basic information

Original name

Towards Digital Mathematics Libraries: Collection and Search (invited talk 7.9.2012, Coimbra University, Math Dept, PT)

Name in Czech

Na cestě k digitálním matematickým knihovnám: integrace dat a vyhledávání (zvaná přednáška 7.9.2012, Coimbra University, Math Dept., PT)

Authors

SOJKA, Petr (203 Czech Republic, guarantor, belonging to the institution)

Edition

10a reuniao da Comissao Nacional de Matematica, 2012

Other information

Language

English

Type of outcome

Invited lectures

Field of Study

10201 Computer sciences, information science, bioinformatics

Country of publisher

Czech Republic

Confidentiality degree

is not subject to a state or trade secret

References:

RIV identification code

RIV/00216224:14330/12:00060855

Organization unit

Faculty of Informatics

Keywords (in Czech)

vyhledávání matematických formulí;TeX;DML-CZ;workflow digitalizace;digitální knihovny;pdfjbim;jbig2enc;PDF recompression

Keywords in English

math-aware search;mathematics knowledge management;TeX;DML-CZ;digitization workflow;digital libraries;pdfJbim;jbig2enc;PDF recompression

Tags

International impact
Changed: 13/9/2012 20:22, doc. RNDr. Petr Sojka, Ph.D.

Abstract

In the original

The talk will consist of two parts: Collection and Search. Collection: The DML-CZ and EuDML projects will be described and demoed, and the main lessons learned from them enumerated. Attention will be given to the process of creating and delivering new metadata to a global DML such as EuDML via local repositories like DML-CZ. Search: The recent move to semantic search and MathML has brought renewed attention to the need for an unambiguous canonical representation of math in texts. As part of the project of building the European Digital Mathematics Library (http://www.eudml.eu), we have designed and implemented a math search engine, MIaS (http://nlp.fi.muni.cz/projekty/eudml/mias). It currently indexes and searches more than 160,000,000 formulae originally written by authors in TeX in their scientific papers. We will present the system and discuss ways towards a global math search engine based on TeX math notation.

Links

LA09016, research and development project
Name: Participation of the Czech Republic in the European Research Consortium for Informatics and Mathematics (ERCIM) (Acronym: ERCIM)
Investor: Ministry of Education, Youth and Sports of the CR, Czech Republic membership in the European Research Consortium for Informatics and Mathematics
250503, MU internal code
Name: The European Digital Mathematics Library (Acronym: EuDML)
Investor: European Union