LÍŠKA, Martin and Petr SOJKA. Math Indexer and Searcher. 2011.
Basic information
Original name Math Indexer and Searcher
Name in Czech Indexátor a vyhledávač matematiky
Authors LÍŠKA, Martin (703 Slovakia, belonging to the institution) and Petr SOJKA (203 Czech Republic, guarantor, belonging to the institution).
Edition 2011.
Other information
Original language English
Type of outcome Software
Field of Study 10201 Computer sciences, information science, bioinformatics
Country of publisher Czech Republic
Confidentiality degree is not subject to a state or trade secret
WWW project web page
RIV identification code RIV/00216224:14330/11:00053973
Organization unit Faculty of Informatics
Keywords (in Czech) indexování a vyhledávání matematiky; matematické digitální knihovny; informační systémy; vyhledávání; vyhledávání matematického obsahu; MIaS; WebMIaS
Keywords in English math indexing and retrieval; mathematical digital libraries; information systems; information retrieval; mathematical content search; document ranking of mathematical papers; math text mining; MIaS; WebMIaS
Technical parameters Petr Sojka, FI MU Brno, Botanická 68a, 60200 Brno, CZ, tel. +420549496966
Tags EuDML, mathematical information retrieval, MIaS, WebMIaS
Tags International impact
Changed by doc. RNDr. Petr Sojka, Ph.D., učo 2378. Changed: 10/5/2013 12:47.
Abstract
MIaS is a math-aware search engine based on full-text indexing that enables users to search for mathematical formulae inside documents. The search engine is unique because it can index and search structural information, such as the representation of mathematical formulae. No other software or IR system is able to store three billion formulae in its index and search it with a response time below one second. MIaS processes documents containing mathematical notation in MathML format. The system is built as an extension to any full-text indexing engine and has been verified on the state-of-the-art Lucene core. It is scalable: it has been verified to index almost the whole of arxiv.org (about 500,000 papers), containing more than 160,000,000 formulae. The software is used in EuDML (eudml.org) and other digital libraries. For more details, see the papers in peer-reviewed conferences:
[1] Sojka, Petr; Líška, Martin. In Matthew R. B. Hardy, Frank Wm. Tompa. Proceedings of the 2011 ACM Symposium on Document Engineering. Mountain View, CA, USA: ACM, 2011. pp. 57-60.
[2] Sojka, Petr; Líška, Martin. In J. H. Davenport, W. M. Farmer, J. Urban, F. Rabe. Intelligent Computer Mathematics, LNCS 6824. Springer, 2011. pp. 228-243.
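The idea of indexing the structure of MathML formulae in a full-text style index can be illustrated with a small sketch. This is not the MIaS implementation and does not use Lucene; it assumes, purely for illustration, that every subtree of a formula is turned into one string token and stored in an in-memory inverted index. The function names (`serialize`, `formula_tokens`, `build_index`, `search`) and the token format are hypothetical.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

MATHML_NS = "{http://www.w3.org/1998/Math/MathML}"


def local_name(tag):
    """Drop the MathML namespace prefix from an element tag, if present."""
    return tag[len(MATHML_NS):] if tag.startswith(MATHML_NS) else tag


def serialize(node):
    """Turn a MathML subtree into a compact string, e.g. mrow(mi:a,mo:+,mi:b)."""
    name = local_name(node.tag)
    children = list(node)
    if not children:
        return f"{name}:{(node.text or '').strip()}"
    return f"{name}({','.join(serialize(child) for child in children)})"


def formula_tokens(mathml):
    """Return one token per subtree of the formula, the whole formula included."""
    root = ET.fromstring(mathml)
    return [serialize(node) for node in root.iter()]


def build_index(documents):
    """Map each formula token to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, formulae in documents.items():
        for mathml in formulae:
            for token in formula_tokens(mathml):
                index[token].add(doc_id)
    return index


def search(index, query_mathml):
    """Return ids of documents containing the query as a formula or subformula."""
    node = ET.fromstring(query_mathml)
    # Skip the outer <math> wrapper so the query can match nested subformulae.
    while local_name(node.tag) == "math" and len(node) == 1:
        node = node[0]
    return set(index.get(serialize(node), set()))


NS = 'xmlns="http://www.w3.org/1998/Math/MathML"'
documents = {
    # x = a + b
    "doc1": [f"<math {NS}><mrow><mi>x</mi><mo>=</mo>"
             f"<mrow><mi>a</mi><mo>+</mo><mi>b</mi></mrow></mrow></math>"],
    # a - b
    "doc2": [f"<math {NS}><mrow><mi>a</mi><mo>-</mo><mi>b</mi></mrow></math>"],
}
index = build_index(documents)
query = f"<math {NS}><mrow><mi>a</mi><mo>+</mo><mi>b</mi></mrow></math>"
print(sorted(search(index, query)))  # ['doc1']
```

Because each subformula becomes an ordinary string token, an existing full-text engine could in principle store and look up such tokens in the same way as words. A real system would also need normalization and ranking of partial matches, which this sketch omits.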
Links
LA09016, research and development project. Name: Participation of the Czech Republic in the European Research Consortium for Informatics and Mathematics (ERCIM) (Acronym: ERCIM)
Investor: Ministry of Education, Youth and Sports of the CR, Czech Republic membership in the European Research Consortium for Informatics and Mathematics
MUNI/A/0057/2011, MU internal code. Name: Strengthening the involvement of Faculty of Informatics students in the international scientific community (Acronym: SKONF)
Investor: Masaryk University, Category A
250503, MU internal code. Name: The European Digital Mathematics Library (Acronym: EuDML)
Investor: European Union