Other formats:
BibTeX
LaTeX
RIS
@misc{957129, author = {Sojka, Petr}, booktitle = {Informatics Colloquium}, keywords = {digital library; math search;math retrieval;indexing of mathematics;metadata handling; EuDML; semantics of mathematical documents; knowledge management; digitization; MathML; portalsystems; repositories of knowledge; DMLCZ}, language = {eng}, title = {The Art of Mathematics Retrieval (invited talk at Informatics Colloquium FI MU, 8.11.2011)}, url = {http://www.fi.muni.cz/usr/sojka/presentations/sojkafimucolloquiumpres2011.pdf}, year = {2011} }
TY  SLIDE ID  957129 AU  Sojka, Petr PY  2011 TI  The Art of Mathematics Retrieval (invited talk at Informatics Colloquium FI MU, 8.11.2011) KW  digital library KW  math search;math retrieval;indexing of mathematics;metadata handling KW  EuDML KW  semantics of mathematical documents KW  knowledge management KW  digitization KW  MathML KW  portalsystems KW  repositories of knowledge KW  DMLCZ UR  http://www.fi.muni.cz/usr/sojka/presentations/sojkafimucolloquiumpres2011.pdf N2  The design and architecture of MIaS (Math Indexer and Searcher), a~system for mathematics retrieval is presented, and design decisions are discussed. We argue for an approach based on Presentation MathML using a~similarity of math subformulae. The system was implemented as a~mathaware search engine based on the stateoftheart system Apache Lucene and is used in The European Digital Mathematics Library  EuDML. Scalability issues were checked against more than 400,000 arXiv documents with 158 million mathematical formulae. Almost three billion MathML subformulae were indexed using a~Solrcompatible Lucene. ER 
SOJKA, Petr. \textit{The Art of Mathematics Retrieval (invited talk at Informatics Colloquium FI MU, 8.11.2011)}. In \textit{Informatics Colloquium}. 2011.
