Další formáty:
BibTeX
LaTeX
RIS
@inproceedings{991762, author = {Sojka, Petr}, address = {Maui, USA}, booktitle = {Proceedings of ESAIR 2012}, doi = {http://dx.doi.org/10.1145/2390148.2390157}, editor = {Jaap Kamps, Jussi Karlgren, Peter Mika, Vanessa Murdock}, keywords = {MIaS;MathML;indexing;search;canonical MathML;EuDML;digital libraries;information systems;information retrieval;mathematical content search;math indexing and retrieval;document ranking of math papers;text mining;DML-CZ;DML projects;semantics}, howpublished = {paměťový nosič}, language = {eng}, location = {Maui, USA}, isbn = {978-1-4503-1717-7}, note = {ESAIR 2012 (c/o CIKM 2012)}, pages = {15-16}, publisher = {ACM}, title = {Exploiting Semantic Annotations in Math Information Retrieval}, url = {http://www.fi.muni.cz/usr/sojka/posters/sojka-esair2012.pdf}, year = {2012} }
TY - JOUR ID - 991762 AU - Sojka, Petr PY - 2012 TI - Exploiting Semantic Annotations in Math Information Retrieval PB - ACM CY - Maui, USA SN - 9781450317177 N1 - ESAIR 2012 (c/o CIKM 2012) KW - MIaS;MathML;indexing;search;canonical MathML;EuDML;digital libraries;information systems;information retrieval;mathematical content search;math indexing and retrieval;document ranking of math papers;text mining;DML-CZ;DML projects;semantics UR - http://www.fi.muni.cz/usr/sojka/posters/sojka-esair2012.pdf L2 - http://dx.doi.org/10.1145/2390148.2390157 N2 - This paper describes exploitation of semantic annotations in the design and architecture of MIaS (Math Indexer and Searcher) system for mathematics retrieval. Basing on the claim that navigational and research search are `killer' applications for digital library such as the European Digital Mathematics Library, EuDML, we argue for an approach based on Natural Language Processing techniques as used in corpus management systems such as the Sketch Engine, that will reach web scalability and avoid inference problems. The main ideas are 1) to augment surface texts (including math formulae) with additional linked representations (maps) bearing semantic information (expanded formulae as text, canonicalized text and subformulae) for indexing, including support for indexing structural information (expressed as Content MathML or other tree structures) and 2) use semantic user preferences to order found documents. The semantic enhancements of the MIaS system are being implemented as a math-aware search engine based on the state-of-the-art system Apache Lucene, with support for [MathML] tree indexing. Scalability issues have been checked against more than 400,000 arXiv documents. ER -
SOJKA, Petr. Exploiting Semantic Annotations in Math Information Retrieval. In Jaap Kamps, Jussi Karlgren, Peter Mika, Vanessa Murdock. \textit{Proceedings of ESAIR 2012}. Maui, USA: ACM, 2012, s.~15-16. ISBN~978-1-4503-1717-7. Dostupné z: https://dx.doi.org/10.1145/2390148.2390157.
|