SOJKA, Petr, Vít NOVOTNÝ, Eniafe Festus AYETIRAN, Dávid LUPTÁK and Michal ŠTEFÁNIK. Quo Vadis, Math Information Retrieval. In Aleš Horák and Pavel Rychlý and Adam Rambousek. Proceedings of the Thirteenth Workshop on Recent Advances in Slavonic Natural Language Processing, RASLAN 2019. Brno: Tribun EU, 2019, p. 117-128. ISBN 978-80-263-1517-9.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name Quo Vadis, Math Information Retrieval
Authors SOJKA, Petr (203 Czech Republic, guarantor, belonging to the institution), Vít NOVOTNÝ (203 Czech Republic, belonging to the institution), Eniafe Festus AYETIRAN (566 Nigeria, belonging to the institution), Dávid LUPTÁK (703 Slovakia, belonging to the institution) and Michal ŠTEFÁNIK (703 Slovakia, belonging to the institution).
Edition Brno, Proceedings of the Thirteenth Workshop on Recent Advances in Slavonic Natural Language Processing, RASLAN 2019, p. 117-128, 12 pp. 2019.
Publisher Tribun EU
Other information
Original language English
Type of outcome Proceedings paper
Field of Study 10201 Computer sciences, information science, bioinformatics
Country of publisher Czech Republic
Confidentiality degree is not subject to a state or trade secret
Publication form printed version "print"
WWW full paper Domovská stránka workshopu
RIV identification code RIV/00216224:14330/19:00111500
Organization unit Faculty of Informatics
ISBN 978-80-263-1517-9
ISSN 2336-4289
UT WoS 000604899800014
Keywords (in Czech) matematické získávání znalostí; zodpovídání dotazů; STEM; digitální matematické knihovny; embeddingy; MIaS; MIaSNG; DML
Keywords in English math information retrieval; question answering; STEM; digital mathematical libraries; embeddings; MIaS; MIaSNG; DML
Tags digital mathematical libraries, information retrieval, math indexing and retrieval, math information retrieval, MathML, MIaS, similarity search, soft cosine measure
Tags International impact
Changed by Changed by: RNDr. Vít Starý Novotný, Ph.D., učo 409729. Changed: 3/1/2023 13:53.
Abstract
With the exponential growth of information in the digital form, information retrieval and querying digital libraries is of paramount importance, and mathematical and technical STEM documents are not an exception. The key for precise searching is the adequate and unambiguous representation of documents, paragraphs, sentences and words, which we are going to evaluate. We are presenting a roadmap to tackle the problem of searching and question answering in the digital mathematical libraries, and discuss the pros and cons of promising approaches primarily for the key part, namely the document representation: several types of embeddings, topic mixtures and LSTM. The listed representation learning options will be evaluated at the next ARQMath evaluation lab of CLEF 2020 conference.
Links
MUNI/A/1145/2018, interní kód MUName: Aplikovaný výzkum na FI: softwarové architektury kritických infrastruktur, bezpečnost počítačových systémů, techniky pro zpracování a vizualizaci velkých dat a rozšířená realita.
Investor: Masaryk University, Critical Infrastructure Software Architectures, Computer Systems Security, Data Processing and Visualization Techniques, and Augmented Reality, Category A
PrintDisplayed: 25/4/2024 14:26