LÍŠKA, Martin. Building the Ultimate Math Search Engine. Diploma thesis supervisor. Brno: Masarykova univerzita, 2015, 79 pp.
Basic information
Original title Building the Ultimate Math Search Engine
Authors LÍŠKA, Martin (703 Slovakia, guarantor, belonging to the institution).
Diploma thesis supervisor.
Edition Brno, 79 pp. 2015.
Publisher Masarykova univerzita
Other information
Original language English
Type of outcome Special-purpose publication
Field of study 10201 Computer sciences, information science, bioinformatics
Country of publisher Czech Republic
Confidentiality degree Not subject to a state or trade secret
WWW Thesis archive
RIV identification code RIV/00216224:14330/15:00083290
Organizational unit Fakulta informatiky
Keywords in English mathematics information retrieval; MIR; MIaS; WebMIaS; evaluation; effectiveness; query expansion; semantics
Changed by: RNDr. Martin Líška, učo 255768. Changed: 1. 7. 2015 16:55.
Abstract
Mathematics information retrieval (MIR) is a domain-specific branch of information retrieval. MIR aims at searching for information in documents with a significant amount of mathematical content in the form of expressions and formulae. Judging by the newly established international MIR evaluation forum and the number of MIR-related research groups around the world, the field is on the rise. In this work I summarize and compare different approaches to math-aware search systems. I describe Math Indexer and Searcher (MIaS) in more detail, as it is our system, created at the Faculty of Informatics, Masaryk University, and primarily designed and developed by me. MIaS is currently reported as the best-performing MIR system in terms of effectiveness. I also propose several topics that are the main research interests of my studies. The topics correspond to possible features that can improve the effectiveness of MIR systems: math formula subtree unification, integration of algebraic computational power into both the indexing and the searching phase, query expansion as a way of increasing recall, query variables, the combination of several approaches within one system, and the combined use of text and math search. One topic that spans all the others is evaluation, which is a necessity in the process of continuously improving effectiveness.
Links
250503, internal MU code
Name: The European Digital Mathematics Library (Acronym: EuDML)
Investor: European Union, The European Digital Mathematics Library