u 2015

Building the Ultimate Math Search Engine

LÍŠKA, Martin

Základní údaje

Originální název

Building the Ultimate Math Search Engine

Autoři

LÍŠKA, Martin
školitel diplomové práce.

Vydání

Brno, 79 s. 2015

Nakladatel

Masarykova univerzita

Další údaje

Jazyk

angličtina

Typ výsledku

Účelové publikace

Obor

10201 Computer sciences, information science, bioinformatics

Stát vydavatele

Česká republika

Utajení

není předmětem státního či obchodního tajemství

Označené pro přenos do RIV

Ano

Kód RIV

RIV/00216224:14330/15:00083290

Organizační jednotka

Fakulta informatiky

Klíčová slova anglicky

mathematics information retrieva;MIR;MIaS;WebMIaS;evaluation;effectiveness;query expansion;semantics
Změněno: 1. 7. 2015 16:55, RNDr. Martin Líška

Anotace

V originále

Mathematics information retrieval (MIR) is a domain specific branch of Information Retrieval. MIR aims at searching information in documents with significant amount of mathematical content in the form of expressions and formulae. Based on the newly established international MIR evaluation forum and on the number of MIR related research groups around the world, it is definitely on the rise. In this work I have summarized and compared different approaches to math-aware search systems. More detailed description of Math Indexer and Searcher (MIaS) was provided as this is our system created at Faculty of Informatics, Masaryk University, primarily designed and developed by me. MIaS is currently reported as the best performing MIR system in terms of effectiveness. In this work I proposed several topics which are main research interests of my studies. The topics correlate with possible features that can improve the effectiveness of MIR systems. Namely, the proposed topics are math formula substree unification, integration of algebraic computational power into the indexing as well as searching phase, query expansion as a way of increasing recall, query variables, combination of more approaches within one system and a utilization of combination of text and math search. One topic that spans over all other topics is evaluation which is a necessity in a process of continuous improvement of effectiveness.

Návaznosti

250503, interní kód MU
Název: The European Digital Mathematics Library (Akronym: EuDML)
Investor: Evropská unie, The European Digital Mathematics Library