LÍŠKA, Martin. Vyhledávání v matematickém textu (Searching Mathematical Texts). Petr Sojka (bachelor's thesis supervisor). Brno: Masarykova univerzita, 2010, 35 pp.
Basic information | |
---|---|
Original name | Vyhledávání v matematickém textu |
Name in Czech | Vyhledávání v matematickém textu |
Name (in English) | Searching Mathematical Texts |
Authors | LÍŠKA, Martin (703 Slovakia, guarantor, belonging to the institution). Petr Sojka (bachelor's thesis supervisor). |
Edition | Brno, 35 pp. 2010. |
Publisher | Masarykova univerzita |
Other information | |
---|---|
Original language | Slovak |
Type of outcome | Special-purpose publication |
Field of Study | 10201 Computer sciences, information science, bioinformatics |
Country of publisher | Czech Republic |
Confidentiality degree | is not subject to a state or trade secret |
WWW | Thesis archive |
RIV identification code | RIV/00216224:14330/10:00058915 |
Organization unit | Faculty of Informatics |
Keywords (in Czech) | formula; vyhľadávanie; indexácia; MathML; Lucene; tokenizácia |
Keywords in English | formula; searching; indexing; MathML; Lucene; tokenization |
Tags | formula, indexing, Lucene, MathML, searching, tokenization |
Changed by | doc. RNDr. Petr Sojka, Ph.D., učo 2378. Changed: 25/1/2013 13:57. |
Abstract |
---|
Práca sa zaoberá problematikou vyhľadávania v matematických textoch. Rozoberá niekoľko existujúcich riešení vyhľadávania matematiky a z tohto sa snaží si odniesť dôležité poznatky použité pri návrhu vlastného riešenia. Ten obsahuje idey a zdôvodnenia navrhnutých súčastí riešiacich vyhľadávanie matematiky, ako vhodná tokenizácia, úpravy a hodnotenie formúl. Časť venovaná implementácii tohto návrhu popisuje, ako bolo dosiahnuté konečné riešenie za použitia indexovacieho jadra Lucene. V závere dochádza k zhodnoteniu projektu a návrhom na ďalší vývoj. |
Abstract (in English) |
---|
The thesis deals with the problem of searching in mathematical texts. It analyzes several existing approaches to mathematics-aware search and draws from them the observations used in designing its own solution. The design presents the ideas and rationale behind the components that handle mathematical search, such as suitable tokenization, modification, and ranking of formulae. The implementation part describes how the final solution was achieved using the Lucene indexing core. The thesis closes with an evaluation of the project and proposals for further development. |
Links | |
---|---|
LA09016, research and development project | Name: Participation of the Czech Republic in the European Research Consortium for Informatics and Mathematics (ERCIM) (Acronym: ERCIM) |
Investor: Ministry of Education, Youth and Sports of the CR, Czech Republic membership in the European Research Consortium for Informatics and Mathematics | |
250503, internal MU code | Name: The European Digital Mathematics Library (Acronym: EuDML) |
Investor: European Union |