a 2011

Why TeX math search is more relevant now than ever?

SOJKA, Petr

Basic information

Original title

Why TeX math search is more relevant now than ever?

Title in Czech

Proč je vyhledávání TeXové matematiky relevantnější než kdy předtím?

Authors

SOJKA, Petr (203 Czech Republic, guarantor, internal)

Edition

TUG 2011, Trivandrum, Kerala, India, 2011

Other information

Language

English

Type of outcome

Conference abstract

Field of study

10201 Computer sciences, information science, bioinformatics

Country of publisher

United States

Confidentiality

not subject to a state or trade secret

RIV code

RIV/00216224:14330/11:00054134

Organizational unit

Faculty of Informatics

ISSN

Keywords in Czech

vyhledávání matematiky;indexace matematických formulí; TeXovská notace matematických formulí

Keywords in English

math search; indexing of math; TeX math notation

Flags

International impact, Reviewed
Changed: 27 February 2012, 11:24, doc. RNDr. Petr Sojka, Ph.D.

Abstract

In the original language

TeX is around 30 years old, and was conceived and written before the advent of MathML, not to mention the Internet. At that time, indexing and searching mathematics was just a futuristic idea. When people jumped on the Google bandwagon, it was predicted that old technologies such as TeX mark-up for math would disappear in time (it is not properly used for tokenization and indexing). The advent of the Internet and W3C brought mark-up and global search to the attention of the public. Somehow it was acceptable again. The recent move to semantic search and MathML has brought renewed attention to the need for an unambiguous canonical representation of math in texts. As part of the project of building the European Digital Mathematics Library (http://www.eudml.eu) we have designed and implemented a math search engine, MIaS (http://nlp.fi.muni.cz/projekty/eudml/mias). It currently indexes and searches more than 160,000,000 formulae originally written by authors in TeX in their scientific papers. We will present the system and discuss ways towards a global math search engine based on the TeX math notation.

In Czech

TeXu je 30 let, a byl navržen a implementován před MathML, či Internetem. V té době byla myšlenka indexace a vyhledávání matematiky v oblasti futurologie. When people jumped on the Google bandwagon, it was predicted that old technologies such as TeX mark-up for math would disappear in time (it is not properly used for tokenization and indexing). The advent of the Internet and W3C brought mark-up and global search to the attention of the public. Somehow it was acceptable again. The recent move to semantic search and MathML has brought renewed attention to the need for an unambiguous canonical representation of math in texts. As part of the project of building the European Digital Mathematics Library (http://www.eudml.eu) we have designed and implemented a math search engine, MIaS (http://nlp.fi.muni.cz/projekty/eudml/mias). It currently indexes and searches more than 160,000,000 formulae originally written by authors in TeX in their scientific papers. We will present the system and discuss ways towards a global math search engine based on the TeX math notation.

Links

250503, internal MU code
Name: The European Digital Mathematics Library (Acronym: EuDML)
Investor: European Union, The European Digital Mathematics Library