u 2013

Toolset for Entity and Semantic Associations – Final Release: Deliverable 8.4 of project EuDML

LEE, Mark, Petr SOJKA, Radim ŘEHŮŘEK, Radim HATLAPATKA, Maroš KUCBEL et. al.

Basic information

Original name

Toolset for Entity and Semantic Associations – Final Release: Deliverable 8.4 of project EuDML

Authors

LEE, Mark (826 United Kingdom of Great Britain and Northern Ireland), Petr SOJKA (203 Czech Republic, guarantor, belonging to the institution), Radim ŘEHŮŘEK (203 Czech Republic, belonging to the institution), Radim HATLAPATKA (203 Czech Republic, belonging to the institution), Maroš KUCBEL (203 Czech Republic, belonging to the institution), Thierry BOUCHE (250 France), Claude GOUTORBE (250 France), Romeo ANGHELACHE (250 France) and Krzysztof WOJCIECHOWSKI (616 Poland)

Edition

1.0 as of 8th February 2013. 13 pp. Deliverable D8.4, 2013

Publisher

EU CIP-ICT-PSP project 250503 EuDML: The European Digital Mathematics Library

Other information

Language

English

Type of outcome

Účelové publikace

Field of Study

10201 Computer sciences, information science, bioinformatics

Country of publisher

United Kingdom of Great Britain and Northern Ireland

Confidentiality degree

není předmětem státního či obchodního tajemství

References:

RIV identification code

RIV/00216224:14330/13:00068102

Organization unit

Faculty of Informatics

Keywords in English

The European Digital Mathematics Library; EuDML;

Tags

International impact, Reviewed
Změněno: 28/4/2014 06:26, RNDr. Pavel Šmerk, Ph.D.

Abstract

V originále

In this document we describe the final release of the toolset for entity and semantic associations, integrating two versions (language dependent and language independent) of Unsupervised Document Similarity implemented by MU (using the gensim tool) and Citation Indexing, Resolution and Matching (UJF/CMD). We give a brief description of tools, the rationale behind decisions made, and provide elementary evaluation. Tools are integrated in the main project result, EuDML website, and they deliver the needed functionality for exploratory searching and browsing the collected documents. EuDML users and content providers thus benefit from millions of algorithmically generated similarity and citation links, developed using state of the art machine learning and matching methods.

Links

LG13010, research and development project
Name: Zastoupení ČR v European Research Consortium for Informatics and Mathematics (Acronym: ERCIM-CZ)
Investor: Ministry of Education, Youth and Sports of the CR
250503, interní kód MU
Name: The European Digital Mathematics Library (Acronym: EuDML)
Investor: European Union

Files attached