
Toolset for Entity and Semantic Associations – Value Release: Deliverable 8.3 of project EuDML

LEE, Mark; Petr SOJKA and Radim ŘEHŮŘEK

Basic information

Original name

Toolset for Entity and Semantic Associations – Value Release: Deliverable 8.3 of project EuDML

Authors

LEE, Mark (826 United Kingdom of Great Britain and Northern Ireland); Petr SOJKA (203 Czech Republic, guarantor, belonging to the institution) and Radim ŘEHŮŘEK (203 Czech Republic, belonging to the institution)

Edition

1.0 as of 31st May 2012. 12 pp. Deliverable D8.3, 2012

Publisher

EU CIP-ICT-PSP project 250503 EuDML: The European Digital Mathematics Library

Other information

Language

English

Type of outcome

Special-purpose publication

Field of Study

10201 Computer sciences, information science, bioinformatics

Country of publisher

United Kingdom of Great Britain and Northern Ireland

Confidentiality degree

is not subject to a state or trade secret

References:

RIV identification code

RIV/00216224:14330/12:00062173

Organization unit

Faculty of Informatics

Keywords in English

The European Digital Mathematics Library; EuDML; author disambiguation; document clustering; gensim; similarity; plagiarism; Yadda

Tags

International impact, Reviewed
Changed: 4/12/2012 16:47, doc. RNDr. Petr Sojka, Ph.D.

Abstract

In the original language

In this document we describe the value release of the toolset for entity and semantic associations, which integrates Unsupervised Document Similarity implemented by MU (using the GENSIM tool) and Citation Indexing and Matching (as provided by ICM and UJF/CMD). We give a brief description of the tools and provide an initial evaluation.

Links

250503, MU internal code
Name: The European Digital Mathematics Library (Acronym: EuDML)
Investor: European Union