SHERWANI, Moiz Khan, Petr SOJKA and Francesco CALIMERI. Semantic Similarities between Locations based on Ontology. In Aleš Horák, Pavel Rychlý, Adam Rambousek. Proceedings of the Eleventh Workshop on Recent Advances in Slavonic Natural Language Processing, RASLAN 2017. Brno: Tribun EU, 2017, p. 85-94. ISBN 978-80-263-1340-3.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name Semantic Similarities between Locations based on Ontology
Authors SHERWANI, Moiz Khan (586 Pakistan, belonging to the institution), Petr SOJKA (203 Czech Republic, guarantor, belonging to the institution) and Francesco CALIMERI (380 Italy).
Edition Brno, Proceedings of the Eleventh Workshop on Recent Advances in Slavonic Natural Language Processing, RASLAN 2017, p. 85-94, 10 pp. 2017.
Publisher Tribun EU
Other information
Original language English
Type of outcome Proceedings paper
Field of Study 10201 Computer sciences, information science, bioinformatics
Country of publisher Czech Republic
Confidentiality degree is not subject to a state or trade secret
Publication form printed version "print"
WWW full paper Domovská stránka workshopu
RIV identification code RIV/00216224:14330/17:00094377
Organization unit Faculty of Informatics
ISBN 978-80-263-1340-3
ISSN 2336-4289
UT WoS 000426613500010
Keywords (in Czech) zjednoznačnění toponym; geonames; vyhledávání geografických názvů v textu; ontologie geografických názvů; podobnost toponym
Keywords in English toponym disambiguation; geonames; geographic text retrieval; ontology based geoname relations; toponym similarity
Tags International impact, Reviewed
Changed by Changed by: RNDr. Pavel Šmerk, Ph.D., učo 3880. Changed: 4/4/2018 17:05.
Abstract
Toponym disambiguation or location names resolution is a critical task in unstructured text, articles or documents. Our research explores how to link ambiguous locations mentioned in documents, news and articles with latitude/longitude coordinates. We designed an evaluation system for toponym disambiguation based on annotated GEOCLEF data. We implemented a node-based approach taking population into account and a geographic distance-based approach. We have proposed new approach based on edges between the pairs of toponyms in ontology, taking also population attribute into account. Our edge-based approach gave better results than population and distance-based only approaches. The results could be used in any information system dealing with texts containing geographic locations, such as news texts.
Links
TD03000295, research and development projectName: Inteligentní software pro sémantické hledání dokumentů (Acronym: ISSHD)
Investor: Technology Agency of the Czech Republic
PrintDisplayed: 19/9/2024 11:30