JAKUBÍČEK, Miloš and Pavel ŠMERK. Large Scale Keyword Extraction using a Finite State Backend. In Aleš Horák, Pavel Rychlý, Adam Rambousek. Tenth Workshop on Recent Advances in Slavonic Natural Language Processing, RASLAN 2016. Brno: Tribun EU, 2016, p. 143-146. ISBN 978-80-263-1095-2.
Basic information
Original name Large Scale Keyword Extraction using a Finite State Backend
Authors JAKUBÍČEK, Miloš (203 Czech Republic, belonging to the institution) and Pavel ŠMERK (203 Czech Republic, belonging to the institution).
Edition Brno, Tenth Workshop on Recent Advances in Slavonic Natural Language Processing, RASLAN 2016, p. 143-146, 4 pp. 2016.
Publisher Tribun EU
Other information
Original language English
Type of outcome Proceedings paper
Field of Study 10201 Computer sciences, information science, bioinformatics
Country of publisher Czech Republic
Confidentiality degree is not subject to a state or trade secret
Publication form printed version "print"
RIV identification code RIV/00216224:14330/16:00092379
Organization unit Faculty of Informatics
ISBN 978-80-263-1095-2
ISSN 2336-4289
UT WoS 000466886400016
Keywords in English terminology extraction; keyword extraction; fsa; Sketch Engine
Changed by: RNDr. Pavel Šmerk, Ph.D., učo 3880. Changed: 21/5/2021 23:15.
Abstract
We present a novel method for fast keyword extraction from large text corpora using a finite state backend. The FSA3 package has been adopted for this purpose. We outline the basic approach and present a comparison with the previous hash-based method used in Sketch Engine.
Links
7F14047, research and development project
Name: Harvesting big text data for under-resourced languages (Acronym: HaBiT)
Investor: Ministry of Education, Youth and Sports of the CR