D
2016
Large Scale Keyword Extraction using a Finite State Backend
JAKUBÍČEK, Miloš and Pavel ŠMERK
Basic information
Original name
Large Scale Keyword Extraction using a Finite State Backend
Edition
Brno, Tenth Workshop on Recent Advances in Slavonic Natural Language Processing, RASLAN 2016, p. 143-146, 4 pp. 2016
Other information
Type of outcome
Stať ve sborníku
Field of Study
10201 Computer sciences, information science, bioinformatics
Country of publisher
Czech Republic
Confidentiality degree
není předmětem státního či obchodního tajemství
Publication form
printed version "print"
RIV identification code
RIV/00216224:14330/16:00092379
Organization unit
Faculty of Informatics
Keywords in English
terminology extraction; keyword extraction; fsa; Sketch Engine
V originále
We present a novel method for performing fast keyword extraction from large text corpora using a finite state backend. The FSA3 package has been adopted for this purposes. We outline the basic approach and present a comparison with previous hash-based method as used in Sketch Engine.
Links
7F14047, research and development project | Name: Harvesting big text data for under-resourced languages (Acronym: HaBiT) | Investor: Ministry of Education, Youth and Sports of the CR |
|
Displayed: 15/11/2024 04:10