D 2016

Large Scale Keyword Extraction using a Finite State Backend

JAKUBÍČEK, Miloš and Pavel ŠMERK

Basic information

Original name

Large Scale Keyword Extraction using a Finite State Backend

Authors

JAKUBÍČEK, Miloš (203 Czech Republic, belonging to the institution) and Pavel ŠMERK (203 Czech Republic, belonging to the institution)

Edition

Brno, Tenth Workshop on Recent Advances in Slavonic Natural Language Processing, RASLAN 2016, p. 143-146, 4 pp. 2016

Publisher

Tribun EU

Other information

Language

English

Type of outcome

Stať ve sborníku

Field of Study

10201 Computer sciences, information science, bioinformatics

Country of publisher

Czech Republic

Confidentiality degree

není předmětem státního či obchodního tajemství

Publication form

printed version "print"

References:

URL

RIV identification code

RIV/00216224:14330/16:00092379

Organization unit

Faculty of Informatics

ISBN

978-80-263-1095-2

ISSN

UT WoS

000466886400016

Keywords in English

terminology extraction; keyword extraction; fsa; Sketch Engine
Změněno: 21/5/2021 23:15, RNDr. Pavel Šmerk, Ph.D.

Abstract

V originále

We present a novel method for performing fast keyword extraction from large text corpora using a finite state backend. The FSA3 package has been adopted for this purposes. We outline the basic approach and present a comparison with previous hash-based method as used in Sketch Engine.

Links

7F14047, research and development project
Name: Harvesting big text data for under-resourced languages (Acronym: HaBiT)
Investor: Ministry of Education, Youth and Sports of the CR
Displayed: 15/11/2024 04:10