ŠMERK, Pavel. Tools for Fast Morphological Analysis Based on Finite State Automata. In Eighth Workshop on Recent Advances in Slavonic Natural Language Processing. Brno: Tribun EU, 2014, p. 147-150. ISSN 2336-4289.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name Tools for Fast Morphological Analysis Based on Finite State Automata
Authors ŠMERK, Pavel (203 Czech Republic, guarantor, belonging to the institution).
Edition Brno, Eighth Workshop on Recent Advances in Slavonic Natural Language Processing, p. 147-150, 4 pp. 2014.
Publisher Tribun EU
Other information
Original language English
Type of outcome Proceedings paper
Field of Study 10201 Computer sciences, information science, bioinformatics
Country of publisher Czech Republic
Confidentiality degree is not subject to a state or trade secret
Publication form printed version "print"
WWW URL
RIV identification code RIV/00216224:14330/14:00077522
Organization unit Faculty of Informatics
ISSN 2336-4289
Keywords in English morphological analysis; minimal deterministic finite state automata
Tags International impact, Reviewed
Changed by Changed by: RNDr. Pavel Šmerk, Ph.D., učo 3880. Changed: 21/5/2021 23:13.
Abstract
The paper presents a new implementation of some of Jan Daciuk’s algorithms and tools for morphological analysis based on finite state automata. In particular, we offer a reimplemented version of the tool which builds the automata from an input set of strings and of the tool which performs the morphological analysis itself. In addition to 8-bit versions we also offer “Unicode-aware” versions with the Unicode characters encoded directly in the arcs of the automaton. The new implementation is faster than the original one and its code is much more simple and straightforward.
Links
LM2010013, research and development projectName: LINDAT-CLARIN: Institut pro analýzu, zpracování a distribuci lingvistických dat (Acronym: LINDAT-Clarin)
Investor: Ministry of Education, Youth and Sports of the CR
PrintDisplayed: 26/7/2024 00:24