MEDVEĎ, Marek, Miloš JAKUBÍČEK and Vojtěch KOVÁŘ. Towards taggers and parsers for Slovak. In Zygmunt Vetulani & Hans Uszkoreit. Human Language Technologies as a Challenge for Computer Science and Linguistics. Proceedings of the 6th Language and Technology Conference. Poznań, Poland: Fundacja Uniwersytetu im. A. Mickiewicza, 2013, p. 527-530. ISBN 978-83-932640-3-2.
Basic information
Original name Towards taggers and parsers for Slovak
Authors MEDVEĎ, Marek (703 Slovakia, belonging to the institution), Miloš JAKUBÍČEK (203 Czech Republic, guarantor, belonging to the institution) and Vojtěch KOVÁŘ (203 Czech Republic, belonging to the institution).
Edition Poznań, Poland, Human Language Technologies as a Challenge for Computer Science and Linguistics. Proceedings of the 6th Language and Technology Conference, p. 527-530, 4 pp. 2013.
Publisher Fundacja Uniwersytetu im. A. Mickiewicza
Other information
Original language English
Type of outcome Proceedings paper
Field of Study 10201 Computer sciences, information science, bioinformatics
Country of publisher Poland
Confidentiality degree is not subject to a state or trade secret
Publication form printed version "print"
WWW LTC website
RIV identification code RIV/00216224:14330/13:00070329
Organization unit Faculty of Informatics
ISBN 978-83-932640-3-2
Keywords in English Slovak; parsing; tagging; parser adaptation
Tags International impact, Reviewed
Changed by: RNDr. Vojtěch Kovář, Ph.D., učo 139915. Changed: 28/4/2014 10:28.
Abstract
In this paper we present tools for the morphological and syntactic processing of Slovak: a tagging model trained with RFTagger, and two syntactic analyzers, Synt and SET, whose Czech grammars we adapted for Slovak. We describe how RFTagger was trained on the r-mak corpus and the modifications made to both parsers, partly in lexical analysis and mainly in the formal grammars used by both systems. Finally, we evaluate both tagging and parsing, the latter on two datasets: a phrasal treebank and a dependency treebank of Slovak.
Links
LM2010013, research and development project. Name: LINDAT-CLARIN: Institut pro analýzu, zpracování a distribuci lingvistických dat (Acronym: LINDAT-Clarin)
Investor: Ministry of Education, Youth and Sports of the CR