RYCHLÝ, Pavel and Gezahegn Tsegaye LEMMA. An Update of the Manually Annotated Amharic Corpus. In Aleš Horák, Pavel Rychlý and Adam Rambousek. Proceedings of the Twelfth Workshop on Recent Advances in Slavonic Natural Languages Processing, RASLAN 2018. Brno: Tribun EU, 2018, p. 124-128. ISBN 978-80-263-1517-9.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name An Update of the Manually Annotated Amharic Corpus
Authors RYCHLÝ, Pavel (203 Czech Republic, guarantor, belonging to the institution) and Gezahegn Tsegaye LEMMA (231 Ethiopia).
Edition Brno, Proceedings of the Twelfth Workshop on Recent Advances in Slavonic Natural Languages Processing, RASLAN 2018, p. 124-128, 5 pp. 2018.
Publisher Tribun EU
Other information
Original language English
Type of outcome Proceedings paper
Field of Study 10200 1.2 Computer and information sciences
Country of publisher Czech Republic
Confidentiality degree is not subject to a state or trade secret
Publication form printed version "print"
WWW URL
RIV identification code RIV/00216224:14330/18:00101542
Organization unit Faculty of Informatics
ISBN 978-80-263-1517-9
ISSN 2336-4289
UT WoS 000612420300015
Keywords in English text corpus; Amharic corpus; part-of-speech tagging
Changed by Changed by: Mgr. Michal Petr, učo 65024. Changed: 16/5/2022 15:45.
Abstract
The paper describes an update of the manually annotated Amharic corpus WIC 2.0. It lists the problems of the previous version of the corpus and shows that even small changes in the corpus annotation could lead to a higher quality of trained part-of-speech taggers.
Links
GA18-23891S, research and development projectName: Hyperintensionální usuzování nad texty přirozeného jazyka
Investor: Czech Science Foundation
LM2015071, research and development projectName: Jazyková výzkumná infrastruktura v České republice (Acronym: LINDAT-Clarin)
Investor: Ministry of Education, Youth and Sports of the CR
PrintDisplayed: 8/9/2024 21:16