RYCHLÝ, Pavel a Vít SUCHOMEL. Annotated Amharic Corpora. In Petr Sojka, Aleš Horák, Ivan Kopeček, Karel Pala. Text, Speech, and Dialogue 19th International Conference, TSD 2016 Brno, Czech Republic, September 12–16, 2016 Proceedings. Switzerland: Springer International Publishing, 2016, s. 295-302. ISBN 978-3-319-45509-9. Dostupné z: https://dx.doi.org/10.1007/978-3-319-45510-5_34. |
Další formáty:
BibTeX
LaTeX
RIS
@inproceedings{1353390, author = {Rychlý, Pavel and Suchomel, Vít}, address = {Switzerland}, booktitle = {Text, Speech, and Dialogue 19th International Conference, TSD 2016 Brno, Czech Republic, September 12–16, 2016 Proceedings}, doi = {http://dx.doi.org/10.1007/978-3-319-45510-5_34}, editor = {Petr Sojka, Aleš Horák, Ivan Kopeček, Karel Pala}, keywords = {Amharic; text corpus; web corpus; under-resourced language; corpus annotation; morphological tagger}, howpublished = {tištěná verze "print"}, language = {eng}, location = {Switzerland}, isbn = {978-3-319-45509-9}, pages = {295-302}, publisher = {Springer International Publishing}, title = {Annotated Amharic Corpora}, url = {http://link.springer.com/chapter/10.1007/978-3-319-45510-5_34}, year = {2016} }
TY - JOUR ID - 1353390 AU - Rychlý, Pavel - Suchomel, Vít PY - 2016 TI - Annotated Amharic Corpora PB - Springer International Publishing CY - Switzerland SN - 9783319455099 KW - Amharic KW - text corpus KW - web corpus KW - under-resourced language KW - corpus annotation KW - morphological tagger UR - http://link.springer.com/chapter/10.1007/978-3-319-45510-5_34 L2 - http://link.springer.com/chapter/10.1007/978-3-319-45510-5_34 N2 - Amharic is one of under-resourced languages. The paper presents two text corpora. The first one is a substantially cleaned version of existing morphologically annotated WIC Corpus (210,000 words). The second one is the largest Amharic text corpus (17 million words). It was created from Web pages automatically crawled in 2013, 2015 and 2016. It is part-of-speech annotated by a tagger trained and evaluated on the WIC Corpus. ER -
RYCHLÝ, Pavel a Vít SUCHOMEL. Annotated Amharic Corpora. In Petr Sojka, Aleš Horák, Ivan Kopeček, Karel Pala. \textit{Text, Speech, and Dialogue 19th International Conference, TSD 2016 Brno, Czech Republic, September 12–16, 2016 Proceedings}. Switzerland: Springer International Publishing, 2016, s.~295-302. ISBN~978-3-319-45509-9. Dostupné z: https://dx.doi.org/10.1007/978-3-319-45510-5\_{}34.
|