Další formáty:
BibTeX
LaTeX
RIS
@inproceedings{913650, author = {Němčík, Václav}, address = {Brno}, booktitle = {Proceedings of Recent Advances in Slavonic Natural Language Processing 2010}, keywords = {linguistic resources; corpora; theory; practice}, howpublished = {tištěná verze "print"}, language = {eng}, location = {Brno}, isbn = {978-80-7399-246-0}, pages = {47-51}, publisher = {Masarykova Univerzita}, title = {Utilizing Linguistic Resources: Theory and Practical Experience}, url = {https://nlp.fi.muni.cz/raslan/2010/paper04.pdf}, year = {2010} }
TY - JOUR ID - 913650 AU - Němčík, Václav PY - 2010 TI - Utilizing Linguistic Resources: Theory and Practical Experience PB - Masarykova Univerzita CY - Brno SN - 9788073992460 KW - linguistic resources KW - corpora KW - theory KW - practice UR - https://nlp.fi.muni.cz/raslan/2010/paper04.pdf N2 - The Prague Dependency Treebank (henceforth PDT) is a large collection of texts in Czech. It contains several layers of rich annotation, ranging from morphology to deep syntax. It is unique in its size and theoretical background, especially for a language like Czech, which can be, with regard to the number of its speakers, considered a small language. In this article, we use PDT 2.0 to demonstrate that within real NLP systems, complex annotations may cut both ways. We present several issues that might pose problems when extracting data from PDT, and complex structures in general, and hint on possible solutions. ER -
NĚMČÍK, Václav. Utilizing Linguistic Resources: Theory and Practical Experience. In \textit{Proceedings of Recent Advances in Slavonic Natural Language Processing 2010}. Brno: Masarykova Univerzita, 2010, s.~47-51. ISBN~978-80-7399-246-0.
|