Další formáty:
BibTeX
LaTeX
RIS
@inproceedings{1069994, author = {Pala, Karel and Rychlý, Pavel}, address = {Praha}, edition = {první}, editor = {F. Čermák}, keywords = {corpora, corpus tools}, language = {eng}, location = {Praha}, isbn = {978-80-7422-114-9}, pages = {33-39}, publisher = {Nakladatelství Lidové Noviny}, title = {Do we need very large corpora?}, year = {2011} }
TY - JOUR ID - 1069994 AU - Pala, Karel - Rychlý, Pavel PY - 2011 TI - Do we need very large corpora? PB - Nakladatelství Lidové Noviny CY - Praha SN - 9788074221149 KW - corpora, corpus tools N2 - In the paper we are dealing with building very large corpora from Web. First, we discuss motivation and needs for this kind of resources both for linguists, lexicographers, and NLP specialists. Second, we mention the techniques used for building large (more than billion tokens) corpora and present the results obtained at NLP Centre FI MU, i.e. both tools and corpora. Then we pay attention to the analysis of the consequences following from building large text data resources and the ways in which they are used in corpus linguistics and various NLP applications. ER -
PALA, Karel a Pavel RYCHLÝ. \textit{Do we need very large corpora?}. první. Praha: Nakladatelství Lidové Noviny, 2011, s.~33-39, 379 s. ISBN~978-80-7422-114-9.
|