Další formáty:
BibTeX
LaTeX
RIS
@inproceedings{1658596, author = {Jakubíček, Miloš and Kovář, Vojtěch and Rychlý, Pavel and Suchomel, Vít}, address = {Marseille, France}, booktitle = {Proceedings of the 12th Web as Corpus Workshop}, editor = {Adrien Barbaresi, Felix Bildhauer, Roland Schafer and Egon Stemle}, keywords = {Web corpora; corpus building}, howpublished = {elektronická verze "online"}, language = {eng}, location = {Marseille, France}, isbn = {979-10-95546-68-9}, pages = {1-4}, publisher = {European Language Resources Association}, title = {Current Challenges in Web Corpus Building}, url = {https://www.aclweb.org/anthology/2020.wac-1.1}, year = {2020} }
TY - JOUR ID - 1658596 AU - Jakubíček, Miloš - Kovář, Vojtěch - Rychlý, Pavel - Suchomel, Vít PY - 2020 TI - Current Challenges in Web Corpus Building PB - European Language Resources Association CY - Marseille, France SN - 9791095546689 KW - Web corpora KW - corpus building UR - https://www.aclweb.org/anthology/2020.wac-1.1 N2 - In this paper we discuss some of the current challenges in web corpus building that we faced in the recent years when expanding the corpora in Sketch Engine. The purpose of the paper is to provide an overview and raise discussion on possible solutions, rather than bringing ready solutions to the readers. For every issue we try to assess its severity and briefly discuss possible mitigation options. ER -
JAKUBÍČEK, Miloš, Vojtěch KOVÁŘ, Pavel RYCHLÝ a Vít SUCHOMEL. Current Challenges in Web Corpus Building. Online. In Adrien Barbaresi, Felix Bildhauer, Roland Schafer and Egon Stemle. \textit{Proceedings of the 12th Web as Corpus Workshop}. Marseille, France: European Language Resources Association, 2020, s.~1-4. ISBN~979-10-95546-68-9.
|