Masarykova univerzita

Výpis publikací

česky | in English

Filtrování publikací

    2018

  1. SUCHOMEL, Vít. csTenTen17, a Recent Czech Web Corpus. In Twelveth Workshop on Recent Advances in Slavonic Natural Language Processing. Brno: Tribun EU, 2018. s. 111-123, 13 s.
  2. 2017

  3. KALLAS, Jelena, Vít SUCHOMEL a Maria KHOKHLOVA. Automated Identification of Domain Preferences of Collocations. In Iztok Kosem et al.. Electronic Lexicography in the 21st Century. Proceedings of Elex 2017 Conference. Brno, Czech Republic: Lexical Computing CZ s.r.o., 2017. s. 309-320, 12 s. ISSN 2533-5626.
  4. SUCHOMEL, Vít. Removing spam from web corpora through supervised learning using FastText. Birmingham, 2017.
  5. 2016

  6. RYCHLÝ, Pavel a Vít SUCHOMEL. Annotated Amharic Corpora. In Petr Sojka, Aleš Horák, Ivan Kopeček, Karel Pala. Text, Speech, and Dialogue 19th International Conference, TSD 2016 Brno, Czech Republic, September 12–16, 2016 Proceedings. Switzerland: Springer International Publishing, 2016. s. 295-302, 8 s. ISBN 978-3-319-45509-9. doi:10.1007/978-3-319-45510-5_34.
  7. HERMAN, Ondřej, Vít SUCHOMEL, Vít BAISA a Pavel RYCHLÝ. DSL Shared task 2016: Perfect Is The Enemy of Good Language Discrimination Through Expectation-Maximization and Chunk-based Language Model. In Preslav Nakov, Marcos Zampieri, Liling Tan, Nikola Ljubešić, Jörg Tiedemann, Shervin Malmasi. Proceedings of the Third Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial3). Osaka: Association for Natural Language Processing (ANLP), Osaka, Japan, 2016. s. 114-118, 5 s. ISBN 978-4-87974-716-7.
  8. SUCHOMEL, Vít a Pavel RYCHLÝ. Set of Ethiopian Web Corpora. 2016.
  9. FIŠER, Darja, Vít SUCHOMEL a Miloš JAKUBÍČEK. Terminology Extraction for Academic Slovene Using Sketch Engine. In Aleš Horák, Pavel Rychlý, Adam Rambousek. Tenth Workshop on Recent Advances in Slavonic Natural Language Processing, RASLAN 2016. Brno: Tribun EU, 2016. s. 135-141, 7 s. ISBN 978-80-263-1095-2.
  10. 2015

  11. BAISA, Vít a Vít SUCHOMEL. Corpus Based Extraction of Hypernyms in Terminological Thesaurus for Land Surveying Domain. In Ninth Workshop on Recent Advances in Slavonic Natural Language Processing. Brno: Tribun EU, 2015. s. 69-74, 6 s. ISBN 978-80-263-0974-1.
  12. BAISA, Vít, Vít SUCHOMEL, adam KILGARRIFF a Miloš JAKUBÍČEK. Sketch Engine for English Language Learning. In Corpus Linguistics 2015. 2015.
  13. RAMBOUSEK, Adam, Vít BAISA, Vít SUCHOMEL, Aleš HORÁK a Lucia KOCINCOVÁ. Terminologický tezaurus pro obor zeměměřictví a katastru nemovitostí: Certifikovaná metodika. 2015.
  14. BAISA, Vít a Vít SUCHOMEL. Turkic Language Support in Sketch Engine. In Proceedings of the international conference "Turkic Languages processing: TurkLang 2015". Kazan: Academy of Sciences of the Republic of Tatarstan Press, 2015. s. 214-223, 10 s. ISBN 978-5-9690-0262-3.
  15. 2014

  16. ARTS, Tressy, Yonatan BELINKOV, Nizar HABASH, Adam KILGARRIFF a Vít SUCHOMEL. arTenTen: Arabic Corpus and Word Sketches. Journal of King Saud University-Computer and Information Sciences, Elsevier, 2014, roč. 2014, č. 26, s. 381-395. ISSN 1319-1578. doi:10.1016/j.jksuci.2014.06.009.
  17. KILGARRIFF, Adam, Miloš JAKUBÍČEK, Vojtěch KOVÁŘ, Pavel RYCHLÝ a Vít SUCHOMEL. Finding Terms in Corpora for Many Languages with the Sketch Engine. In Proceedings of the Demonstrations at the 14th Conferencethe European Chapter of the Association for Computational Linguistics. Gothenburg, Sweden: The Association for Computational Linguistics, 2014. s. 53-56, 4 s. ISBN 978-1-937284-75-6.
  18. BOJAR, Ondřej, Vojtěch DIATKA, Pavel RYCHLÝ, Pavel STRAŇÁK, Vít SUCHOMEL, Aleš TAMCHYNA a Daniel ZEMAN. HindEnCorp – Hindi-English and Hindi-only Corpus for Machine Translation. In Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Hrafn Loftsson and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis. Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14). Reykjavik, Iceland: European Language Resources Association (ELRA), 2014. s. 3550-3555, 6 s. ISBN 978-2-9517408-8-4.
  19. NEVĚŘILOVÁ, Zuzana a Vít SUCHOMEL. Intelligent Search and Replace for Czech Phrases. In Eighth Workshop on Recent Advances in Slavonic Natural Language Processing. Brno: Tribun EU, 2014. s. 97-105, 9 s. ISSN 2336-4289.
  20. HORÁK, Aleš, Adam RAMBOUSEK, Vít SUCHOMEL a Lucia KOCINCOVÁ. Semiautomatic Building and Extension of Terminological Thesaurus for Land Surveying Domain. In Aleš Horák, Pavel Rychlý. Eighth Workshop on Recent Advances in Slavonic Natural Language Processing. Brno: Tribun EU, 2014. s. 129-137, 9 s. ISSN 2336-4289.
  21. BAISA, Vít a Vít SUCHOMEL. SkELL: Web Interface for English Language Learning. In Eighth Workshop on Recent Advances in Slavonic Natural Language Processing. Brno: Tribun EU, 2014. s. 63-70, 8 s. ISSN 2336-4289.
  22. SUCHOMEL, Vít, Jan MICHELFEIT a Jan POMIKÁLEK. Text Tokenisation Using unitok. In Eighth Workshop on Recent Advances in Slavonic Natural Language Processing. Brno: Tribun EU, 2014. s. 71-75, 5 s. ISSN 2336-4289.
  23. KILGARRIFF, Adam, Vít BAISA, Jan BUŠTA, Miloš JAKUBÍČEK, Vojtěch KOVÁŘ, Jan MICHELFEIT, Pavel RYCHLÝ a Vít SUCHOMEL. The Sketch Engine: ten years on. Lexicography, Springer Berlin Heidelberg, 2014, roč. 1, č. 1, s. 7-36. ISSN 2197-4292. doi:10.1007/s40607-014-0009-9.
  24. 2013

  25. SRDANOVIĆ, Irena, Vít SUCHOMEL, Adam KILGARRIFF a Toshinobu OGISO. 百億語のコーパスを用いた日本語の語彙・文法情報のプロファイリング. 2013. s. 229-238, 10 s.
  26. BELINKOV, Yonatan, Nizar HABASH, Adam KILGARRIFF, Noam ORDAN, Ryan ROTH a Vít SUCHOMEL. arTenTen: a new, vast corpus for Arabic. In Eric Atwell and Andrew Hardie. Proceedings of WACL’2 Second Workshop on Arabic Corpus Linguistics. 2013. s. 20.
  27. BAISA, Vít a Vít SUCHOMEL. Intrinsic Methods for Comparison of Corpora. In A. Horák, P. Rychlý. RASLAN 2013 Recent Advances in Slavonic Natural Language Processing. první. Brno: Tribun EU, 2013. s. 51-58, 8 s. ISBN 978-80-263-0520-0.
  28. JAKUBÍČEK, Miloš, Adam KILGARRIFF, Vojtěch KOVÁŘ, Pavel RYCHLÝ a Vít SUCHOMEL. The TenTen Corpus Family. In 7th International Corpus Linguistics Conference CL 2013. Lancaster, 2013. s. 125-127, 3 s.
  29. KILGARRIFF, Adam a Vít SUCHOMEL. Web Spam. In Stefan Evert , Egon Stemle, Paul Rayson. Proceedings of the 8th Web as Corpus Workshop (WAC-8) @Corpus Linguistics 2013. 2013. s. 46-52, 7 s.
  30. 2012

  31. BAISA, Vít a Vít SUCHOMEL. Detecting Spam in Web Corpora. In Aleš Horák, Pavel Rychlý. 6th Workshop on Recent Advances in Slavonic Natural Language Processing. Brno: Tribun EU, 2012. s. 69-76, 8 s. ISBN 978-80-263-0313-8.
  32. SUCHOMEL, Vít a Jan POMIKÁLEK. Efficient Web Crawling for Large Text Corpora. In Adam Kilgarriff, Serge Sharoff. Proceedings of the seventh Web as Corpus Workshop (WAC7). Lyon, 2012. s. 39-43, 5 s.
  33. BAISA, Vít a Vít SUCHOMEL. Large Corpora for Turkic Languages and Unsupervised Morphological Analysis. In Seniz Demir, Ilknur Durgar El-Kahlout, Mehmet Ugur Dogan. Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12). Istanbul, Turkey: European Language Resources Association (ELRA), 2012. s. 28-32, 5 s. ISBN 978-2-9517408-7-7.
  34. DOVUDOV, Gulshan, Vít SUCHOMEL a Pavel ŠMERK. POS Annotated 50M Corpus of Tajik Language. In Proceedings of the Workshop on Language Technology for Normalisation of Less-Resourced Languages (SALTMIL 8/AfLaT 2012). Istanbul: European Language Resources Association (ELRA), 2012. s. 93-98, 6 s. ISBN 978-2-9517408-7-7.
  35. SUCHOMEL, Vít. Recent Czech Web Corpora. In Aleš Horák, Pavel Rychlý. 6th Workshop on Recent Advances in Slavonic Natural Language Processing. Brno: Tribun EU, 2012. s. 77-83, 7 s. ISBN 978-80-263-0313-8.
  36. SpiderLing (software)
    SUCHOMEL, Vít. SpiderLing. 2012.
  37. DOVUDOV, Gulshan, Vít SUCHOMEL a Pavel ŠMERK. Towards 100M Morphologically Annotated Corpus of Tajik. In Aleš Horák, Pavel Rychlý. Proceedings of Recent Advances in Slavonic Natural Language Processing, RASLAN 2012. Brno: Tribun EU, 2012. s. 91-94, 4 s. ISBN 978-80-263-0313-8.
  38. 2011

  39. DOVUDOV, Gulshan, Jan POMIKÁLEK, Vít SUCHOMEL a Pavel ŠMERK. Building a 50M Corpus of Tajik Language. In Aleš Horák, Pavel Rychlý. Proceedings of Recent Advances in Slavonic Natural Language Processing, RASLAN 2011. Brno: Tribun EU, 2011. s. 89-95, 7 s. ISBN 978-80-263-0077-9.
  40. Chared (software)
    POMIKÁLEK, Jan a Vít SUCHOMEL. Chared. 2011.
  41. POMIKÁLEK, Jan a Vít SUCHOMEL. chared: Character Encoding Detection with a Known Language. In Aleš Horák, Pavel Rychlý. RASLAN 2011. 5. vyd. Brno, Czech Republic: Tribun EU, 2011. s. 125-129, 5 s. ISBN 978-80-263-0077-9.
  42. SUCHOMEL, Vít a Jan POMIKÁLEK. Practical Web Crawling for Text Corpora. In A. Horák, P. Rychlý. Proceedings of Recent Advances in Slavonic Natural Language Processing, RASLAN 2011. Brno: Tribun EU, 2011. s. 97-108, 12 s. ISBN 978-80-263-0077-9.
Zobrazit podrobně
Zobrazeno: 19. 12. 2018 05:07