Odborné články k jednotlivým tématům

Přečtěte si článek a prezentujte jej svým kolegům v semináři se stejnou tematikou. Práci je možné udělat max. ve dvou lidech, na prezentaci nebude více než 8 minut. Snažte se kolegům sdělit to nejdůležitější, co pro vás bylo v článku nové nebo překvapující, na moc podrobností nebudete mít čas.

Morfologie
John Goldsmith. Unsupervised Learning of the Morphology of a Natural Language. Computational Linguistics 27(2), 153--198. DOI: 10.1162/089120101750300490, 2001.
Syntax
Reprezentace lexikálního významu
R. Navigli and S. Ponzetto. BabelNet: The Automatic Construction, Evaluation and Application of a Wide-Coverage Multilingual Semantic Network. Artificial Intelligence, 193, Elsevier, 2012, pp. 217-250.
Větná sémantika
Karin Kipper, Anna Korhonen, Neville Ryant, and Martha Palmer. Extensive Classifications of English verbs. Proceedings of the 12th EURALEX International Congress. Turin, Italy. September, 2006.
http://verbs.colorado.edu/~kipper/Papers/euralex-final.doc
Pragmatika
Geurts, Bart, Beaver, David I. and Maier, Emar, Discourse Representation Theory, The Stanford Encyclopedia of Philosophy (Spring 2016 Edition), Edward N. Zalta (ed.).
https://plato.stanford.edu/entries/discourse-representation-theory/
Korpusy a korpusové nástroje: výběr ze dvou zdrojů
Schäfer, Roland, and Felix Bildhauer. Web corpus construction. Synthesis Lectures on Human Language Technologies 6, no. 4 (2013): 1-145.
Dostupné v knihovně MU, momentálně máme jeden exemplář v labu, možno vypůjčit. Důležité kapitoly, které by se měly v referátu objevit: 4 Linguistic
Processing, 5 Corpus Evaluation. Nutno stručně zmínit i to nejdůležitější z kapitol 2 Data Collection, 3 Basic Corpus Cleaning.

Kilgarriff, Adam, Vít Baisa, Jan Bušta, Miloš Jakubíček, Vojtěch Kovář, Jan Michelfeit, Pavel Rychlý, and Vít Suchomel. The Sketch Engine: Ten Years On. Lexicography 1, no. 1 (2014): 7-36.
Dostupné online: https://link.springer.com/article/10.1007/s40607-014-0009-9.
Důležité kapitoly, které by se měly v referátu objevit: 3 Users and uses, 5 Corpora, 6.2 GDEX, 6.5 Terminology.
Strojový překlad: výběr ze dvou článků
Mikolov, Tomas, Quoc V. Le, and Ilya Sutskever. Exploiting similarities among languages for machine translation. arXiv preprint arXiv:1309.4168 (2013).

Bengio, Yoshua, et al. A neural probabilistic language model. Journal of machine learning research 3.Feb (2003): 1137-1155.
Lexikografické nástroje
Antonio J. Roa-Valverde, Salvador Sanchez-Alonso, Miguel-Angel Sicilia, Dieter Fensel: An approach to measuring and annotating the confidence of Wiktionary translations. Language Resources and Evaluation. June 2017, Volume 51, Issue 2, pp 319–349
https://link.springer.com/article/10.1007/s10579-017-9384-9
Rozpoznávání pojmenovaných entit
Chris Manning: Doing Named Entity Recognition? Don't optimize for F1. NLPers blog. 2006
https://nlpers.blogspot.cz/2006/08/doing-named-entity-recognition-dont.html
Reprezentace znalostí a inference
John McCarthy: Circumscription---A Form of Non-Monotonic Reasoning. Artificial Intelligence. Vol. 13. pp. 27-39. 1980
http://jmc.stanford.edu/articles/circumscription/circumscription.pdf
Jazykové modelování: výběr ze dvou článků
Kirchhoff, Katrin, and Mei Yang. Improved language modeling for statistical machine translation. Proceedings of the ACL Workshop on Building and Using Parallel Texts. Association for Computational Linguistics, 2005.

Mikolov, Tomas, et al. Recurrent neural network based language model. Interspeech. Vol. 2. 2010.