Machine learning and natural language processing
doc. RNDr. Lubomír Popelínský, Ph.D.
Term: autumn 2021

QUESTIONS AND TASKS:

  • Natural language (pre)processing techniques and their relevance for building machine learning models applicable to text

  • Bag of words representation of text - pros and cons

  • Text representations. When the ordering of words does (not) matter. Web mining

  • Main text mining tasks

  • ML for disambiguation

  • CNN for NLP

  • Distributional hypothesis - historical context, linguistic motivations and practical implementations

  • Distributional vs. formal semantics

  • Latent semantic analysis - basic principles, pros and cons

  • Word embeddings - comparison of selected popular approaches

  • Basic principles of language models and their training process

  • Techniques for data augmentation

  • NN for machine translation

  • Text clustering

  • Recurrent NN for NLP. Describe one task.

  • Outliers in text data.

  • Methods for outlier detection in text

  • Why we need LSTM. Describe a task where LSTMs are useful

  • ILP: why we need it. Describe a task where ILP is useful

  • Keywords and keyness. How to compute keyness.

  • Relational learning (ILP) for keyword (key phrase) detection

  • Text summarization. Two main approaches.

  • Extractive summarization. ROUGE-n.

  • Sentiment analysis - the definition of the field, justification of its practical relevance and main challenges

  • Detailed overview of a selected lexicon-based approach to sentiment analysis

  • Detailed overview of a selected classical machine learning approach to sentiment analysis

  • Detailed overview of a selected deep learning approach to sentiment analysis

  • Comparison of lexicon-based, classical machine learning and deep learning approaches to sentiment analysis

  • Basic principles of knowledge representation

  • Ontologies vs. knowledge graphs - pros and cons of each approach to knowledge representation

  • The stack of typical tasks in ontology learning

  • Main challenges and open problems of ontology learning

  • Techniques used for term extraction, synonym discovery and concept formation

  • Techniques used for taxonomy extraction

  • Techniques used for relation, rule and axiom extraction

  • Overview of a selected deep learning approach to knowledge extraction
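
As an illustration of one item from the list above (ROUGE-n for evaluating extractive summarization), here is a minimal sketch of the recall-oriented variant: the number of reference n-grams that also occur in the candidate summary, with counts clipped, divided by the total number of reference n-grams. The function names `ngrams` and `rouge_n_recall`, the whitespace tokenization, the lowercasing, and the example sentences are assumptions made for illustration; they do not come from the course materials.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of all contiguous n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def rouge_n_recall(candidate, reference, n=1):
    """ROUGE-n recall: clipped n-gram overlap / number of reference n-grams.

    Tokenization is a plain lowercase whitespace split (an assumption;
    real evaluations usually apply more careful preprocessing).
    """
    cand = ngrams(candidate.lower().split(), n)
    ref = ngrams(reference.lower().split(), n)
    total = sum(ref.values())
    if total == 0:
        return 0.0  # reference has no n-grams of this order
    # Each reference n-gram is matched at most as often as it occurs in the candidate.
    overlap = sum(min(count, cand[gram]) for gram, count in ref.items())
    return overlap / total


# Hypothetical example sentences, not taken from any dataset.
reference = "the cat sat on the mat"
candidate = "the cat lay on the mat"
print(rouge_n_recall(candidate, reference, n=1))
print(rouge_n_recall(candidate, reference, n=2))
```

In this toy pair, 5 of the 6 reference unigrams and 3 of the 5 reference bigrams are matched, so the score drops as n grows, which is the usual behaviour: higher-order ROUGE-n rewards preserving word order, not just vocabulary.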

The teacher recommends studying from 15 September 2021 to 21 September 2021.
The teacher recommends studying from 22 September 2021 to 28 September 2021.
The teacher recommends studying from 29 September 2021 to 5 October 2021.
The teacher recommends studying from 6 October 2021 to 12 October 2021.
The teacher recommends studying from 13 October 2021 to 19 October 2021.
The teacher recommends studying from 20 October 2021 to 26 October 2021.
The teacher recommends studying from 27 October 2021 to 2 November 2021.
The teacher recommends studying from 3 November 2021 to 9 November 2021.
The teacher recommends studying from 10 November 2021 to 16 November 2021.
The teacher recommends studying from 17 November 2021 to 23 November 2021.
The teacher recommends studying from 1 December 2021 to 8 December 2021.
The teacher recommends studying from 8 December 2021 to 14 December 2021.