Machine learning and natural language processing
doc. RNDr. Lubomír Popelínský, Ph.D.
Machine learning and natural language processing
Info
Období
podzim 2022

QUESTIONS AND TASKS:

  • Natural language (pre)processing techniques and their relevance for building machine learning models applicable to text

  • Bag of words representation of text - pros and cons

  • Text representations. When the ordering of words (does not) matter-s. Mining web

  • Main text mining tasks

  • ML for disambiguation

  • CNN for NLP

  • Distributional hypothesis - historical context, linguistic motivations and practical implementations

  • Distributional vs. formal semantics

  • Latent semantic analysis - basic principles, pros and cons

  • Word embedings - comparison of selected popular approaches

  • Basic principles of language models and their training process

  • Techniques for data augmentation

  • NN for machine translation

  • Text clustering

  • Recurrent NN for NLP. Describe one task.

  • Outliers in text data.

  • Methods for outlier detection in text

  • Why we need LSTM. Describe a task where LSTM areuseful

  • What are transformers? How do they differ from "vanilla" recurrent neural networks?

  • Describe the basic intuition behind gradient descent and back-propagation

  • ILP. why we need it. Describe a task where ILP is useful

  • Key words, keyness. How to compute.

  • Relational learning  (ILP) for key words (key phrases) detection

  • Text sumarization.  Two main approaches.

  • Extractive sumarization. ROUGE-n.

  • Sentiment analysis - the definition of the field, justification of its practical relevance and main challenges

  • Detailed overview of a selected lexicon-based approach to sentiment analysis

  • Detailed overview of a selected classical machine learning approach to sentiment analysis

  • Detailed overview of a selected deep learning approach to sentiment analysis

  • Comparison of lexicon-based, classical machine learning and deep learning approaches to sentiment analysis

  • Basic principles of knowledge representation

  • Ontologies vs. knowledge graphs - pros and cons of each approach to knowledge representation

  • The stack of typical tasks in ontology learning

  • Main challenges and open problems of ontology learning

  • Techniques used for term extraction, synonym discovery and concept formation

  • Techniques used for taxonomy extraction

  • Techniques used for relation, rule and axiom extraction

  • Overview of a selected deep learning approach to knowledge extraction

Kapitola obsahuje:
1
PDF
1
Studijní materiály
1
Studijní text
1
Web
Učitel doporučuje studovat od 12. 9. 2022 do 18. 9. 2022.
Kapitola obsahuje:
3
PDF
1
Studijní text
5
Web
Učitel doporučuje studovat od 16. 9. 2022 do 25. 9. 2022.
Kapitola obsahuje:
1
PDF
1
Studijní text
1
Web
Učitel doporučuje studovat od 26. 9. 2022 do 2. 10. 2022.
Kapitola obsahuje:
5
PDF
1
Video
3
Web
Učitel doporučuje studovat od 3. 10. 2022 do 9. 10. 2022.
Kapitola obsahuje:
16
PDF
1
Studijní text
7
Web
Učitel doporučuje studovat od 10. 10. 2022 do 16. 10. 2022.
Kapitola obsahuje:
4
PDF
1
Studijní text
3
Web
Učitel doporučuje studovat od 17. 10. 2022 do 23. 10. 2022.
Kapitola obsahuje:
1
PDF
1
Studijní text
Učitel doporučuje studovat od 24. 10. 2022 do 30. 10. 2022.
Kapitola obsahuje:
1
PDF
1
Studijní text
Učitel doporučuje studovat od 31. 10. 2022 do 6. 11. 2022.
Kapitola obsahuje:
1
PDF
1
Studijní text
Učitel doporučuje studovat od 7. 11. 2022 do 13. 11. 2022.
Kapitola obsahuje:
1
Studijní text
Učitel doporučuje studovat od 14. 11. 2022 do 20. 11. 2022.
Kapitola obsahuje:
1
Studijní text
Učitel doporučuje studovat od 21. 11. 2022 do 27. 11. 2022.
Kapitola obsahuje:
2
PDF
1
Studijní text
Učitel doporučuje studovat od 28. 11. 2022 do 4. 12. 2022.
Učitel doporučuje studovat od 5. 12. 2022 do 11. 12. 2022.
Předchozí