FI:IA161 Advanced NLP - Course Information
IA161 Advanced Techniques of Natural Language Processing
Faculty of InformaticsAutumn 2020
- Extent and Intensity
- 1/1/0. 2 credit(s) (plus extra credits for completion). Type of Completion: k (colloquium).
Taught online. - Teacher(s)
- doc. RNDr. Aleš Horák, Ph.D. (lecturer)
RNDr. Miloš Jakubíček, Ph.D. (lecturer)
RNDr. Marek Medveď, Ph.D. (lecturer)
RNDr. Zuzana Nevěřilová, Ph.D. (lecturer)
RNDr. Adam Rambousek, Ph.D. (lecturer)
doc. Mgr. Pavel Rychlý, Ph.D. (lecturer)
RNDr. Vít Suchomel, Ph.D. (lecturer) - Guaranteed by
- doc. RNDr. Aleš Horák, Ph.D.
Department of Machine Learning and Data Processing – Faculty of Informatics
Supplier department: Department of Machine Learning and Data Processing – Faculty of Informatics - Timetable
- Thu 10:00–11:50 A219
- Prerequisites
- All students should have basic practical knowledge of programming in Python. Overview knowledge of the natural language processing field at the level of introductory courses such as IB030 Introduction to Natural Language Processing or PA153 Natural Language Processing is expected.
- Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
- fields of study / plans the course is directly associated with
- Image Processing and Analysis (programme FI, N-VIZ)
- Applied Informatics (programme FI, B-AP)
- Applied Informatics (programme FI, N-AP)
- Information Technology Security (eng.) (programme FI, N-IN)
- Information Technology Security (programme FI, N-IN)
- Bioinformatics and systems biology (programme FI, N-UIZD)
- Bioinformatics (programme FI, B-AP)
- Bioinformatics (programme FI, N-AP)
- Computer Games Development (programme FI, N-VIZ_A)
- Computer Graphics and Visualisation (programme FI, N-VIZ_A)
- Computer Networks and Communications (programme FI, N-PSKB_A)
- Computing Technology and Methodology (programme FI, D-INF_A)
- Cybersecurity Management (programme FI, N-RSSS_A)
- Digital Linguistics (programme FI, N-DL)
- Formal analysis of computer systems (programme FI, N-TEI)
- Fundamentals of Computer Science (programme FI, D-INF_A)
- Fundamentals of Computer Science (programme FI, D-INF)
- Graphic design (programme FI, N-VIZ)
- Graphic Design (programme FI, N-VIZ_A)
- Hardware Systems (programme FI, N-PSKB_A)
- Hardware systems (programme FI, N-PSKB)
- Image Processing and Analysis (programme FI, N-VIZ_A)
- Information security (programme FI, N-PSKB)
- Information Systems (programme FI, N-IN)
- Informatics with another discipline (programme FI, B-EB)
- Informatics with another discipline (programme FI, B-FY)
- Informatics with another discipline (programme FI, B-GE)
- Informatics with another discipline (programme FI, B-GK)
- Informatics with another discipline (programme FI, B-CH)
- Informatics with another discipline (programme FI, B-IO)
- Informatics with another discipline (programme FI, B-MA)
- Informatics with another discipline (programme FI, B-TV)
- Informatics (eng.) (programme FI, D-IN4)
- Informatics (programme FI, B-INF) (3)
- Informatics (programme FI, D-IN4)
- Public Administration Informatics (programme FI, B-AP)
- Informatics in education (programme FI, B-IVV) (2)
- Information Security (programme FI, N-PSKB_A)
- Human-Computer Interaction (programme FI, N-IZU)
- Quantum and Other Nonclassical Computational Models (programme FI, N-TEI)
- Cybersecurity (programme FI, B-CS)
- Mathematical Informatics (programme FI, B-IN)
- Deployment and operations of software systems (programme FI, N-SWE)
- Design and development of software systems (programme FI, N-SWE)
- Parallel and Distributed Systems (programme FI, B-IN)
- Parallel and Distributed Systems (programme FI, N-IN)
- Computer graphics and visualisation (programme FI, N-VIZ)
- Computer Graphics and Image Processing (programme FI, B-IN)
- Computer Graphics (programme FI, N-IN)
- Computational Linguistics (programme FF, N-PLIN_) (3)
- Computer Networks and Communication (programme FI, B-IN)
- Computer Networks and Communication (programme FI, N-IN)
- Computer Networks and Communications (programme FI, N-PSKB)
- Computer Systems and Technologies (eng.) (programme FI, D-IN4)
- Computer Systems and Technologies (programme FI, D-IN4)
- Computer Systems and Data Processing (programme FI, B-IN)
- Computer Systems (programme FI, N-IN)
- Usable Security (programme FI, N-IZU)
- Principles of programming languages (programme FI, N-TEI)
- Programming and development (programme FI, B-PVA)
- Embedded Systems (eng.) (programme FI, N-IN)
- Programmable Technical Structures (programme FI, B-IN)
- Embedded Systems (programme FI, N-IN)
- Cybersecurity management (programme FI, N-RSSS)
- Services development management (programme FI, N-RSSS)
- Software Systems Development Management (programme FI, N-RSSS)
- Services Development Management (programme FI, N-RSSS_A)
- Service Science, Management and Engineering (eng.) (programme FI, N-AP)
- Service Science, Management and Engineering (programme FI, N-AP)
- Social Informatics (programme FI, B-AP)
- Software Systems Development Management (programme FI, N-RSSS_A)
- Software Systems (programme FI, N-PSKB_A)
- Software systems (programme FI, N-PSKB)
- Machine learning and artificial intelligence (programme FI, N-UIZD)
- Computing Technology and Methodology (programme FI, D-INF)
- Theoretical Informatics (programme FI, N-IN)
- Teacher of Informatics and IT administrator (programme FI, N-UCI)
- Upper Secondary School Teacher Training in Informatics (programme FI, N-EB)
- Upper Secondary School Teacher Training in Informatics (programme FI, N-FY)
- Upper Secondary School Teacher Training in Informatics (programme FI, N-GK)
- Upper Secondary School Teacher Training in Informatics (programme FI, N-MA)
- Upper Secondary School Teacher Training in Informatics (programme FI, N-SS)
- Upper Secondary School Teacher Training in Informatics (programme FI, N-TV)
- Informatics for secondary school teachers (programme FI, N-UCI) (2)
- Upper Secondary School Teacher Training in Informatics (programme FI, N-FY)
- Upper Secondary School Teacher Training in Informatics (programme FI, N-GK)
- Upper Secondary School Teacher Training in Informatics (programme FI, N-MA)
- Upper Secondary School Teacher Training in Informatics (programme FI, N-SS) (2)
- Upper Secondary School Teacher Training in Informatics (programme FI, N-TV)
- Artificial Intelligence and Natural Language Processing (programme FI, B-IN)
- Artificial Intelligence and Natural Language Processing (programme FI, N-IN)
- User Experience in Visual Informatics (programme FI, N-IZU)
- Computer Games Development (programme FI, N-VIZ)
- Big data (programme FI, N-UIZD)
- Image Processing (programme FI, N-AP)
- Natural language processing (programme FI, N-UIZD)
- Course objectives
- The course participants will have the opportunity to learn about, test and experiment with advanced techniques of natural language processing (NLP) and to develop an understanding of the limits of those techniques. The course aims to introduce current research issues, and to meet in practice with particular programming techniques used in language technology applications.
- Learning outcomes
- After studying the course, the students will be able to:
- explain a selected NLP problem and list its main aspects;
- implement a basic or intermediate application for complex tasks in language processing, typically for Czech, Slovak, or English;
- create data resources (models, test sets) for a selected NLP problem and evaluate their assets;
- compare selected available tools for complex NLP tasks and apply them to chosen data resources with possible adaptations to particular purposes. - Syllabus
- The presented NLP problems will concentrate on practical problems connected with processing human-produced textual data. Particular topics include:
- Opinion mining, sentiment analysis
- Machine translation
- Parsing of Czech: Between Rules and Statistics
- Named Entity Recognition
- Building Language Resources from the Web (effective crawling, boilerplate removal, tokenisation, near duplicates identification)
- Language modelling
- Topic identification, topic modelling
- Extracting structured information from text
- Automatic relation extraction (hypernyms, synonyms, ...)
- Adaptive electronic dictionaries
- Terminology identification (keywords, key phrases)
- Anaphora resolution
- Stylometry
- Automatic language corrections
- Literature
- JURAFSKY, Dan and James H. MARTIN. Speech and language processing : an introduction to natural language processing, computational linguistics and speech recognition. 2nd ed. New Jersey: Pearson, 2009, 1024 s. ISBN 9780135041963. info
- Dan Jurafsky and James H. Martin. Speech and Language Processing (3rd ed. draft). https://web.stanford.edu/~jurafsky/slp3/
- MANNING, Christopher D. and Hinrich SCHÜTZE. Foundations of statistical natural language processing. Cambridge, Mass.: MIT Press, 1999, xxxvii, 68. ISBN 9780262133609. info
- MERLO, Paola, Harry BUNT and Joakim NIVRE. Trends in Parsing Technology: Dependency Parsing, Domain Adaptation, and Deep Parsing. Springer Netherlands, 2011, 297 pp. Text, Speech and Language Technology, Vol. 43. ISBN 978-90-481-9351-6. URL info
- https://www.aclweb.org/anthology/
- Teaching methods
- Each lecture consists of a one-hour lesson about the theoretical issues connected with a particular NLP problem, and a one-hour practical work in a computer laboratory devoted to the implementation, adaptation and evaluation of the presented techniques on real-world data.
- Assessment methods
- Solving tasks in the practical part of the lecture or in homeworks.
- Language of instruction
- Czech
- Further Comments
- Study Materials
The course is taught annually. - Teacher's information
- http://nlp.fi.muni.cz/AdvancedNlpCourse
- Enrolment Statistics (Autumn 2020, recent)
- Permalink: https://is.muni.cz/course/fi/autumn2020/IA161