IB030 Introduction to Computational Linguistics

Faculty of Informatics
Autumn 2005
Extent and Intensity
2/0. 2 credit(s) (plus extra credits for completion). Recommended Type of Completion: zk (examination). Other types of completion: k (colloquium), z (credit).
Teacher(s)
doc. RNDr. Aleš Horák, Ph.D. (lecturer)
Guaranteed by
prof. Ing. Václav Přenosil, CSc.
Department of Machine Learning and Data Processing – Faculty of Informatics
Contact Person: doc. RNDr. Aleš Horák, Ph.D.
Timetable
Wed 12:00–13:50 B204
Prerequisites (in Czech)
! I030 Introduction to CL
Před IB030 doporučuji zapsat PV122 Formální struktura přirozeného jazyka. Vhodná je znalost Prologu.
Course Enrolment Limitations
The course is also offered to the students of the fields other than those the course is directly associated with.
fields of study / plans the course is directly associated with
there are 14 fields of study the course is directly associated with, display
Course objectives
In this course the main principles of natural language processing are offered. The algorithmic description of the main language levels will be discussed, particularly, morphology, syntax, semantics and pragmatics. Also the resources of natural language data - corpora will be mentioned. The role of knowledge representation, inference and relations to AI will be touched as well.
Syllabus
  • Introduction to Computational Linguistics.
  • Natural language as a main tool of human communication. Language data in corpora, information about corpus linguistics.
  • Levels of description: phonetics and phonology, morphology, syntax, semantics and pragmatics. Traditional vs. formal grammars: representation of morphological and syntactic structures -- DAGs, meaning representation. Grammars: context-free, context-sensitive, logical - DCG, transformational. Generating and recognition: morphological, syntactic, sémantic. Parsing: morphological parser -- AJKA, syntactic -- KLARA, Techniques of analysis: top-down, bottom-up, mixed, heuristics. Problem of ambiguity and searching.
  • Electronic or machine readable dictionaries: representation of lexical knowledge. Types of the machine readable dictionaries: monolingual, thesauri, idiomatic, morphological dictionaries (stems), translation dictionaries, - bi- or multilingual, the ways of their formalization.
  • Semantic representation of sentece meanings: logical vs. lexical sémantics. The Compositionality Principle.
  • Semantic classification of verbs, valency frames, predicates, transparent intensional logic (TIL) and its application to semantic analysis of Czech sentences.
  • Pragmatics: sémantic and pragmatic nature of noun groups, discourse structure, deictic expressions, verbal and non-verbal contexts. Natural Language Understanding: semantic representation, inference and knowledge representations - are they the same? Structure of dialog systems.
Literature
  • PALA, Karel. Počítačové zpracování přirozeného jazyka (Natural Language Processing). 1st ed. Brno: FI MU, 2000, 190 pp. info
  • ALLEN, James. Natural Language Understanding. 2nd ed. Redwood City: Benjamin/Cummings Publishing Company, 1995, xv, 654 s. ISBN 0-8053-0334-0. info
  • The Oxford handbook of computational linguistics. Edited by Ruslan Mitkov. Oxford: Oxford University Press, 2003, xx, 784. ISBN 0198238827. info
  • CHOMSKY, Noam. Syntaktické struktury., Logický základ teorie jazyka., O pojmu gramatické pravidlo (Syntactic Structures). 1st ed. Praha: Academia, 1966, 209 s. info
  • MATERNA, Pavel and Jan ŠTĚPÁN. Filozofická logika: nová cesta? (Philosophical logic: a new way?). Olomouc: Olomouc (Univerzita Palackého), 2000, 127 pp. ISBN 80-244-0109-6. info
Assessment methods (in Czech)
Závěrečné hodnocení se děje na základě písemné zkoušky. Účast na přednáškách není povinná.
Language of instruction
Czech
Further Comments
The course is taught annually.
Teacher's information
http://nlp.fi.muni.cz/poc_lingv/
The course is also listed under the following terms Autumn 2002, Autumn 2003, Autumn 2004, Spring 2007, Spring 2008, Spring 2009, Spring 2010, Spring 2011, Spring 2012, Spring 2013, Spring 2014, Spring 2015, Spring 2016, Spring 2017, Spring 2018, Spring 2019, Spring 2020, Spring 2021, Spring 2022, Spring 2023, Spring 2024, Spring 2025.
  • Enrolment Statistics (Autumn 2005, recent)
  • Permalink: https://is.muni.cz/course/fi/autumn2005/IB030