FI:IB030 Introduction to CL - Course Information
IB030 Introduction to Computational Linguistics
Faculty of InformaticsAutumn 2005
- Extent and Intensity
- 2/0. 2 credit(s) (plus extra credits for completion). Recommended Type of Completion: zk (examination). Other types of completion: k (colloquium), z (credit).
- Teacher(s)
- doc. RNDr. Aleš Horák, Ph.D. (lecturer)
- Guaranteed by
- prof. Ing. Václav Přenosil, CSc.
Department of Machine Learning and Data Processing – Faculty of Informatics
Contact Person: doc. RNDr. Aleš Horák, Ph.D. - Timetable
- Wed 12:00–13:50 B204
- Prerequisites (in Czech)
- ! I030 Introduction to CL
Před IB030 doporučuji zapsat PV122 Formální struktura přirozeného jazyka. Vhodná je znalost Prologu. - Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
- fields of study / plans the course is directly associated with
- there are 14 fields of study the course is directly associated with, display
- Course objectives
- In this course the main principles of natural language processing are offered. The algorithmic description of the main language levels will be discussed, particularly, morphology, syntax, semantics and pragmatics. Also the resources of natural language data - corpora will be mentioned. The role of knowledge representation, inference and relations to AI will be touched as well.
- Syllabus
- Introduction to Computational Linguistics.
- Natural language as a main tool of human communication. Language data in corpora, information about corpus linguistics.
- Levels of description: phonetics and phonology, morphology, syntax, semantics and pragmatics. Traditional vs. formal grammars: representation of morphological and syntactic structures -- DAGs, meaning representation. Grammars: context-free, context-sensitive, logical - DCG, transformational. Generating and recognition: morphological, syntactic, sémantic. Parsing: morphological parser -- AJKA, syntactic -- KLARA, Techniques of analysis: top-down, bottom-up, mixed, heuristics. Problem of ambiguity and searching.
- Electronic or machine readable dictionaries: representation of lexical knowledge. Types of the machine readable dictionaries: monolingual, thesauri, idiomatic, morphological dictionaries (stems), translation dictionaries, - bi- or multilingual, the ways of their formalization.
- Semantic representation of sentece meanings: logical vs. lexical sémantics. The Compositionality Principle.
- Semantic classification of verbs, valency frames, predicates, transparent intensional logic (TIL) and its application to semantic analysis of Czech sentences.
- Pragmatics: sémantic and pragmatic nature of noun groups, discourse structure, deictic expressions, verbal and non-verbal contexts. Natural Language Understanding: semantic representation, inference and knowledge representations - are they the same? Structure of dialog systems.
- Literature
- PALA, Karel. Počítačové zpracování přirozeného jazyka (Natural Language Processing). 1st ed. Brno: FI MU, 2000, 190 pp. info
- ALLEN, James. Natural Language Understanding. 2nd ed. Redwood City: Benjamin/Cummings Publishing Company, 1995, xv, 654 s. ISBN 0-8053-0334-0. info
- The Oxford handbook of computational linguistics. Edited by Ruslan Mitkov. Oxford: Oxford University Press, 2003, xx, 784. ISBN 0198238827. info
- CHOMSKY, Noam. Syntaktické struktury., Logický základ teorie jazyka., O pojmu gramatické pravidlo (Syntactic Structures). 1st ed. Praha: Academia, 1966, 209 s. info
- MATERNA, Pavel and Jan ŠTĚPÁN. Filozofická logika: nová cesta? (Philosophical logic: a new way?). Olomouc: Olomouc (Univerzita Palackého), 2000, 127 pp. ISBN 80-244-0109-6. info
- Assessment methods (in Czech)
- Závěrečné hodnocení se děje na základě písemné zkoušky. Účast na přednáškách není povinná.
- Language of instruction
- Czech
- Further Comments
- The course is taught annually.
- Teacher's information
- http://nlp.fi.muni.cz/poc_lingv/
- Enrolment Statistics (Autumn 2005, recent)
- Permalink: https://is.muni.cz/course/fi/autumn2005/IB030