P030 Textual Information Systems

Faculty of Informatics
Spring 1998
Extent and Intensity
2/1. 3 credit(s). Recommended Type of Completion: zk (examination). Other types of completion: k (colloquium), z (credit).
Teacher(s)
doc. RNDr. Petr Sojka, Ph.D. (lecturer)
Guaranteed by
Contact Person: doc. RNDr. Petr Sojka, Ph.D.
Prerequisites
I005 Formal Languages and Automata I && P002 Introduction to Database Systems && I030 Introduction to Computer Linguistics
Students must have completed the courses I005 Formal Languages and Automata I, P002, and I030 Introduction to Computational Linguistics.
Course Enrolment Limitations
The course is also offered to students of fields other than those it is directly associated with.
Syllabus
  • Basic notions. TIS: textual information system. Classification of information systems.
  • Searching in TIS. Classification of searching and pattern-matching methods. Data structures for searching.
  • The Knuth-Morris-Pratt, Aho-Corasick, Boyer-Moore, and Commentz-Walter algorithms. Automata theory for searching.
  • Indexes. Indexing methods. Signature methods.
  • Languages for searching.
  • Data compression. Statistical methods.
  • Dictionary-based compression methods. Neural networks for text compression.
  • Syntactic methods. Context modelling.
  • Spell checking.
Language of instruction
Czech
Teacher's information
http://www.fi.muni.cz/~sojka/tis/
The course is also listed under the following terms: Spring 1996, Spring 1997, Spring 1999, Spring 2000, Spring 2001, Spring 2002.
  • Permalink: https://is.muni.cz/course/fi/spring1998/P030