E0034 Analysis and classification of data

Faculty of Science
Autumn 2024
Extent and Intensity
2/1/0. 3 credit(s) (plus extra credits for completion). Recommended Type of Completion: zk (examination). Other types of completion: z (credit).
In-person direct teaching
Teacher(s)
RNDr. Eva Koriťáková, Ph.D. (lecturer)
prof. Ing. Jiří Holčík, CSc. (lecturer)
RNDr. Roman Vyškovský, Ph.D. (lecturer)
Mgr. Tereza Jurková (seminar tutor)
Guaranteed by
prof. Ing. Jiří Holčík, CSc.
RECETOX – Faculty of Science
Contact Person: RNDr. Eva Koriťáková, Ph.D.
Supplier department: RECETOX – Faculty of Science
Timetable
Wed 8:00–10:50 F01B1/709
Prerequisites
It is recommended to attend the course Bi8600 Multivariate Methods or the course Bi8601 Advanced Statistical Methods beforehand.
Course Enrolment Limitations
The course is also offered to the students of the fields other than those the course is directly associated with.
fields of study / plans the course is directly associated with
Course objectives
The aim of the course is to introduce objectives and principles of data analysis and classification methods to students. During the course, the students will acquire knowledge of fundamental theoretical and methodological principles of pattern analysis and recognition methods with emphasis to processing of medical and biological data. Besides, they will master application of the methods on real-world data and interpretation of achieved results.
Learning outcomes
At the end of the course, students should be able to:
- Choose and apply appropriate data analysis method to obtain required results;
- Reduce and transform multivariate data using advanced ordination methods;
- Classify data using various classification methods;
- Design modified algorithms to process data of given particular characteristics;
- Interpret obtained results including evaluation of classification performance.
Syllabus
  • Introduction to analysis and classification of data.
  • Classification based on discriminant functions.
  • Classification based on minimal distance – deepening of knowledge about distance and similarity metrics.
  • Classification based on minimal distance.
  • Classification based on boundaries – Fisher’s Linear Discriminant Analysis (FLDA).
  • Classification based on boundaries – Support Vector Machines (SVM).
  • Sequential classification.
  • Evaluation of classification performance – cross-validation, comparison of classification results with random classification, comparison of classification performance of two or more classifiers.
  • Feature extraction – deepening of knowledge about Principal Component Analysis (PCA).
  • Advanced methods for feature extraction – Manifold Learning, Independent Component Analysis (ICA).
  • Feature selection.
  • Application of classification, pattern analysis and recognition methods on real-world data.
Literature
  • Dunham,M.H.: Data Mining: Introductory and Advanced Topics. Prentice Hall 2002
  • Holčík,J.: Analýza a klasifikace signálů. [Učební texty vysokých škol] Brno, Nakladatelství VUT v Brně 1992.
  • Mitchel,T.M.: Machine Learning. McGraw Hill 1997
Teaching methods
Online lectures and full-time form of practices. Understanding of principles, methods and algorithms is emphasized.
Assessment methods
Oral exam
Language of instruction
Czech
Further Comments
Study Materials
The course is taught annually.
Listed among pre-requisites of other courses
The course is also listed under the following terms Autumn 2022, Autumn 2023.
  • Enrolment Statistics (recent)
  • Permalink: https://is.muni.cz/course/sci/autumn2024/E0034