FI:PB095 Intro to Speech Processing - Course Information
PB095 Introduction to Speech Processing
Faculty of InformaticsAutumn 2023
- Extent and Intensity
- 2/0/0. 2 credit(s) (plus extra credits for completion). Recommended Type of Completion: zk (examination). Other types of completion: k (colloquium), z (credit).
- Teacher(s)
- Mgr. Luděk Bártek, Ph.D. (lecturer)
- Guaranteed by
- Mgr. Luděk Bártek, Ph.D.
Department of Machine Learning and Data Processing – Faculty of Informatics
Supplier department: Department of Machine Learning and Data Processing – Faculty of Informatics - Timetable
- Mon 12:00–13:50 B204
- Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
- fields of study / plans the course is directly associated with
- there are 65 fields of study the course is directly associated with, display
- Course objectives
- The course provides an introduction to speech processing oriented to human-computer interaction, i.e. especially to speech synthesis, speech recognition and dialogue systems. Main objectives can be summarized as follows: To understand the basic principles of sound and speech production and perception; To understand basic principles of speech regognition,synthesis and dialogue systems; To obtain an introductory overview in the field.
- Learning outcomes
- Student will be able after finishing the course to describe and explain the basic terms, methods and standards in following areas:
- physical acoustics
- physiological acoustics, especially the processes of forming and understanding the human speech
- phonetics and phonology
- signal digitization and basic signal processing in time and frequency domains
- isolated words and commands recognition
- continues speech recognition
- time and frequency domain text-to-speech synthesis
- relation of prosody a emotions to tts and speech recognition
- dialogue communication
- dialogue systems
- user modeling in dialogue systems
- dialogue systems applicaiton.
- Syllabus
- Introduction
- Brief history
- State of the art
- Physical and physiological acoustics
- Creation and perception of human speech
- Phonetics a phonology
- Signal processing
- Principles of speech synthesis
- Speech segments and concantenative speech synthesis
- Prosody, emotions
- Principles of speech recognition
- Statistical approaches
- Modelling by means of HMM
- Language modelling
- Human-human and human-computer communication
- Dialogue
- Dialogue Systems - Voice Browser Activity Standards (VoiceXML, SRGS, SISR, etc.)
- User modelling
- Dialogue systems and applications
- Literature
- Teaching methods
- Introductory course. Basic information on theoretical frameworks.
- Assessment methods
- The students are passing both written test and oral examination when student finishes the course by an exam. When the course is finished another way student must answer practically oriented questions based on topics covered by the course. The students will elaborate evaluated home assignments during semester.
- Language of instruction
- Czech
- Further Comments
- Study Materials
The course is taught annually.
- Enrolment Statistics (Autumn 2023, recent)
- Permalink: https://is.muni.cz/course/fi/autumn2023/PB095