PB095 Introduction to Speech Processing

Faculty of Informatics
Autumn 2023
Extent and Intensity
2/0/0. 2 credit(s) (plus extra credits for completion). Recommended Type of Completion: zk (examination). Other types of completion: k (colloquium), z (credit).
Teacher(s)
Mgr. Luděk Bártek, Ph.D. (lecturer)
Guaranteed by
Mgr. Luděk Bártek, Ph.D.
Department of Machine Learning and Data Processing – Faculty of Informatics
Supplier department: Department of Machine Learning and Data Processing – Faculty of Informatics
Timetable
Mon 12:00–13:50 B204
Course Enrolment Limitations
The course is also offered to students of fields other than those with which it is directly associated.
Course objectives
The course provides an introduction to speech processing oriented towards human-computer interaction, in particular speech synthesis, speech recognition and dialogue systems. The main objectives are: to understand the basic principles of sound and speech production and perception; to understand the basic principles of speech recognition, speech synthesis and dialogue systems; and to obtain an introductory overview of the field.
Learning outcomes
After finishing the course, the student will be able to describe and explain the basic terms, methods and standards in the following areas:
  • physical acoustics
  • physiological acoustics, especially the processes of producing and understanding human speech
  • phonetics and phonology
  • signal digitization and basic signal processing in time and frequency domains
  • recognition of isolated words and commands
  • continuous speech recognition
  • time and frequency domain text-to-speech synthesis
  • relation of prosody and emotions to TTS and speech recognition
  • dialogue communication
  • dialogue systems
  • user modeling in dialogue systems
  • applications of dialogue systems
Syllabus
  • Introduction
  • Brief history
  • State of the art
  • Physical and physiological acoustics
  • Creation and perception of human speech
  • Phonetics and phonology
  • Signal processing
  • Principles of speech synthesis
  • Speech segments and concatenative speech synthesis
  • Prosody, emotions
  • Principles of speech recognition
  • Statistical approaches
  • Modelling by means of HMM
  • Language modelling
  • Human-human and human-computer communication
  • Dialogue
  • Dialogue Systems - Voice Browser Activity Standards (VoiceXML, SRGS, SISR, etc.)
  • User modelling
  • Dialogue systems and applications
Literature
  • MCTEAR, Michael. Spoken dialogue technology: toward the conversational user interface. Edited by T. V. Raman. New York: Springer, 2004, xii, 432 pp. ISBN 1852336722.
  • PSUTKA, Josef. Komunikace s počítačem mluvenou řečí. Praha: Academia, 1995, 287 pp. ISBN 8020002030.
Teaching methods
Introductory course providing basic information on theoretical frameworks.
Assessment methods
Students who complete the course with an examination take both a written test and an oral examination. Students who complete the course in another way must answer practically oriented questions based on the topics covered in the course. Graded home assignments are given during the semester.
Language of instruction
Czech
Further Comments
The course is taught annually.
The course is also listed under the following terms: Autumn 2002, Autumn 2003, Autumn 2004, Autumn 2005, Autumn 2006, Autumn 2007, Spring 2008, Autumn 2008, Autumn 2009, Autumn 2010, Autumn 2011, Autumn 2012, Autumn 2013, Autumn 2014, Autumn 2015, Autumn 2016, Autumn 2017, Autumn 2018, Autumn 2019, Autumn 2020, Autumn 2021, Autumn 2022, Autumn 2024.
Permalink: https://is.muni.cz/course/fi/autumn2023/PB095