PLIN013 Proseminar, Pt. I

Faculty of Arts
Autumn 2020
Extent and Intensity
0/2/0. 3 credit(s). Type of Completion: z (credit).
Taught online.
Teacher(s)
Mgr. et Mgr. Markéta Audy Masopustová (lecturer)
Mgr. Dana Hlaváčková, Ph.D. (lecturer)
Mgr. Vojtěch Mrkývka, Ph.D. (lecturer)
Guaranteed by
doc. PhDr. Zdeňka Hladká, Dr.
Department of Czech Language – Faculty of Arts
Contact Person: Mgr. et Mgr. Markéta Audy Masopustová
Supplier department: Department of Czech Language – Faculty of Arts
Timetable
Tue 16:00–17:40 G13
Prerequisites
Basic knowledge of how to work with a computer.
Course Enrolment Limitations
The course is also offered to the students of the fields other than those the course is directly associated with.
The capacity limit for the course is 20 student(s).
Current registration and enrolment status: enrolled: 0/20, only registered: 0/20, only registered with preference (fields directly associated with the programme): 0/20
fields of study / plans the course is directly associated with
Course objectives
A practical introduction to the basic skills used in natural language processing. The lecture is supplemented by practical tasks, which will help to adopt abstract and formal processes. Students can highly influence the content of the seminar in the form of discussion and consultation of topics discussed in other seminars and lectures.
Learning outcomes
After finishing the course, the student should be able to:
- explain, what problems does computer linguistics deal with;
- distinguish between different encoding formats and use regular expressions;
- describe what are corpora and search them using different interfaces;
- retrieve data from the internet and use them for research purposes including inter-annotator agreement;
- typeset documents using LaTeX.
Syllabus
  • Introduction: organisation of the course, what is a computer, what is computer linguistics;
  • Computer storage of text: encoding, introduction to regular expressions;
  • Work with corpora;
  • Work with text data;
  • Typesetting of text documents.
Literature
    recommended literature
  • SATRAPA, Pavel. Regulární výrazy [online]. Vydání první: Root.cz, 2007 [cit. 2019-04-30]. Dostupné z: https://www.root.cz/knihy/regularni-vyrazy/
  • WIKIBOOKS. LaTeX: Wikibooks, The Free Textbook Project [online]. 2019 [cit. 2019-04-30]. Dostupné z: https://en.wikibooks.org/w/index.php?title=LaTeX&oldid=3527944
  • OSOLSOBĚ, Klára. Česká morfologie a korpusy (Czech morphology and corpora). Online. Vyd. 1. Praha: Karolinum, 2014. 236 pp. ISBN 978-80-246-2562-1. [citováno 2024-04-23] URL info
  • MANNING, Christopher D. and Hinrich SCHÜTZE. Foundations of statistical natural language processing. Online. Cambridge, Mass.: MIT Press, 1999. xxxvii, 68. ISBN 9780262133609. [citováno 2024-04-23] info
Teaching methods
Lecture merged with practical tasks, problem analysis, discussion.
Assessment methods
Written final exam, attendance, homework.
Language of instruction
Czech
Further Comments
Study Materials
The course is taught annually.
The course is also listed under the following terms Autumn 2010, Autumn 2011, Autumn 2012, Autumn 2013, Autumn 2014, Autumn 2019, Autumn 2021, Autumn 2023, Autumn 2024.
  • Enrolment Statistics (Autumn 2020, recent)
  • Permalink: https://is.muni.cz/course/phil/autumn2020/PLIN013