CJc304 The Introduction to the Corpus Linguistics

Faculty of Education
Autumn 2018
Extent and Intensity
0/0/0. 2 credit(s). Type of Completion: z (credit).
Teacher(s)
Mgr. Hana Žižková, Ph.D. (lecturer)
Guaranteed by
doc. PhDr. Milena Šubrtová, Ph.D.
Department of Czech Language and Literature – Faculty of Education
Contact Person: Petra Rozbořilová
Supplier department: Department of Czech Language and Literature – Faculty of Education
Timetable of Seminar Groups
CJc304/01: Fri 5. 10. 14:00–15:50 učebna 28, Fri 9. 11. 14:00–15:50 učebna 28, Fri 30. 11. 8:00–9:50 učebna 28, H. Žižková
Course Enrolment Limitations
The course is only offered to the students of the study fields the course is directly associated with.
fields of study / plans the course is directly associated with
Course objectives
At the end of the course students should be able:
1. To use Czech National Corpus, to find not only loud, morphological or lexical phenomena in the corpus SYN2015, but to create Query to find syntactical phenomena.
2. To Classify founded language phenomena using tolls of Czech Natonal Corpus and to elaborated them.
3. To choose suitable method of research of language phenomena in Czech National Corpus solving special problems.
4. To work with the corpora of spoken Czech.
Syllabus
  • 1. Types of corpora, characteristics of them. Corpora of written and spoken Czech.
  • 2. Morphological variants in Czech National Corpus; concurence of double-forms.
  • 3. Words and multiverbal units in Czech National Corpus. Phraseology in Czech National Corpus;
  • 4.Word-forming concurrents in CNK.
  • 5. Creating of queris for seeking of syntactical structures in corpus SYN2010. Combination of queries and other tools of interface KonText (positive and negative filters).
  • 6. Creating of subcorpora.
  • 7.Evaluation of linguistic data of CNK.
Literature
    recommended literature
  • ČERMÁK, František, Karel KUČERA and Vladimír PETKEVIČ. Korpusová lingvistika Praha 2011, 2 Výzkum a výstavba korpusů. Praha: Nakladatelství Lidové noviny, Ústav Českého národního korpusu, 2011. Studie z korpusové lingvistiky 15. ISBN 978-80-7422-115-6. info
  • ČERMÁK, František, Karel KUČERA, Vladimír PETKEVIČ and Alexander ROSEN. Korpusová lingvistika, Praha 2011. 3. Gramatika a značkování korpusů. Praha: Nakladatelství Lidové noviny, 2011, 225 pp. Studie z korpusové lingvistiky 16. ISBN 978-80-7422-116-3. info
  • Grammar & Corpora 2007 :selected contributions from the conference Gramar and Corpora, Sept. 25-27, 2007, Liblice. Edited by František Štícha - Mirjam Fried. Vyd. 1. Praha: Academia, 2008, 443 s. ISBN 9788020016348. info
  • Studie z korpusové lingvistiky. Edited by František Čermák - Jana Klímová - Vladimír Petkevič. Vyd. 1. V Praze: Karolinum, 2000, 531 s. ISBN 807184893X. info
Teaching methods
A seminar - problem method, controlled discussion on the professional issues of the course.
Working with Czech National Corpus. Analysis of founded language phenomena.
Assessment methods
Credit requierements: Students have to presenttheir competence in working with the ČNK, it shall be proved by testing at last seminary. Students have to manage 7 of 10 practice taaks of test. During the semestr student shall work at the seminary regulary, too.
Language of instruction
Czech
Further comments (probably available only in Czech)
Study Materials
The course is taught annually.
Information on the extent and intensity of the course: 9 hodin.
Teacher's information
http://wiki.korpus.cz/doku.php/manualy:kontext:index
The course is also listed under the following terms Autumn 2019, autumn 2020, Autumn 2021, Autumn 2022, Autumn 2023.
  • Enrolment Statistics (Autumn 2018, recent)
  • Permalink: https://is.muni.cz/course/ped/autumn2018/CJc304