CJp022 Introduction to Corpus Linguistics

Faculty of Education
Autumn 2023
Extent and Intensity
0/2/0. 2 credit(s). Type of Completion: z (credit).
Taught in person.
Teacher(s)
PhDr. Ivana Kolářová, CSc. (lecturer)
Guaranteed by
PhDr. Ivana Kolářová, CSc.
Department of Czech Language and Literature – Faculty of Education
Contact Person: Petra Rozbořilová
Supplier department: Department of Czech Language and Literature – Faculty of Education
Timetable of Seminar Groups
CJp022/01: Thu 13:00–14:50 učebna 72, I. Kolářová
CJp022/02: Thu 15:00–16:50 učebna 72, I. Kolářová
CJp022/03: Tue 16:00–17:50 učebna 72, I. Kolářová
CJp022/04: Tue 13:00–14:50 učebna 72, I. Kolářová
Course Enrolment Limitations
The course is only offered to the students of the study fields the course is directly associated with.
fields of study / plans the course is directly associated with
Course objectives
The aim of the course is to acquaint students with searching in the linguistic corpus and to show them different possibilities of acquiring and processing language data.
The course takes into account the accreditation requirements for Information and Communication Technologies (ICT).
Learning outcomes
At the end of the course students should be able:
1. To use Czech National Corpus, to find loud/orthographical, morphological or lexical tasks in the corpus SYN2020 when using the types of Queries: "basic", "word", "lemma".
2. To create combined Query "CQL" to find grammatical form of word or phrase.
3. To choose suitable method of research of language phenomena in Czech National Corpus solving special problems.
4. To Classify founded language phenomena when using tolls of Czech Natonal Corpus (frequency, collocation).
5. To use Intercorp.
6. To use another tools of the Czech National Corpus: Morfio, Word at a Glance.
Syllabus
  • 1. Types of corpora, characteristics of them. Corpora of written and spoken Czech. Atributes and searching in Czech National Corpus.
  • 2. Orthographical/spelling variants in contemporary Czech language. Types of orthographical variants. Czech National Corpus as a tool for research of orthographical variants. Lemma, sublemma and words.
  • 3. Morphological variants in Czech National Corpus; concurence of double-forms of masculine nouns.
  • 4. Concurence of double-forms of feminine and neuter nouns in Czech National Corpus.
  • 5. Morphological variants of presents verbal forms in Czech National Corpus. Verbal types "krýt", "kupovat", "mazat".
  • 6. Another variants verbal forms in Czech National Corpus.
  • 7. Adverbs, particles and prepositions in Czech National Corpus.
  • 8. Words and multiverbal units in Czech National Corpus. Phraseology in Czech National Corpus.
  • 9.Word-forming concurrents in CNK. Substantives and adjectives by suffixes.
  • 10. Word-forming: verbs by suffixes and by prefixes.
  • 11. Combination of queries and other tools of interface KonText (positive and negative filters).
  • 12. Creating of subcorpora. Using of Intercorp.
  • 13. Morfio. Word at a Glance.
Literature
  • doporučená neurčeno Náhradní obsah: https://wiki.korpus.cz/doku.php/manualy:kontext:index
  • https://wiki.korpus.cz/doku.php/manualy:kontext:index
  • OSOLSOBĚ, Klára. Česká morfologie a korpusy (Czech morphology and corpora). Online. Vyd. 1. Praha: Karolinum, 2014. 236 pp. ISBN 978-80-246-2562-1. [citováno 2024-04-24] URL info
  • ČERMÁK, František, Karel KUČERA and Vladimír PETKEVIČ. Korpusová lingvistika Praha 2011, 2 Výzkum a výstavba korpusů.. Online. Praha: Nakladatelství Lidové noviny, Ústav Českého národního korpusu, 2011. Studie z korpusové lingvistiky 15. ISBN 978-80-7422-115-6. [citováno 2024-04-24] info
Teaching methods
A seminar - problem method, controlled discussion on the professional issues of the course.
Working with Czech National Corpus. Analysis of founded language phenomena.
Assessment methods
Credit requierements: Students have to presenttheir competence in working with the ČNK, it shall be proved by testing at last seminary. Students have to manage 10 of 15 practice taaks of test. During the semestr student shall work at the seminary regulary, too.
Language of instruction
Czech
Further Comments
Study Materials
The course is taught annually.
The course is also listed under the following terms Autumn 2018, Autumn 2019, autumn 2020, Autumn 2021, Autumn 2022.
  • Enrolment Statistics (recent)
  • Permalink: https://is.muni.cz/course/ped/autumn2023/CJp022