CJBB105 Corpus Linguistics

Faculty of Arts
Spring 2024
Extent and Intensity
2/0/0. 4 credit(s). Type of Completion: zk (examination).
Taught in person.
Teacher(s)
Mgr. Dana Hlaváčková, Ph.D. (lecturer)
doc. PhDr. Klára Osolsobě, Dr. (lecturer)
Guaranteed by
Mgr. Dana Hlaváčková, Ph.D.
Department of Czech Language – Faculty of Arts
Contact Person: Bc. Silvie Hulewicz, DiS.
Supplier department: Department of Czech Language – Faculty of Arts
Timetable
Wed 10:00–11:40 D21, except Wed 17. 4.
Course Enrolment Limitations
The course is also offered to the students of the fields other than those the course is directly associated with.
The capacity limit for the course is 40 student(s).
Current registration and enrolment status: enrolled: 35/40, only registered: 0/40, only registered with preference (fields directly associated with the programme): 0/40
fields of study / plans the course is directly associated with
there are 24 fields of study the course is directly associated with, display
Course objectives
The lecture provides a basic orientation in the field of corpus linguistics. Students are introduced to the following areas:
1) definition of corpus linguistics in the context of other fields, definition of the term language corpus,
2) history of corpus linguistics,
3) typology of corpora and methods of their building,
4) different types of corpus annotation,
5) use of corpora and corpus tools.
Learning outcomes
Upon completion of the course the student will be able to:
- understand the issues of corpus linguistics,
- understand the basic terminology of the field and use it,
- orient themself in the corpus typology,
- know the possibilities of using corpora.
Syllabus
  • 1. Language corpus and corpus linguistics.
  • 2. History of corpus linguistics.
  • 3. Typology of corpora.
  • 4. Building corpora.
  • 5. Corpora managers.
  • 6. Morphological and syntactic tagging.
  • 7. Use of corpora in linguistics and NLP.
  • 8. Corpus organizations, conferences, publications.
Literature
  • Studie z korpusové lingvistiky. Edited by František Čermák - Jana Klímová - Vladimír Petkevič. Vyd. 1. V Praze: Karolinum, 2000, 531 s. ISBN 807184893X. info
  • ČERMÁK, František. Korpus a korpusová lingvistika. Vydání první. Praha: Univerzita Karlova, nakladatelství Karolinum, 2017, 268 stran. ISBN 9788024637105. URL info
  • MCENERY, Tony and Andrew WILSON. Corpus linguistics. Edinburgh: Edinburgh University Press, 1996, 209 s. ISBN 0-7486-0482-0. info
  • MCENERY, Tony and Andrew HARDIE. Corpus linguistics : method, theory and practice. 1st pub. Cambridge: Cambridge University Press, 2012, xv, 294. ISBN 9780521547369. info
  • https://wiki.korpus.cz/
  • https://www.czechency.org/
Teaching methods
A lecture with corpora and corpora tools presentation.
Assessment methods
Written test: terminology, definitions - (knowledge of texts for homereading).
Language of instruction
Czech
Follow-Up Courses
Further Comments
Study Materials
The course is taught annually.
Listed among pre-requisites of other courses
The course is also listed under the following terms Spring 2006, Autumn 2006, Spring 2007, Autumn 2007, Spring 2008, Autumn 2008, Spring 2009, Autumn 2009, Spring 2010, Autumn 2010, Spring 2011, Autumn 2011, Spring 2012, Autumn 2012, Spring 2013, Autumn 2013, Spring 2014, Autumn 2014, Spring 2015, Autumn 2015, Autumn 2016, Autumn 2017, Autumn 2018, Spring 2020, Spring 2021, Spring 2022, Spring 2023, Spring 2025.
  • Enrolment Statistics (recent)
  • Permalink: https://is.muni.cz/course/phil/spring2024/CJBB105