CJBB105 Corpus Linguistics
Faculty of ArtsSpring 2024
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: zk (examination).
Taught in person. - Teacher(s)
- Mgr. Dana Hlaváčková, Ph.D. (lecturer)
doc. PhDr. Klára Osolsobě, Dr. (lecturer) - Guaranteed by
- Mgr. Dana Hlaváčková, Ph.D.
Department of Czech Language – Faculty of Arts
Contact Person: Bc. Silvie Hulewicz, DiS.
Supplier department: Department of Czech Language – Faculty of Arts - Timetable
- Wed 10:00–11:40 D21, except Wed 17. 4.
- Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
The capacity limit for the course is 40 student(s).
Current registration and enrolment status: enrolled: 35/40, only registered: 0/40, only registered with preference (fields directly associated with the programme): 0/40 - fields of study / plans the course is directly associated with
- there are 24 fields of study the course is directly associated with, display
- Course objectives
- The lecture provides a basic orientation in the field of corpus linguistics. Students are introduced to the following areas:
1) definition of corpus linguistics in the context of other fields, definition of the term language corpus,
2) history of corpus linguistics,
3) typology of corpora and methods of their building,
4) different types of corpus annotation,
5) use of corpora and corpus tools. - Learning outcomes
- Upon completion of the course the student will be able to:
- understand the issues of corpus linguistics,
- understand the basic terminology of the field and use it,
- orient themself in the corpus typology,
- know the possibilities of using corpora. - Syllabus
- 1. Language corpus and corpus linguistics.
- 2. History of corpus linguistics.
- 3. Typology of corpora.
- 4. Building corpora.
- 5. Corpora managers.
- 6. Morphological and syntactic tagging.
- 7. Use of corpora in linguistics and NLP.
- 8. Corpus organizations, conferences, publications.
- Literature
- Studie z korpusové lingvistiky. Edited by František Čermák - Jana Klímová - Vladimír Petkevič. Vyd. 1. V Praze: Karolinum, 2000, 531 s. ISBN 807184893X. info
- ČERMÁK, František. Korpus a korpusová lingvistika. Vydání první. Praha: Univerzita Karlova, nakladatelství Karolinum, 2017, 268 stran. ISBN 9788024637105. URL info
- MCENERY, Tony and Andrew WILSON. Corpus linguistics. Edinburgh: Edinburgh University Press, 1996, 209 s. ISBN 0-7486-0482-0. info
- MCENERY, Tony and Andrew HARDIE. Corpus linguistics : method, theory and practice. 1st pub. Cambridge: Cambridge University Press, 2012, xv, 294. ISBN 9780521547369. info
- https://wiki.korpus.cz/
- https://www.czechency.org/
- Teaching methods
- A lecture with corpora and corpora tools presentation.
- Assessment methods
- Written test: terminology, definitions - (knowledge of texts for homereading).
- Language of instruction
- Czech
- Follow-Up Courses
- Further Comments
- Study Materials
The course is taught annually. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics - Lecture
Faculty of ArtsSpring 2025
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: zk (examination).
Taught in person. - Teacher(s)
- Mgr. Dana Hlaváčková, Ph.D. (lecturer)
doc. PhDr. Klára Osolsobě, Dr. (lecturer) - Guaranteed by
- Mgr. Dana Hlaváčková, Ph.D.
Department of Czech Language – Faculty of Arts
Contact Person: Bc. Silvie Hulewicz, DiS.
Supplier department: Department of Czech Language – Faculty of Arts - Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
The capacity limit for the course is 40 student(s).
Current registration and enrolment status: enrolled: 0/40, only registered: 0/40, only registered with preference (fields directly associated with the programme): 0/40 - fields of study / plans the course is directly associated with
- there are 18 fields of study the course is directly associated with, display
- Course objectives
- The lecture provides a basic orientation in the field of corpus linguistics. Students are introduced to the following areas:
1) definition of corpus linguistics in the context of other fields, definition of the term language corpus,
2) history of corpus linguistics,
3) typology of corpora and methods of their building,
4) different types of corpus annotation,
5) use of corpora and corpus tools. - Learning outcomes
- Upon completion of the course the student will be able to:
- understand the issues of corpus linguistics,
- understand the basic terminology of the field and use it,
- orient themself in the corpus typology,
- know the possibilities of using corpora. - Syllabus
- 1. Language corpus and corpus linguistics.
- 2. History of corpus linguistics.
- 3. Typology of corpora.
- 4. Building corpora.
- 5. Corpora managers.
- 6. Morphological and syntactic tagging.
- 7. Use of corpora in linguistics and NLP.
- 8. Corpus organizations, conferences, publications.
- Literature
- Studie z korpusové lingvistiky. Edited by František Čermák - Jana Klímová - Vladimír Petkevič. Vyd. 1. V Praze: Karolinum, 2000, 531 s. ISBN 807184893X. info
- ČERMÁK, František. Korpus a korpusová lingvistika. Vydání první. Praha: Univerzita Karlova, nakladatelství Karolinum, 2017, 268 stran. ISBN 9788024637105. URL info
- MCENERY, Tony and Andrew WILSON. Corpus linguistics. Edinburgh: Edinburgh University Press, 1996, 209 s. ISBN 0-7486-0482-0. info
- MCENERY, Tony and Andrew HARDIE. Corpus linguistics : method, theory and practice. 1st pub. Cambridge: Cambridge University Press, 2012, xv, 294. ISBN 9780521547369. info
- https://wiki.korpus.cz/
- https://www.czechency.org/
- Teaching methods
- A lecture with corpora and corpora tools presentation.
- Assessment methods
- Written test: terminology, definitions - (knowledge of texts for homereading).
- Language of instruction
- Czech
- Follow-Up Courses
- Further Comments
- The course is taught annually.
The course is taught: every week. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics - Lecture
Faculty of ArtsSpring 2023
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: zk (examination).
Taught in person. - Teacher(s)
- Mgr. Dana Hlaváčková, Ph.D. (lecturer)
doc. PhDr. Klára Osolsobě, Dr. (lecturer) - Guaranteed by
- Mgr. Dana Hlaváčková, Ph.D.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová
Supplier department: Department of Czech Language – Faculty of Arts - Timetable
- Wed 12:00–13:40 K23
- Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
The capacity limit for the course is 40 student(s).
Current registration and enrolment status: enrolled: 16/40, only registered: 0/40, only registered with preference (fields directly associated with the programme): 0/40 - fields of study / plans the course is directly associated with
- there are 17 fields of study the course is directly associated with, display
- Course objectives
- The lecture provides a basic orientation in the field of corpus linguistics. Students are introduced to the following areas:
1) definition of corpus linguistics in the context of other fields, definition of the term language corpus,
2) history of corpus linguistics,
3) typology of corpora and methods of their building,
4) different types of corpus annotation,
5) use of corpora and corpus tools. - Learning outcomes
- Upon completion of the course the student will be able to:
- understand the issues of corpus linguistics,
- understand the basic terminology of the field and use it,
- orient themself in the corpus typology,
- know the possibilities of using corpora. - Syllabus
- 1. Language corpus and corpus linguistics.
- 2. History of corpus linguistics.
- 3. Typology of corpora.
- 4. Building corpora.
- 5. Corpora managers.
- 6. Morphological and syntactic tagging.
- 7. Use of corpora in linguistics and NLP.
- 8. Corpus organizations, conferences, publications.
- Literature
- Studie z korpusové lingvistiky. Edited by František Čermák - Jana Klímová - Vladimír Petkevič. Vyd. 1. V Praze: Karolinum, 2000, 531 s. ISBN 807184893X. info
- ČERMÁK, František. Korpus a korpusová lingvistika. Vydání první. Praha: Univerzita Karlova, nakladatelství Karolinum, 2017, 268 stran. ISBN 9788024637105. URL info
- MCENERY, Tony and Andrew WILSON. Corpus linguistics. Edinburgh: Edinburgh University Press, 1996, 209 s. ISBN 0-7486-0482-0. info
- MCENERY, Tony and Andrew HARDIE. Corpus linguistics : method, theory and practice. 1st pub. Cambridge: Cambridge University Press, 2012, xv, 294. ISBN 9780521547369. info
- https://wiki.korpus.cz/
- https://www.czechency.org/
- Teaching methods
- A lecture with corpora and corpora tools presentation.
- Assessment methods
- Written test: terminology, definitions - (knowledge of texts for homereading).
- Language of instruction
- Czech
- Follow-Up Courses
- Further Comments
- Study Materials
The course is taught annually. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics - Lecture
Faculty of ArtsSpring 2022
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: zk (examination).
Taught in person. - Teacher(s)
- Mgr. Dana Hlaváčková, Ph.D. (lecturer)
doc. PhDr. Klára Osolsobě, Dr. (lecturer) - Guaranteed by
- Mgr. Dana Hlaváčková, Ph.D.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová
Supplier department: Department of Czech Language – Faculty of Arts - Timetable
- Tue 12:00–13:40 D21
- Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
The capacity limit for the course is 40 student(s).
Current registration and enrolment status: enrolled: 11/40, only registered: 0/40, only registered with preference (fields directly associated with the programme): 0/40 - fields of study / plans the course is directly associated with
- there are 17 fields of study the course is directly associated with, display
- Course objectives
- The lecture provides a basic orientation in the field of corpus linguistics. Students are introduced to the following areas:
1) definition of corpus linguistics in the context of other fields, definition of the term language corpus,
2) history of corpus linguistics,
3) typology of corpora and methods of their building,
4) different types of corpus annotation,
5) use of corpora and corpus tools. - Learning outcomes
- Upon completion of the course the student will be able to:
- understand the issues of corpus linguistics,
- understand the basic terminology of the field and use it,
- orient themself in the corpus typology,
- know the possibilities of using corpora. - Syllabus
- 1. Language corpus and corpus linguistics.
- 2. History of corpus linguistics.
- 3. Typology of corpora.
- 4. Building corpora.
- 5. Corpora managers.
- 6. Morphological and syntactic tagging.
- 7. Use of corpora in linguistics and NLP.
- 8. Corpus organizations, conferences, publications.
- Literature
- Studie z korpusové lingvistiky. Edited by František Čermák - Jana Klímová - Vladimír Petkevič. Vyd. 1. V Praze: Karolinum, 2000, 531 s. ISBN 807184893X. info
- ČERMÁK, František. Korpus a korpusová lingvistika. Vydání první. Praha: Univerzita Karlova, nakladatelství Karolinum, 2017, 268 stran. ISBN 9788024637105. URL info
- MCENERY, Tony and Andrew WILSON. Corpus linguistics. Edinburgh: Edinburgh University Press, 1996, 209 s. ISBN 0-7486-0482-0. info
- MCENERY, Tony and Andrew HARDIE. Corpus linguistics : method, theory and practice. 1st pub. Cambridge: Cambridge University Press, 2012, xv, 294. ISBN 9780521547369. info
- https://wiki.korpus.cz/
- https://www.czechency.org/
- Teaching methods
- A lecture with corpora and corpora tools presentation.
- Assessment methods
- Written test: terminology, definitions - (knowledge of texts for homereading).
- Language of instruction
- Czech
- Follow-Up Courses
- Further Comments
- Study Materials
The course is taught annually. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics - Lecture
Faculty of ArtsSpring 2021
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: zk (examination).
Taught online. - Teacher(s)
- Mgr. Dana Hlaváčková, Ph.D. (lecturer)
- Guaranteed by
- Mgr. Dana Hlaváčková, Ph.D.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová
Supplier department: Department of Czech Language – Faculty of Arts - Timetable
- Tue 12:00–13:40 D31
- Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
The capacity limit for the course is 40 student(s).
Current registration and enrolment status: enrolled: 5/40, only registered: 0/40, only registered with preference (fields directly associated with the programme): 0/40 - fields of study / plans the course is directly associated with
- there are 17 fields of study the course is directly associated with, display
- Course objectives
- The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed:
1) Corpus linguistics – history.
2) What is a corpus and what is in it?
3) Quantitative data.
4) The use of corpora in language studies.
5) Corpora and computational linguistics.
6) Corpus managers.
7) Part of speech analysis and tagging of a corpus.
8) Czech national corpus.
9) Corpora at MU. - Learning outcomes
- Upon completion of the course the student will be able to:
- understand the issues of corpus linguistics,
- understand the basic terminology of the field and use it,
- orient themself in the corpus typology,
- know the possibilities of using corpora. - Syllabus
- 1. Corpus Linguistics – History (ÚČNK).
- 2. Building Corpora.
- 3. Corpora of ČNK.
- 4. Automatical Morphological Analysis (tokenization, tagging, disambiguation).
- 5. Some problems of Automatical Morphological Analysis.
- 6. Spoken Language Corpora.
- 7. Corpus of Private Corespondence.
- 8. Corpus Manager.
- 9. Quantitative Data.
- 10. Diachrony and Corpora.
- Literature
- Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
- Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
- Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
- http://ufal.mff.cuni.cz/pdt2.0/index-cz.html
- Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
- http://ucnk.ff.cuni.cz/
- Encyklopedický slovník češtiny. Edited by Petr Karlík - Marek Nekula - Jana Pleskalová. Praha: Nakladatelství Lidové noviny, 2002, 604 s. ISBN 80-7106-484-X. info
- Studie z korpusové lingvistiky. Edited by František Čermák - Jana Klímová - Vladimír Petkevič. Vyd. 1. V Praze: Karolinum, 2000, 531 s. ISBN 807184893X. info
- MCENERY, Tony and Andrew WILSON. Corpus linguistics. Edinburgh: Edinburgh University Press, 1996, 209 s. ISBN 0-7486-0482-0. info
- BARNBROOK, Geoff. Language and computers :a practical introduction to the computer analysis of language. Edinburgh: Edinburgh University Press, 1996, ix, 209 s. ISBN 0-7486-0785-4. info
- Teaching methods
- A lecture with corpora and corpora tools presentation. Homereading.
- Assessment methods
- Colloquium. Written test: terminology, definitions - (knowledge of texts for homereading). The test will contain ten questions.
- Language of instruction
- Czech
- Follow-Up Courses
- Further Comments
- Study Materials
The course is taught annually. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics - Lecture
Faculty of ArtsSpring 2020
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: zk (examination).
- Teacher(s)
- Mgr. Dana Hlaváčková, Ph.D. (lecturer)
- Guaranteed by
- Mgr. Dana Hlaváčková, Ph.D.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová
Supplier department: Department of Czech Language – Faculty of Arts - Timetable
- Tue 10:00–11:40 D41
- Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
The capacity limit for the course is 40 student(s).
Current registration and enrolment status: enrolled: 2/40, only registered: 0/40, only registered with preference (fields directly associated with the programme): 0/40 - fields of study / plans the course is directly associated with
- there are 17 fields of study the course is directly associated with, display
- Course objectives
- The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed:
1) Corpus linguistics – history.
2) What is a corpus and what is in it?
3) Quantitative data.
4) The use of corpora in language studies.
5) Corpora and computational linguistics.
6) Corpus managers.
7) Part of speech analysis and tagging of a corpus.
8) Czech national corpus.
9) Corpora at MU. - Learning outcomes
- Upon completion of the course the student will be able to:
- understand the issues of corpus linguistics,
- understand the basic terminology of the field and use it,
- orient themself in the corpus typology,
- know the possibilities of using corpora. - Syllabus
- 1. Corpus Linguistics – History (ÚČNK).
- 2. Building Corpora.
- 3. Corpora of ČNK.
- 4. Automatical Morphological Analysis (tokenization, tagging, disambiguation).
- 5. Some problems of Automatical Morphological Analysis.
- 6. Spoken Language Corpora.
- 7. Corpus of Private Corespondence.
- 8. Corpus Manager.
- 9. Quantitative Data.
- 10. Diachrony and Corpora.
- Literature
- Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
- Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
- Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
- http://ufal.mff.cuni.cz/pdt2.0/index-cz.html
- Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
- http://ucnk.ff.cuni.cz/
- Encyklopedický slovník češtiny. Edited by Petr Karlík - Marek Nekula - Jana Pleskalová. Praha: Nakladatelství Lidové noviny, 2002, 604 s. ISBN 80-7106-484-X. info
- Studie z korpusové lingvistiky. Edited by František Čermák - Jana Klímová - Vladimír Petkevič. Vyd. 1. V Praze: Karolinum, 2000, 531 s. ISBN 807184893X. info
- MCENERY, Tony and Andrew WILSON. Corpus linguistics. Edinburgh: Edinburgh University Press, 1996, 209 s. ISBN 0-7486-0482-0. info
- BARNBROOK, Geoff. Language and computers :a practical introduction to the computer analysis of language. Edinburgh: Edinburgh University Press, 1996, ix, 209 s. ISBN 0-7486-0785-4. info
- Teaching methods
- A lecture with corpora and corpora tools presentation. Homereading.
- Assessment methods
- Colloquium. Written test: terminology, definitions - (knowledge of texts for homereading). The test will contain ten questions.
- Language of instruction
- Czech
- Follow-Up Courses
- Further Comments
- Study Materials
The course is taught annually. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics - Lecture
Faculty of ArtsAutumn 2018
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: zk (examination).
- Teacher(s)
- Mgr. Dana Hlaváčková, Ph.D. (lecturer)
- Guaranteed by
- doc. PhDr. Klára Osolsobě, Dr.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová
Supplier department: Department of Czech Language – Faculty of Arts - Timetable
- Tue 10:00–11:40 G31
- Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
The capacity limit for the course is 50 student(s).
Current registration and enrolment status: enrolled: 1/50, only registered: 0/50, only registered with preference (fields directly associated with the programme): 0/50 - fields of study / plans the course is directly associated with
- there are 15 fields of study the course is directly associated with, display
- Course objectives
- The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed: 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU.
- Learning outcomes
- Upon completion of the course the student will be able to:
- understand the issues of corpus linguistics,
- understand the basic terminology of the field and use it,
- orient themself in the corpus typology,
- know the possibilities of using corpora. - Syllabus
- 1. Corpus Linguistics - History (ÚČNK) 2. Building Corpora 3. Corpora of ČNK 4. Automatical Morphological Analysis (tokenization, tagging, disambiguation) 5. Some problems of Automatical Morphological Analysis 6. Spoken Language Corpora 7. Corpus of Private Corespondence 8. Corpus Manager 9. Quantitative Data 10. Diachrony and Corpora
- Literature
- http://ufal.mff.cuni.cz/pdt2.0/index-cz.html
- Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
- Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
- Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
- Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
- http://ucnk.ff.cuni.cz/
- Encyklopedický slovník češtiny. Edited by Petr Karlík - Marek Nekula - Jana Pleskalová. Praha: Nakladatelství Lidové noviny, 2002, 604 s. ISBN 80-7106-484-X. info
- Studie z korpusové lingvistiky. Edited by František Čermák - Jana Klímová - Vladimír Petkevič. Vyd. 1. V Praze: Karolinum, 2000, 531 s. ISBN 807184893X. info
- MCENERY, Tony and Andrew WILSON. Corpus linguistics. Edinburgh: Edinburgh University Press, 1996, 209 s. ISBN 0-7486-0482-0. info
- BARNBROOK, Geoff. Language and computers :a practical introduction to the computer analysis of language. Edinburgh: Edinburgh University Press, 1996, ix, 209 s. ISBN 0-7486-0785-4. info
- Teaching methods
- A lecture with corpora and corpora tools presentation. Homereading.
- Assessment methods
- Colloquium. Written test: terminology, definitions - (knowledge of texts for homereading). The test will contain ten questions.
- Language of instruction
- Czech
- Follow-Up Courses
- Further Comments
- Study Materials
The course is taught each semester. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics - Lecture
Faculty of ArtsAutumn 2017
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: zk (examination).
- Teacher(s)
- Mgr. Dana Hlaváčková, Ph.D. (lecturer)
- Guaranteed by
- doc. PhDr. Klára Osolsobě, Dr.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová
Supplier department: Department of Czech Language – Faculty of Arts - Timetable
- Wed 9:10–10:45 T103
- Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
The capacity limit for the course is 40 student(s).
Current registration and enrolment status: enrolled: 0/40, only registered: 0/40, only registered with preference (fields directly associated with the programme): 0/40 - fields of study / plans the course is directly associated with
- there are 14 fields of study the course is directly associated with, display
- Course objectives
- The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed: 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU.
- Learning outcomes
- Upon completion of the course the student will be able to:
- understand the issues of corpus linguistics,
- understand the basic terminology of the field and use it,
- orient themself in the corpus typology,
- know the possibilities of using corpora. - Syllabus
- 1. Corpus Linguistics - History (ÚČNK) 2. Building Corpora 3. Corpora of ČNK 4. Automatical Morphological Analysis (tokenization, tagging, disambiguation) 5. Some problems of Automatical Morphological Analysis 6. Spoken Language Corpora 7. Corpus of Private Corespondence 8. Corpus Manager 9. Quantitative Data 10. Diachrony and Corpora
- Literature
- Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
- Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
- http://ucnk.ff.cuni.cz/
- Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
- http://ufal.mff.cuni.cz/pdt2.0/index-cz.html
- Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
- Encyklopedický slovník češtiny. Edited by Petr Karlík - Marek Nekula - Jana Pleskalová. Praha: Nakladatelství Lidové noviny, 2002, 604 s. ISBN 80-7106-484-X. info
- Studie z korpusové lingvistiky. Edited by František Čermák - Jana Klímová - Vladimír Petkevič. Vyd. 1. V Praze: Karolinum, 2000, 531 s. ISBN 807184893X. info
- MCENERY, Tony and Andrew WILSON. Corpus linguistics. Edinburgh: Edinburgh University Press, 1996, 209 s. ISBN 0-7486-0482-0. info
- BARNBROOK, Geoff. Language and computers :a practical introduction to the computer analysis of language. Edinburgh: Edinburgh University Press, 1996, ix, 209 s. ISBN 0-7486-0785-4. info
- Teaching methods
- A lecture with corpora and corpora tools presentation. Homereading.
- Assessment methods
- Colloquium. Written test: terminology, definitions - (knowledge of texts for homereading). The test will contain ten questions.
- Language of instruction
- Czech
- Follow-Up Courses
- Further Comments
- Study Materials
The course is taught annually. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics - Lecture
Faculty of ArtsAutumn 2016
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: zk (examination).
- Teacher(s)
- Mgr. Dana Hlaváčková, Ph.D. (lecturer)
- Guaranteed by
- doc. PhDr. Klára Osolsobě, Dr.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová
Supplier department: Department of Czech Language – Faculty of Arts - Timetable
- Mon 14:10–15:45 M24
- Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
The capacity limit for the course is 40 student(s).
Current registration and enrolment status: enrolled: 0/40, only registered: 0/40, only registered with preference (fields directly associated with the programme): 0/40 - fields of study / plans the course is directly associated with
- there are 14 fields of study the course is directly associated with, display
- Course objectives
- The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed: 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU.
- Syllabus
- 1. Corpus Linguistics - History (ÚČNK) 2. Building Corpora 3. Corpora of ČNK 4. Automatical Morphological Analysis (tokenization, tagging, disambiguation) 5. Some problems of Automatical Morphological Analysis 6. Spoken Language Corpora 7. Corpus of Private Corespondence 8. Corpus Manager 9. Quantitative Data 10. Diachrony and Corpora
- Literature
- Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
- http://ufal.mff.cuni.cz/pdt2.0/index-cz.html
- http://ucnk.ff.cuni.cz/
- Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
- Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
- Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
- Encyklopedický slovník češtiny. Edited by Petr Karlík - Marek Nekula - Jana Pleskalová. Praha: Nakladatelství Lidové noviny, 2002, 604 s. ISBN 80-7106-484-X. info
- Studie z korpusové lingvistiky. Edited by František Čermák - Jana Klímová - Vladimír Petkevič. Vyd. 1. V Praze: Karolinum, 2000, 531 s. ISBN 807184893X. info
- MCENERY, Tony and Andrew WILSON. Corpus linguistics. Edinburgh: Edinburgh University Press, 1996, 209 s. ISBN 0-7486-0482-0. info
- BARNBROOK, Geoff. Language and computers :a practical introduction to the computer analysis of language. Edinburgh: Edinburgh University Press, 1996, ix, 209 s. ISBN 0-7486-0785-4. info
- Teaching methods
- A lecture with corpora and corpora tools presentation. Homereading.
- Assessment methods
- Colloquium. Written test: terminology, definitions - (knowledge of texts for homereading). The test will contain ten questions (minimum pass level 66,6 %).
- Language of instruction
- Czech
- Follow-Up Courses
- Further Comments
- Study Materials
The course is taught annually. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics - Lecture
Faculty of ArtsAutumn 2015
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: zk (examination).
- Teacher(s)
- Mgr. Dana Hlaváčková, Ph.D. (lecturer)
- Guaranteed by
- doc. PhDr. Klára Osolsobě, Dr.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová
Supplier department: Department of Czech Language – Faculty of Arts - Timetable
- Wed 10:50–12:25 U34
- Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
The capacity limit for the course is 40 student(s).
Current registration and enrolment status: enrolled: 0/40, only registered: 0/40, only registered with preference (fields directly associated with the programme): 0/40 - fields of study / plans the course is directly associated with
- there are 9 fields of study the course is directly associated with, display
- Course objectives
- The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed: 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU.
- Syllabus
- 1. Corpus Linguistics - History (ÚČNK) 2. Building Corpora 3. Corpora of ČNK 4. Automatical Morphological Analysis (tokenization, tagging, disambiguation) 5. Some problems of Automatical Morphological Analysis 6. Spoken Language Corpora 7. Corpus of Private Corespondence 8. Corpus Manager 9. Quantitative Data 10. Diachrony and Corpora
- Literature
- Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
- http://ufal.mff.cuni.cz/pdt2.0/index-cz.html
- http://ucnk.ff.cuni.cz/
- Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
- Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
- Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
- Encyklopedický slovník češtiny. Edited by Petr Karlík - Marek Nekula - Jana Pleskalová. Praha: Nakladatelství Lidové noviny, 2002, 604 s. ISBN 80-7106-484-X. info
- Studie z korpusové lingvistiky. Edited by František Čermák - Jana Klímová - Vladimír Petkevič. Vyd. 1. V Praze: Karolinum, 2000, 531 s. ISBN 807184893X. info
- MCENERY, Tony and Andrew WILSON. Corpus linguistics. Edinburgh: Edinburgh University Press, 1996, 209 s. ISBN 0-7486-0482-0. info
- BARNBROOK, Geoff. Language and computers :a practical introduction to the computer analysis of language. Edinburgh: Edinburgh University Press, 1996, ix, 209 s. ISBN 0-7486-0785-4. info
- Teaching methods
- A lecture with corpora and corpora tools presentation. Homereading.
- Assessment methods
- Colloquium. Written test: terminology, definitions - (knowledge of texts for homereading). The test will contain ten questions (minimum pass level 66,6 %).
- Language of instruction
- Czech
- Follow-Up Courses
- Further Comments
- Study Materials
The course is taught annually. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics - Lecture
Faculty of ArtsSpring 2015
- Extent and Intensity
- 1/0/0. 4 credit(s). Type of Completion: k (colloquium).
- Teacher(s)
- doc. PhDr. Klára Osolsobě, Dr. (lecturer)
- Guaranteed by
- doc. PhDr. Klára Osolsobě, Dr.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová
Supplier department: Department of Czech Language – Faculty of Arts - Timetable
- Wed 15:50–16:35 U35
- Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
The capacity limit for the course is 40 student(s).
Current registration and enrolment status: enrolled: 0/40, only registered: 0/40, only registered with preference (fields directly associated with the programme): 0/40 - fields of study / plans the course is directly associated with
- there are 9 fields of study the course is directly associated with, display
- Course objectives
- The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed: 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU.
- Syllabus
- 1. Corpus Linguistics - History (ÚČNK) 2. Building Corpora 3. Corpora of ČNK 4. Automatical Morphological Analysis (tokenization, tagging, disambiguation) 5. Some problems of Automatical Morphological Analysis 6. Spoken Language Corpora 7. Corpus of Private Corespondence 8. Corpus Manager 9. Quantitative Data 10. Diachrony and Corpora
- Literature
- Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
- Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
- http://ucnk.ff.cuni.cz/
- Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
- Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
- http://ufal.mff.cuni.cz/pdt2.0/index-cz.html
- Encyklopedický slovník češtiny. Edited by Petr Karlík - Marek Nekula - Jana Pleskalová. Praha: Nakladatelství Lidové noviny, 2002, 604 s. ISBN 80-7106-484-X. info
- Studie z korpusové lingvistiky. Edited by František Čermák - Jana Klímová - Vladimír Petkevič. Vyd. 1. V Praze: Karolinum, 2000, 531 s. ISBN 807184893X. info
- MCENERY, Tony and Andrew WILSON. Corpus linguistics. Edinburgh: Edinburgh University Press, 1996, 209 s. ISBN 0-7486-0482-0. info
- BARNBROOK, Geoff. Language and computers :a practical introduction to the computer analysis of language. Edinburgh: Edinburgh University Press, 1996, ix, 209 s. ISBN 0-7486-0785-4. info
- Teaching methods
- A lecture with corpora and corpora tools presentation. Homereading.
- Assessment methods
- Colloquium. Written test: terminology, definitions - (knowledge of texts for homereading). The test will contain ten questions (minimum pass level 66,6 %).
- Language of instruction
- Czech
- Follow-Up Courses
- Further comments (probably available only in Czech)
- Study Materials
The course is taught each semester. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics - Lecture
Faculty of ArtsAutumn 2014
- Extent and Intensity
- 1/0/0. 4 credit(s). Type of Completion: k (colloquium).
- Teacher(s)
- doc. PhDr. Klára Osolsobě, Dr. (lecturer)
- Guaranteed by
- doc. PhDr. Klára Osolsobě, Dr.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová
Supplier department: Department of Czech Language – Faculty of Arts - Timetable
- Wed 14:10–14:55 U13
- Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
The capacity limit for the course is 50 student(s).
Current registration and enrolment status: enrolled: 0/50, only registered: 0/50, only registered with preference (fields directly associated with the programme): 0/50 - fields of study / plans the course is directly associated with
- there are 9 fields of study the course is directly associated with, display
- Course objectives
- The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed: 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU.
- Syllabus
- 1. Corpus Linguistics - History (ÚČNK) 2. Building Corpora 3. Corpora of ČNK 4. Automatical Morphological Analysis (tokenization, tagging, disambiguation) 5. Some problems of Automatical Morphological Analysis 6. Spoken Language Corpora 7. Corpus of Private Corespondence 8. Corpus Manager 9. Quantitative Data 10. Diachrony and Corpora
- Literature
- Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
- http://ufal.mff.cuni.cz/pdt2.0/index-cz.html
- http://ucnk.ff.cuni.cz/
- Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
- Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
- Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
- Encyklopedický slovník češtiny. Edited by Petr Karlík - Marek Nekula - Jana Pleskalová. Praha: Nakladatelství Lidové noviny, 2002, 604 s. ISBN 80-7106-484-X. info
- Studie z korpusové lingvistiky. Edited by František Čermák - Jana Klímová - Vladimír Petkevič. Vyd. 1. V Praze: Karolinum, 2000, 531 s. ISBN 807184893X. info
- MCENERY, Tony and Andrew WILSON. Corpus linguistics. Edinburgh: Edinburgh University Press, 1996, 209 s. ISBN 0-7486-0482-0. info
- BARNBROOK, Geoff. Language and computers :a practical introduction to the computer analysis of language. Edinburgh: Edinburgh University Press, 1996, ix, 209 s. ISBN 0-7486-0785-4. info
- Teaching methods
- A lecture with corpora and corpora tools presentation. Homereading.
- Assessment methods
- Colloquium. Written test: terminology, definitions - (knowledge of texts for homereading). The test will contain ten questions (minimum pass level 66,6 %).
- Language of instruction
- Czech
- Follow-Up Courses
- Further comments (probably available only in Czech)
- Study Materials
The course is taught each semester.
General note: Předmět si nezapisují studenti, kteří již v minulosti absolvovali předmět CJBB105 Úvod do korpusové lingvistiky - přednáška. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics - lecture
Faculty of ArtsSpring 2014
- Extent and Intensity
- 1/0/0. 4 credit(s). Type of Completion: k (colloquium).
- Teacher(s)
- doc. PhDr. Klára Osolsobě, Dr. (lecturer)
- Guaranteed by
- doc. PhDr. Klára Osolsobě, Dr.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová
Supplier department: Department of Czech Language – Faculty of Arts - Timetable
- Wed 14:10–14:55 U4
- Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
The capacity limit for the course is 60 student(s).
Current registration and enrolment status: enrolled: 0/60, only registered: 0/60, only registered with preference (fields directly associated with the programme): 0/60 - fields of study / plans the course is directly associated with
- there are 8 fields of study the course is directly associated with, display
- Course objectives
- The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed: 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU.
- Syllabus
- 1. Corpus Linguistics - History (ÚČNK) 2. Building Corpora 3. Corpora of ČNK 4. Automatical Morphological Analysis (tokenization, tagging, disambiguation) 5. Some problems of Automatical Morphological Analysis 6. Spoken Language Corpora 7. Corpus of Private Corespondence 8. Corpus Manager 9. Quantitative Data 10. Diachrony and Corpora
- Literature
- Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
- http://ufal.mff.cuni.cz/pdt2.0/index-cz.html
- Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
- Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
- http://ucnk.ff.cuni.cz/
- Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
- Encyklopedický slovník češtiny. Edited by Petr Karlík - Marek Nekula - Jana Pleskalová. Praha: Nakladatelství Lidové noviny, 2002, 604 s. ISBN 80-7106-484-X. info
- Studie z korpusové lingvistiky. Edited by František Čermák - Jana Klímová - Vladimír Petkevič. Vyd. 1. V Praze: Karolinum, 2000, 531 s. ISBN 807184893X. info
- MCENERY, Tony and Andrew WILSON. Corpus linguistics. Edinburgh: Edinburgh University Press, 1996, 209 s. ISBN 0-7486-0482-0. info
- BARNBROOK, Geoff. Language and computers :a practical introduction to the computer analysis of language. Edinburgh: Edinburgh University Press, 1996, ix, 209 s. ISBN 0-7486-0785-4. info
- Teaching methods
- A lecture with corpora and corpora tools presentation. Homereading.
- Assessment methods
- Colloquium. Written test: terminology, definitions - (knowledge of texts for homereading). The test will contain ten questions (minimum pass level 66,6 %).
- Language of instruction
- Czech
- Follow-Up Courses
- Further comments (probably available only in Czech)
- Study Materials
The course is taught each semester. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics - lecture
Faculty of ArtsAutumn 2013
- Extent and Intensity
- 1/0/0. 4 credit(s). Type of Completion: k (colloquium).
- Teacher(s)
- doc. PhDr. Klára Osolsobě, Dr. (lecturer)
- Guaranteed by
- doc. PhDr. Klára Osolsobě, Dr.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová
Supplier department: Department of Czech Language – Faculty of Arts - Timetable
- Wed 8:20–9:05 C11
- Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
The capacity limit for the course is 40 student(s).
Current registration and enrolment status: enrolled: 0/40, only registered: 0/40, only registered with preference (fields directly associated with the programme): 0/40 - fields of study / plans the course is directly associated with
- there are 8 fields of study the course is directly associated with, display
- Course objectives
- The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed: 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU.
- Syllabus
- 1. Corpus Linguistics - History (ÚČNK) 2. Building Corpora 3. Corpora of ČNK 4. Automatical Morphological Analysis (tokenization, tagging, disambiguation) 5. Some problems of Automatical Morphological Analysis 6. Spoken Language Corpora 7. Corpus of Private Corespondence 8. Corpus Manager 9. Quantitative Data 10. Diachrony and Corpora
- Literature
- Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
- Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
- Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
- Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
- http://ufal.mff.cuni.cz/pdt2.0/index-cz.html
- http://ucnk.ff.cuni.cz/
- Encyklopedický slovník češtiny. Edited by Petr Karlík - Marek Nekula - Jana Pleskalová. Praha: Nakladatelství Lidové noviny, 2002, 604 s. ISBN 80-7106-484-X. info
- Studie z korpusové lingvistiky. Edited by František Čermák - Jana Klímová - Vladimír Petkevič. Vyd. 1. V Praze: Karolinum, 2000, 531 s. ISBN 807184893X. info
- MCENERY, Tony and Andrew WILSON. Corpus linguistics. Edinburgh: Edinburgh University Press, 1996, 209 s. ISBN 0-7486-0482-0. info
- BARNBROOK, Geoff. Language and computers :a practical introduction to the computer analysis of language. Edinburgh: Edinburgh University Press, 1996, ix, 209 s. ISBN 0-7486-0785-4. info
- Teaching methods
- A lecture with corpora and corpora tools presentation. Homereading.
- Assessment methods
- Colloquium. Written test: terminology, definitions - (knowledge of texts for homereading). The test will contain ten questions (minimum pass level 66,6 %).
- Language of instruction
- Czech
- Follow-Up Courses
- Further comments (probably available only in Czech)
- Study Materials
The course is taught each semester.
General note: Předmět si nezapisují studenti, kteří již v minulosti absolvovali předmět CJBB105 Úvod do korpusové lingvistiky - přednáška. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics - lecture
Faculty of ArtsSpring 2013
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: k (colloquium).
- Teacher(s)
- doc. PhDr. Klára Osolsobě, Dr. (lecturer)
- Guaranteed by
- doc. PhDr. Klára Osolsobě, Dr.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová
Supplier department: Department of Czech Language – Faculty of Arts - Timetable
- Thu 9:10–10:45 VP
- Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
- fields of study / plans the course is directly associated with
- there are 8 fields of study the course is directly associated with, display
- Course objectives
- The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed: 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU
- Syllabus
- 1)Mathematical linguistics 2) Corpus linguistics - history 3) What is a corpus and what is in it? 4) Quantitative data 5) The use of corpora in language studies 6) Corpora and computational linguistics 7) Corpus managers 8) Part of speech analysis and tagging of a corpus 9) Czech national corpus 10) Corpora at MU 11) Tagging tools at MU 12)PDT
- Literature
- Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
- Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
- http://ucnk.ff.cuni.cz/
- Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
- http://ufal.mff.cuni.cz/pdt2.0/index-cz.html
- Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
- Encyklopedický slovník češtiny. Edited by Petr Karlík - Marek Nekula - Jana Pleskalová. Praha: Nakladatelství Lidové noviny, 2002, 604 s. ISBN 80-7106-484-X. info
- Studie z korpusové lingvistiky. Edited by František Čermák - Jana Klímová - Vladimír Petkevič. Vyd. 1. V Praze: Karolinum, 2000, 531 s. ISBN 807184893X. info
- MCENERY, Tony and Andrew WILSON. Corpus linguistics. Edinburgh: Edinburgh University Press, 1996, 209 s. ISBN 0-7486-0482-0. info
- BARNBROOK, Geoff. Language and computers :a practical introduction to the computer analysis of language. Edinburgh: Edinburgh University Press, 1996, ix, 209 s. ISBN 0-7486-0785-4. info
- Teaching methods
- Reading, tutorial.
- Assessment methods
- Individual text studium and consult. Written test : terminology, definitions - (knowledge of entered texts); colloquium.
- Language of instruction
- Czech
- Follow-Up Courses
- Further comments (probably available only in Czech)
- Study Materials
The course is taught each semester. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics - lecture
Faculty of ArtsAutumn 2012
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: k (colloquium).
- Teacher(s)
- doc. PhDr. Klára Osolsobě, Dr. (lecturer)
- Guaranteed by
- doc. PhDr. Klára Osolsobě, Dr.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová
Supplier department: Department of Czech Language – Faculty of Arts - Timetable
- Wed 9:10–10:45 VP, Wed 10:50–12:25 VP
- Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
- fields of study / plans the course is directly associated with
- there are 8 fields of study the course is directly associated with, display
- Course objectives
- The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed: 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU
- Syllabus
- 1)Mathematical linguistics 2) Corpus linguistics - history 3) What is a corpus and what is in it? 4) Quantitative data 5) The use of corpora in language studies 6) Corpora and computational linguistics 7) Corpus managers 8) Part of speech analysis and tagging of a corpus 9) Czech national corpus 10) Corpora at MU 11) Tagging tools at MU 12)PDT
- Literature
- Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
- http://ufal.mff.cuni.cz/pdt2.0/index-cz.html
- Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
- http://ucnk.ff.cuni.cz/
- Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
- Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
- Encyklopedický slovník češtiny. Edited by Petr Karlík - Marek Nekula - Jana Pleskalová. Praha: Nakladatelství Lidové noviny, 2002, 604 s. ISBN 80-7106-484-X. info
- Studie z korpusové lingvistiky. Edited by František Čermák - Jana Klímová - Vladimír Petkevič. Vyd. 1. V Praze: Karolinum, 2000, 531 s. ISBN 807184893X. info
- MCENERY, Tony and Andrew WILSON. Corpus linguistics. Edinburgh: Edinburgh University Press, 1996, 209 s. ISBN 0-7486-0482-0. info
- BARNBROOK, Geoff. Language and computers :a practical introduction to the computer analysis of language. Edinburgh: Edinburgh University Press, 1996, ix, 209 s. ISBN 0-7486-0785-4. info
- Teaching methods
- Reading, tutorial.
- Assessment methods
- Individual text studium and consult. Written test : terminology, definitions - (knowledge of entered texts); colloquium.
- Language of instruction
- Czech
- Follow-Up Courses
- Further comments (probably available only in Czech)
- Study Materials
The course is taught each semester.
General note: Jedná se o inovovaný předmět pod dřívějším názvem Úvod do korpusové lingvistiky - přednáška. Nezapisují si ho studenti, kteří již v minulosti absolvovali předmět CJBB105 Úvod do korpusové lingvistiky - přednáška. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics - lecture
Faculty of ArtsSpring 2012
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: k (colloquium).
- Teacher(s)
- doc. PhDr. Klára Osolsobě, Dr. (lecturer)
- Guaranteed by
- doc. PhDr. Klára Osolsobě, Dr.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová
Supplier department: Department of Czech Language – Faculty of Arts - Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
- fields of study / plans the course is directly associated with
- there are 8 fields of study the course is directly associated with, display
- Course objectives
- The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed: 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU
- Syllabus
- 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU
- Literature
- Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
- Čermák F., Klímová J., Petkevič V. (eds.) (2000): Studie z korpusové lingvistiky , Praha: FF UK.
- Burnard L. (1993): A Gentle Introduction to XML.
- Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
- McEnery A., Wilson A. (1996): Corpus Linguistics. Edinburgh University Press, Edinburgh.
- Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
- Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
- http://ucnk.ff.cuni.cz/
- Karlík P., Nekula M., Pleskalová J. (eds.) (2002): Encyklopedický slovník češtiny. Praha : Nakladatelství Lidové noviny.
- Barnbrook G. (1996): Language and Computers. Edinburgh University Press, Edinburgh. Boguraev B., Briscoe T. (1989): Computational Lexicography for Natural Language Processing. Longman, London - New York.
- Teaching methods
- Reading, tutorial.
- Assessment methods
- Individual text studium and consult. Written test : terminology, definitions - (knowledge of entered texts); colloquium.
- Language of instruction
- Czech
- Follow-Up Courses
- Further Comments
- Study Materials
The course is taught each semester.
The course is taught: every week. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics - lecture
Faculty of ArtsAutumn 2011
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: k (colloquium).
- Teacher(s)
- doc. PhDr. Klára Osolsobě, Dr. (lecturer)
- Guaranteed by
- doc. PhDr. Klára Osolsobě, Dr.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová - Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
- fields of study / plans the course is directly associated with
- there are 8 fields of study the course is directly associated with, display
- Course objectives
- The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed: 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU
- Syllabus
- 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU
- Literature
- Karlík P., Nekula M., Pleskalová J. (eds.) (2002): Encyklopedický slovník češtiny. Praha : Nakladatelství Lidové noviny.
- http://ucnk.ff.cuni.cz/
- Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
- Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
- Čermák F., Klímová J., Petkevič V. (eds.) (2000): Studie z korpusové lingvistiky , Praha: FF UK.
- Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
- Barnbrook G. (1996): Language and Computers. Edinburgh University Press, Edinburgh. Boguraev B., Briscoe T. (1989): Computational Lexicography for Natural Language Processing. Longman, London - New York.
- Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
- McEnery A., Wilson A. (1996): Corpus Linguistics. Edinburgh University Press, Edinburgh.
- Burnard L. (1993): A Gentle Introduction to XML.
- Teaching methods
- Reading, tutorial.
- Assessment methods
- Individual text studium and consult. Written test : terminology, definitions - (knowledge of entered texts); colloquium.
- Language of instruction
- Czech
- Follow-Up Courses
- Further comments (probably available only in Czech)
- Study Materials
The course is taught each semester.
The course is taught: every week.
General note: Jedná se o inovovaný předmět pod dřívějším názvem Úvod do korpusové lingvistiky - přednáška. Nezapisují si ho studenti, kteří již v minulosti absolvovali předmět CJBB105 Úvod do korpusové lingvistiky - přednáška. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics - lecture
Faculty of ArtsSpring 2011
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: k (colloquium).
- Teacher(s)
- doc. PhDr. Klára Osolsobě, Dr. (lecturer)
- Guaranteed by
- doc. PhDr. Klára Osolsobě, Dr.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová - Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
- fields of study / plans the course is directly associated with
- there are 11 fields of study the course is directly associated with, display
- Course objectives
- The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed: 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU
- Syllabus
- 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU
- Literature
- Barnbrook G. (1996): Language and Computers. Edinburgh University Press, Edinburgh. Boguraev B., Briscoe T. (1989): Computational Lexicography for Natural Language Processing. Longman, London - New York.
- Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
- Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
- http://ucnk.ff.cuni.cz/
- Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
- McEnery A., Wilson A. (1996): Corpus Linguistics. Edinburgh University Press, Edinburgh.
- Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
- Čermák F., Klímová J., Petkevič V. (eds.) (2000): Studie z korpusové lingvistiky , Praha: FF UK.
- Burnard L. (1993): A Gentle Introduction to XML.
- Karlík P., Nekula M., Pleskalová J. (eds.) (2002): Encyklopedický slovník češtiny. Praha : Nakladatelství Lidové noviny.
- Teaching methods
- Reading, tutorial.
- Assessment methods
- Individual text studium and consult. Written test : terminology, definitions - (knowledge of entered texts); colloquium.
- Language of instruction
- Czech
- Follow-Up Courses
- Further Comments
- Study Materials
The course is taught each semester.
The course is taught: every week. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics - lecture
Faculty of ArtsAutumn 2010
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: k (colloquium).
- Teacher(s)
- doc. PhDr. Klára Osolsobě, Dr. (lecturer)
- Guaranteed by
- doc. PhDr. Klára Osolsobě, Dr.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová - Timetable
- Thu 11:40–13:15 B12
- Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
- fields of study / plans the course is directly associated with
- there are 11 fields of study the course is directly associated with, display
- Course objectives
- The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed: 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU
- Syllabus
- 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU
- Literature
- http://ucnk.ff.cuni.cz/
- Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
- Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
- Barnbrook G. (1996): Language and Computers. Edinburgh University Press, Edinburgh. Boguraev B., Briscoe T. (1989): Computational Lexicography for Natural Language Processing. Longman, London - New York.
- Karlík P., Nekula M., Pleskalová J. (eds.) (2002): Encyklopedický slovník češtiny. Praha : Nakladatelství Lidové noviny.
- Burnard L. (1993): A Gentle Introduction to XML.
- Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
- McEnery A., Wilson A. (1996): Corpus Linguistics. Edinburgh University Press, Edinburgh.
- Čermák F., Klímová J., Petkevič V. (eds.) (2000): Studie z korpusové lingvistiky , Praha: FF UK.
- Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
- Teaching methods
- Reading, tutorial.
- Assessment methods
- Individual text studium and consult. Written test : terminology, definitions - (knowledge of entered texts); colloquium.
- Language of instruction
- Czech
- Follow-Up Courses
- Further comments (probably available only in Czech)
- Study Materials
The course is taught each semester.
General note: Jedná se o inovovaný předmět pod dřívějším názvem Úvod do korpusové lingvistiky - přednáška. Nezapisují si ho studenti, kteří již v minulosti absolvovali předmět CJBB105 Úvod do korpusové lingvistiky - přednáška. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics - lecture
Faculty of ArtsSpring 2010
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: k (colloquium).
- Teacher(s)
- doc. PhDr. Klára Osolsobě, Dr. (lecturer)
- Guaranteed by
- doc. PhDr. Klára Osolsobě, Dr.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová - Timetable
- Fri 11:40–13:15 A43 stara
- Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
- fields of study / plans the course is directly associated with
- there are 10 fields of study the course is directly associated with, display
- Course objectives
- The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed: 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU
- Syllabus
- 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU
- Literature
- Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
- Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
- McEnery A., Wilson A. (1996): Corpus Linguistics. Edinburgh University Press, Edinburgh.
- Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
- Burnard L. (1993): A Gentle Introduction to XML.
- Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
- Barnbrook G. (1996): Language and Computers. Edinburgh University Press, Edinburgh. Boguraev B., Briscoe T. (1989): Computational Lexicography for Natural Language Processing. Longman, London - New York.
- http://ucnk.ff.cuni.cz/
- Karlík P., Nekula M., Pleskalová J. (eds.) (2002): Encyklopedický slovník češtiny. Praha : Nakladatelství Lidové noviny.
- Čermák F., Klímová J., Petkevič V. (eds.) (2000): Studie z korpusové lingvistiky , Praha: FF UK.
- Teaching methods
- reading, tutorial
- Assessment methods
- Individual text studium and consult. Written test : terminology, definitions - (knowledge of entered texts); colloquium
- Language of instruction
- Czech
- Follow-Up Courses
- Further comments (probably available only in Czech)
- Study Materials
The course is taught each semester. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics - lecture
Faculty of ArtsAutumn 2009
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: k (colloquium).
- Teacher(s)
- doc. PhDr. Klára Osolsobě, Dr. (lecturer)
- Guaranteed by
- doc. PhDr. Klára Osolsobě, Dr.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová - Timetable
- Fri 8:20–9:55 zruseno C21
- Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
- fields of study / plans the course is directly associated with
- there are 10 fields of study the course is directly associated with, display
- Course objectives
- The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed: 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU
- Syllabus
- 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU
- Literature
- Karlík P., Nekula M., Pleskalová J. (eds.) (2002): Encyklopedický slovník češtiny. Praha : Nakladatelství Lidové noviny.
- http://ucnk.ff.cuni.cz/
- Burnard L. (1993): A Gentle Introduction to XML.
- Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
- Čermák F., Klímová J., Petkevič V. (eds.) (2000): Studie z korpusové lingvistiky , Praha: FF UK.
- Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
- Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
- McEnery A., Wilson A. (1996): Corpus Linguistics. Edinburgh University Press, Edinburgh.
- Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
- Barnbrook G. (1996): Language and Computers. Edinburgh University Press, Edinburgh. Boguraev B., Briscoe T. (1989): Computational Lexicography for Natural Language Processing. Longman, London - New York.
- Teaching methods
- reading, tutorial
- Assessment methods
- Individual text studium and consult. Written test : terminology, definitions - (knowledge of entered texts); colloquium
- Language of instruction
- Czech
- Follow-Up Courses
- Further comments (probably available only in Czech)
- Study Materials
The course is taught each semester. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics - lecture
Faculty of ArtsSpring 2009
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: k (colloquium).
- Teacher(s)
- doc. PhDr. Klára Osolsobě, Dr. (lecturer)
- Guaranteed by
- doc. PhDr. Klára Osolsobě, Dr.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová - Timetable
- Fri 8:20–9:55 zruseno D31
- Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
- fields of study / plans the course is directly associated with
- there are 10 fields of study the course is directly associated with, display
- Course objectives
- The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed: 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU
- Syllabus
- 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU
- Literature
- Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
- Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
- Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
- Čermák F., Klímová J., Petkevič V. (eds.) (2000): Studie z korpusové lingvistiky , Praha: FF UK.
- http://ucnk.ff.cuni.cz/
- Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
- Burnard L. (1993): A Gentle Introduction to XML.
- McEnery A., Wilson A. (1996): Corpus Linguistics. Edinburgh University Press, Edinburgh.
- Karlík P., Nekula M., Pleskalová J. (eds.) (2002): Encyklopedický slovník češtiny. Praha : Nakladatelství Lidové noviny.
- Barnbrook G. (1996): Language and Computers. Edinburgh University Press, Edinburgh. Boguraev B., Briscoe T. (1989): Computational Lexicography for Natural Language Processing. Longman, London - New York.
- Assessment methods
- Individual text studium and consult. Written test : terminology, definitions - (knowledge of entered texts); colloquium
- Language of instruction
- Czech
- Follow-Up Courses
- Further comments (probably available only in Czech)
- Study Materials
The course is taught each semester. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics - lecture
Faculty of ArtsAutumn 2008
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: k (colloquium).
- Teacher(s)
- doc. PhDr. Klára Osolsobě, Dr. (lecturer)
- Guaranteed by
- doc. PhDr. Klára Osolsobě, Dr.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová - Timetable
- Fri 8:20–9:55 zruseno D31
- Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
- fields of study / plans the course is directly associated with
- there are 10 fields of study the course is directly associated with, display
- Course objectives
- The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed: 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU
- Syllabus
- 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU
- Literature
- McEnery A., Wilson A. (1996): Corpus Linguistics. Edinburgh University Press, Edinburgh.
- http://ucnk.ff.cuni.cz/
- Čermák F., Klímová J., Petkevič V. (eds.) (2000): Studie z korpusové lingvistiky , Praha: FF UK.
- Burnard L. (1993): A Gentle Introduction to XML.
- Barnbrook G. (1996): Language and Computers. Edinburgh University Press, Edinburgh. Boguraev B., Briscoe T. (1989): Computational Lexicography for Natural Language Processing. Longman, London - New York.
- Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
- Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
- Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
- Karlík P., Nekula M., Pleskalová J. (eds.) (2002): Encyklopedický slovník češtiny. Praha : Nakladatelství Lidové noviny.
- Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
- Assessment methods
- Individual text studium and consult. Written test : terminology, definitions - (knowledge of entered texts); colloquium
- Language of instruction
- Czech
- Follow-Up Courses
- Further comments (probably available only in Czech)
- The course is taught each semester.
- Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics - lecture
Faculty of ArtsSpring 2008
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: k (colloquium).
- Teacher(s)
- doc. PhDr. Klára Osolsobě, Dr. (lecturer)
- Guaranteed by
- doc. PhDr. Klára Osolsobě, Dr.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová - Timetable
- Thu 8:20–9:55 zruseno C21
- Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
- fields of study / plans the course is directly associated with
- there are 10 fields of study the course is directly associated with, display
- Course objectives
- The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed: 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU
- Syllabus
- 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU
- Literature
- McEnery A., Wilson A. (1996): Corpus Linguistics. Edinburgh University Press, Edinburgh.
- Barnbrook G. (1996): Language and Computers. Edinburgh University Press, Edinburgh. Boguraev B., Briscoe T. (1989): Computational Lexicography for Natural Language Processing. Longman, London - New York.
- Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
- Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
- Čermák F., Klímová J., Petkevič V. (eds.) (2000): Studie z korpusové lingvistiky , Praha: FF UK.
- Karlík P., Nekula M., Pleskalová J. (eds.) (2002): Encyklopedický slovník češtiny. Praha : Nakladatelství Lidové noviny.
- Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
- Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
- http://ucnk.ff.cuni.cz/
- Burnard L. (1993): A Gentle Introduction to XML.
- Assessment methods (in Czech)
- Výuka probíhá formou pravidelných přednášek. Kolokvium: zvládnutí základního pojmosloví oboru a problematiky probírané v přednáškách; znalost základní literatury oboru.
- Language of instruction
- Czech
- Follow-Up Courses
- Further comments (probably available only in Czech)
- Study Materials
The course is taught each semester. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics - lecture
Faculty of ArtsAutumn 2007
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: k (colloquium).
- Teacher(s)
- doc. PhDr. Klára Osolsobě, Dr. (lecturer)
- Guaranteed by
- doc. PhDr. Klára Osolsobě, Dr.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová - Timetable
- Tue 10:00–11:35 VP
- Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
- fields of study / plans the course is directly associated with
- there are 10 fields of study the course is directly associated with, display
- Language of instruction
- Czech
- Further Comments
- Study Materials
The course is taught each semester. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics - lecture
Faculty of ArtsSpring 2007
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: k (colloquium).
- Teacher(s)
- doc. PhDr. Klára Osolsobě, Dr. (lecturer)
- Guaranteed by
- doc. PhDr. Klára Osolsobě, Dr.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová - Timetable
- Tue 8:20–9:55 zruseno D31
- Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
- fields of study / plans the course is directly associated with
- there are 10 fields of study the course is directly associated with, display
- Course objectives
- The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed: 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU
- Syllabus
- 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU
- Literature
- McEnery A., Wilson A. (1996): Corpus Linguistics. Edinburgh University Press, Edinburgh.
- Barnbrook G. (1996): Language and Computers. Edinburgh University Press, Edinburgh. Boguraev B., Briscoe T. (1989): Computational Lexicography for Natural Language Processing. Longman, London - New York.
- Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
- Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
- Čermák F., Klímová J., Petkevič V. (eds.) (2000): Studie z korpusové lingvistiky , Praha: FF UK.
- Karlík P., Nekula M., Pleskalová J. (eds.) (2002): Encyklopedický slovník češtiny. Praha : Nakladatelství Lidové noviny.
- Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
- Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
- http://ucnk.ff.cuni.cz/
- Burnard L. (1993): A Gentle Introduction to XML.
- Assessment methods (in Czech)
- Výuka probíhá formou pravidelných přednášek. Kolokvium: zvládnutí základního pojmosloví oboru a problematiky probírané v přednáškách; znalost základní literatury oboru.
- Language of instruction
- Czech
- Follow-Up Courses
- Further comments (probably available only in Czech)
- Study Materials
The course is taught each semester. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics - lecture
Faculty of ArtsAutumn 2006
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: k (colloquium).
- Teacher(s)
- doc. PhDr. Klára Osolsobě, Dr. (lecturer)
- Guaranteed by
- doc. PhDr. Klára Osolsobě, Dr.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová - Timetable
- Tue 11:40–13:15 A49
- Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
- fields of study / plans the course is directly associated with
- there are 10 fields of study the course is directly associated with, display
- Language of instruction
- Czech
- Further Comments
- Study Materials
The course is taught each semester. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics - lecture
Faculty of ArtsSpring 2006
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: k (colloquium).
- Teacher(s)
- doc. PhDr. Klára Osolsobě, Dr. (lecturer)
- Guaranteed by
- doc. PhDr. Klára Osolsobě, Dr.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová - Timetable
- Thu 13:20–14:55 N01023
- Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
- fields of study / plans the course is directly associated with
- there are 10 fields of study the course is directly associated with, display
- Course objectives
- The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed: 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU
- Syllabus
- 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU
- Literature
- McEnery A., Wilson A. (1996): Corpus Linguistics. Edinburgh University Press, Edinburgh.
- Barnbrook G. (1996): Language and Computers. Edinburgh University Press, Edinburgh. Boguraev B., Briscoe T. (1989): Computational Lexicography for Natural Language Processing. Longman, London - New York.
- Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
- Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
- Čermák F., Klímová J., Petkevič V. (eds.) (2000): Studie z korpusové lingvistiky , Praha: FF UK.
- Karlík P., Nekula M., Pleskalová J. (eds.) (2002): Encyklopedický slovník češtiny. Praha : Nakladatelství Lidové noviny.
- Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
- Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
- http://ucnk.ff.cuni.cz/
- Burnard L. (1993): A Gentle Introduction to XML.
- Assessment methods (in Czech)
- Výuka probíhá formou pravidelných přednášek. Kolokvium: zvládnutí základního pojmosloví oboru a problematiky probírané v přednáškách; znalost základní literatury oboru.
- Language of instruction
- Czech
- Follow-Up Courses
- Further comments (probably available only in Czech)
- Study Materials
The course is taught each semester. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics – Lecture
Faculty of ArtsAutumn 2024
The course is not taught in Autumn 2024
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: zk (examination).
- Teacher(s)
- Mgr. Dana Hlaváčková, Ph.D. (lecturer)
- Guaranteed by
- Mgr. Dana Hlaváčková, Ph.D.
Department of Czech Language – Faculty of Arts
Contact Person: Bc. Silvie Hulewicz, DiS.
Supplier department: Department of Czech Language – Faculty of Arts - Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
The capacity limit for the course is 40 student(s).
Current registration and enrolment status: enrolled: 0/40, only registered: 0/40, only registered with preference (fields directly associated with the programme): 0/40 - fields of study / plans the course is directly associated with
- there are 14 fields of study the course is directly associated with, display
- Course objectives
- The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed:
1) Corpus linguistics - history.
2) What is a corpus and what is in it?
3) Quantitative data.
4) The use of corpora in language studies.
5) Corpora and computational linguistics.
6) Corpus managers.
7) Part of speech analysis and tagging of a corpus.
8) Czech national corpus.
9) Corpora at MU. - Learning outcomes
- Upon completion of the course the student will be able to:
- understand the issues of corpus linguistics,
- understand the basic terminology of the field and use it,
- orient themself in the corpus typology,
- know the possibilities of using corpora. - Syllabus
- 1. Corpus Linguistics - History (ÚČNK).
- 2. Building Corpora.
- 3. Corpora of ČNK.
- 4. Automatical Morphological Analysis (tokenization, tagging, disambiguation).
- 5. Some problems of Automatical Morphological Analysis. 6. Spoken Language Corpora.
- 7. Corpus of Private Corespondence.
- 8. Corpus Manager.
- 9. Quantitative Data.
- 10. Diachrony and Corpora.
- Literature
- Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
- http://ufal.mff.cuni.cz/pdt2.0/index-cz.html
- Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
- Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
- Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
- http://ucnk.ff.cuni.cz/
- Encyklopedický slovník češtiny. Edited by Petr Karlík - Marek Nekula - Jana Pleskalová. Praha: Nakladatelství Lidové noviny, 2002, 604 s. ISBN 80-7106-484-X. info
- Studie z korpusové lingvistiky. Edited by František Čermák - Jana Klímová - Vladimír Petkevič. Vyd. 1. V Praze: Karolinum, 2000, 531 s. ISBN 807184893X. info
- MCENERY, Tony and Andrew WILSON. Corpus linguistics. Edinburgh: Edinburgh University Press, 1996, 209 s. ISBN 0-7486-0482-0. info
- BARNBROOK, Geoff. Language and computers :a practical introduction to the computer analysis of language. Edinburgh: Edinburgh University Press, 1996, ix, 209 s. ISBN 0-7486-0785-4. info
- Teaching methods
- A lecture with corpora and corpora tools presentation. Homereading.
- Assessment methods
- Colloquium. Written test: terminology, definitions - (knowledge of texts for homereading). The test will contain ten questions.
- Language of instruction
- Czech
- Follow-Up Courses
- Further Comments
- The course is taught annually.
The course is taught: every week. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics – Lecture
Faculty of ArtsAutumn 2023
The course is not taught in Autumn 2023
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: zk (examination).
- Teacher(s)
- Mgr. Dana Hlaváčková, Ph.D. (lecturer)
- Guaranteed by
- Mgr. Dana Hlaváčková, Ph.D.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová
Supplier department: Department of Czech Language – Faculty of Arts - Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
The capacity limit for the course is 40 student(s).
Current registration and enrolment status: enrolled: 0/40, only registered: 0/40, only registered with preference (fields directly associated with the programme): 0/40 - fields of study / plans the course is directly associated with
- there are 14 fields of study the course is directly associated with, display
- Course objectives
- The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed:
1) Corpus linguistics - history.
2) What is a corpus and what is in it?
3) Quantitative data.
4) The use of corpora in language studies.
5) Corpora and computational linguistics.
6) Corpus managers.
7) Part of speech analysis and tagging of a corpus.
8) Czech national corpus.
9) Corpora at MU. - Learning outcomes
- Upon completion of the course the student will be able to:
- understand the issues of corpus linguistics,
- understand the basic terminology of the field and use it,
- orient themself in the corpus typology,
- know the possibilities of using corpora. - Syllabus
- 1. Corpus Linguistics - History (ÚČNK).
- 2. Building Corpora.
- 3. Corpora of ČNK.
- 4. Automatical Morphological Analysis (tokenization, tagging, disambiguation).
- 5. Some problems of Automatical Morphological Analysis. 6. Spoken Language Corpora.
- 7. Corpus of Private Corespondence.
- 8. Corpus Manager.
- 9. Quantitative Data.
- 10. Diachrony and Corpora.
- Literature
- Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
- http://ufal.mff.cuni.cz/pdt2.0/index-cz.html
- Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
- Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
- Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
- http://ucnk.ff.cuni.cz/
- Encyklopedický slovník češtiny. Edited by Petr Karlík - Marek Nekula - Jana Pleskalová. Praha: Nakladatelství Lidové noviny, 2002, 604 s. ISBN 80-7106-484-X. info
- Studie z korpusové lingvistiky. Edited by František Čermák - Jana Klímová - Vladimír Petkevič. Vyd. 1. V Praze: Karolinum, 2000, 531 s. ISBN 807184893X. info
- MCENERY, Tony and Andrew WILSON. Corpus linguistics. Edinburgh: Edinburgh University Press, 1996, 209 s. ISBN 0-7486-0482-0. info
- BARNBROOK, Geoff. Language and computers :a practical introduction to the computer analysis of language. Edinburgh: Edinburgh University Press, 1996, ix, 209 s. ISBN 0-7486-0785-4. info
- Teaching methods
- A lecture with corpora and corpora tools presentation. Homereading.
- Assessment methods
- Colloquium. Written test: terminology, definitions - (knowledge of texts for homereading). The test will contain ten questions.
- Language of instruction
- Czech
- Follow-Up Courses
- Further Comments
- The course is taught annually.
The course is taught: every week. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics – Lecture
Faculty of ArtsAutumn 2022
The course is not taught in Autumn 2022
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: zk (examination).
- Teacher(s)
- Mgr. Dana Hlaváčková, Ph.D. (lecturer)
- Guaranteed by
- Mgr. Dana Hlaváčková, Ph.D.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová
Supplier department: Department of Czech Language – Faculty of Arts - Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
The capacity limit for the course is 40 student(s).
Current registration and enrolment status: enrolled: 0/40, only registered: 0/40, only registered with preference (fields directly associated with the programme): 0/40 - fields of study / plans the course is directly associated with
- there are 14 fields of study the course is directly associated with, display
- Course objectives
- The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed:
1) Corpus linguistics - history.
2) What is a corpus and what is in it?
3) Quantitative data.
4) The use of corpora in language studies.
5) Corpora and computational linguistics.
6) Corpus managers.
7) Part of speech analysis and tagging of a corpus.
8) Czech national corpus.
9) Corpora at MU. - Learning outcomes
- Upon completion of the course the student will be able to:
- understand the issues of corpus linguistics,
- understand the basic terminology of the field and use it,
- orient themself in the corpus typology,
- know the possibilities of using corpora. - Syllabus
- 1. Corpus Linguistics - History (ÚČNK).
- 2. Building Corpora.
- 3. Corpora of ČNK.
- 4. Automatical Morphological Analysis (tokenization, tagging, disambiguation).
- 5. Some problems of Automatical Morphological Analysis. 6. Spoken Language Corpora.
- 7. Corpus of Private Corespondence.
- 8. Corpus Manager.
- 9. Quantitative Data.
- 10. Diachrony and Corpora.
- Literature
- Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
- http://ufal.mff.cuni.cz/pdt2.0/index-cz.html
- Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
- Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
- Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
- http://ucnk.ff.cuni.cz/
- Encyklopedický slovník češtiny. Edited by Petr Karlík - Marek Nekula - Jana Pleskalová. Praha: Nakladatelství Lidové noviny, 2002, 604 s. ISBN 80-7106-484-X. info
- Studie z korpusové lingvistiky. Edited by František Čermák - Jana Klímová - Vladimír Petkevič. Vyd. 1. V Praze: Karolinum, 2000, 531 s. ISBN 807184893X. info
- MCENERY, Tony and Andrew WILSON. Corpus linguistics. Edinburgh: Edinburgh University Press, 1996, 209 s. ISBN 0-7486-0482-0. info
- BARNBROOK, Geoff. Language and computers :a practical introduction to the computer analysis of language. Edinburgh: Edinburgh University Press, 1996, ix, 209 s. ISBN 0-7486-0785-4. info
- Teaching methods
- A lecture with corpora and corpora tools presentation. Homereading.
- Assessment methods
- Colloquium. Written test: terminology, definitions - (knowledge of texts for homereading). The test will contain ten questions.
- Language of instruction
- Czech
- Follow-Up Courses
- Further Comments
- The course is taught annually.
The course is taught: every week. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics – Lecture
Faculty of ArtsAutumn 2021
The course is not taught in Autumn 2021
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: zk (examination).
- Teacher(s)
- Mgr. Dana Hlaváčková, Ph.D. (lecturer)
- Guaranteed by
- Mgr. Dana Hlaváčková, Ph.D.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová
Supplier department: Department of Czech Language – Faculty of Arts - Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
The capacity limit for the course is 40 student(s).
Current registration and enrolment status: enrolled: 0/40, only registered: 0/40, only registered with preference (fields directly associated with the programme): 0/40 - fields of study / plans the course is directly associated with
- there are 14 fields of study the course is directly associated with, display
- Course objectives
- The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed:
1) Corpus linguistics - history.
2) What is a corpus and what is in it?
3) Quantitative data.
4) The use of corpora in language studies.
5) Corpora and computational linguistics.
6) Corpus managers.
7) Part of speech analysis and tagging of a corpus.
8) Czech national corpus.
9) Corpora at MU. - Learning outcomes
- Upon completion of the course the student will be able to:
- understand the issues of corpus linguistics,
- understand the basic terminology of the field and use it,
- orient themself in the corpus typology,
- know the possibilities of using corpora. - Syllabus
- 1. Corpus Linguistics - History (ÚČNK).
- 2. Building Corpora.
- 3. Corpora of ČNK.
- 4. Automatical Morphological Analysis (tokenization, tagging, disambiguation).
- 5. Some problems of Automatical Morphological Analysis. 6. Spoken Language Corpora.
- 7. Corpus of Private Corespondence.
- 8. Corpus Manager.
- 9. Quantitative Data.
- 10. Diachrony and Corpora.
- Literature
- Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
- http://ufal.mff.cuni.cz/pdt2.0/index-cz.html
- Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
- Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
- Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
- http://ucnk.ff.cuni.cz/
- Encyklopedický slovník češtiny. Edited by Petr Karlík - Marek Nekula - Jana Pleskalová. Praha: Nakladatelství Lidové noviny, 2002, 604 s. ISBN 80-7106-484-X. info
- Studie z korpusové lingvistiky. Edited by František Čermák - Jana Klímová - Vladimír Petkevič. Vyd. 1. V Praze: Karolinum, 2000, 531 s. ISBN 807184893X. info
- MCENERY, Tony and Andrew WILSON. Corpus linguistics. Edinburgh: Edinburgh University Press, 1996, 209 s. ISBN 0-7486-0482-0. info
- BARNBROOK, Geoff. Language and computers :a practical introduction to the computer analysis of language. Edinburgh: Edinburgh University Press, 1996, ix, 209 s. ISBN 0-7486-0785-4. info
- Teaching methods
- A lecture with corpora and corpora tools presentation. Homereading.
- Assessment methods
- Colloquium. Written test: terminology, definitions - (knowledge of texts for homereading). The test will contain ten questions.
- Language of instruction
- Czech
- Follow-Up Courses
- Further Comments
- The course is taught annually.
The course is taught: every week. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics – Lecture
Faculty of ArtsAutumn 2020
The course is not taught in Autumn 2020
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: zk (examination).
- Teacher(s)
- Mgr. Dana Hlaváčková, Ph.D. (lecturer)
- Guaranteed by
- Mgr. Dana Hlaváčková, Ph.D.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová
Supplier department: Department of Czech Language – Faculty of Arts - Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
The capacity limit for the course is 40 student(s).
Current registration and enrolment status: enrolled: 0/40, only registered: 0/40, only registered with preference (fields directly associated with the programme): 0/40 - fields of study / plans the course is directly associated with
- there are 14 fields of study the course is directly associated with, display
- Course objectives
- The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed:
1) Corpus linguistics - history.
2) What is a corpus and what is in it?
3) Quantitative data.
4) The use of corpora in language studies.
5) Corpora and computational linguistics.
6) Corpus managers.
7) Part of speech analysis and tagging of a corpus.
8) Czech national corpus.
9) Corpora at MU. - Learning outcomes
- Upon completion of the course the student will be able to:
- understand the issues of corpus linguistics,
- understand the basic terminology of the field and use it,
- orient themself in the corpus typology,
- know the possibilities of using corpora. - Syllabus
- 1. Corpus Linguistics - History (ÚČNK).
- 2. Building Corpora.
- 3. Corpora of ČNK.
- 4. Automatical Morphological Analysis (tokenization, tagging, disambiguation).
- 5. Some problems of Automatical Morphological Analysis. 6. Spoken Language Corpora.
- 7. Corpus of Private Corespondence.
- 8. Corpus Manager.
- 9. Quantitative Data.
- 10. Diachrony and Corpora.
- Literature
- Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
- http://ufal.mff.cuni.cz/pdt2.0/index-cz.html
- Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
- Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
- Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
- http://ucnk.ff.cuni.cz/
- Encyklopedický slovník češtiny. Edited by Petr Karlík - Marek Nekula - Jana Pleskalová. Praha: Nakladatelství Lidové noviny, 2002, 604 s. ISBN 80-7106-484-X. info
- Studie z korpusové lingvistiky. Edited by František Čermák - Jana Klímová - Vladimír Petkevič. Vyd. 1. V Praze: Karolinum, 2000, 531 s. ISBN 807184893X. info
- MCENERY, Tony and Andrew WILSON. Corpus linguistics. Edinburgh: Edinburgh University Press, 1996, 209 s. ISBN 0-7486-0482-0. info
- BARNBROOK, Geoff. Language and computers :a practical introduction to the computer analysis of language. Edinburgh: Edinburgh University Press, 1996, ix, 209 s. ISBN 0-7486-0785-4. info
- Teaching methods
- A lecture with corpora and corpora tools presentation. Homereading.
- Assessment methods
- Colloquium. Written test: terminology, definitions - (knowledge of texts for homereading). The test will contain ten questions.
- Language of instruction
- Czech
- Follow-Up Courses
- Further Comments
- The course is taught annually.
The course is taught: every week. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics – Lecture
Faculty of ArtsAutumn 2019
The course is not taught in Autumn 2019
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: zk (examination).
- Teacher(s)
- Mgr. Dana Hlaváčková, Ph.D. (lecturer)
- Guaranteed by
- Mgr. Dana Hlaváčková, Ph.D.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová
Supplier department: Department of Czech Language – Faculty of Arts - Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
The capacity limit for the course is 40 student(s).
Current registration and enrolment status: enrolled: 0/40, only registered: 0/40, only registered with preference (fields directly associated with the programme): 0/40 - fields of study / plans the course is directly associated with
- there are 14 fields of study the course is directly associated with, display
- Course objectives
- The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed:
1) Corpus linguistics - history.
2) What is a corpus and what is in it?
3) Quantitative data.
4) The use of corpora in language studies.
5) Corpora and computational linguistics.
6) Corpus managers.
7) Part of speech analysis and tagging of a corpus.
8) Czech national corpus.
9) Corpora at MU. - Learning outcomes
- Upon completion of the course the student will be able to:
- understand the issues of corpus linguistics,
- understand the basic terminology of the field and use it,
- orient themself in the corpus typology,
- know the possibilities of using corpora. - Syllabus
- 1. Corpus Linguistics - History (ÚČNK).
- 2. Building Corpora.
- 3. Corpora of ČNK.
- 4. Automatical Morphological Analysis (tokenization, tagging, disambiguation).
- 5. Some problems of Automatical Morphological Analysis. 6. Spoken Language Corpora.
- 7. Corpus of Private Corespondence.
- 8. Corpus Manager.
- 9. Quantitative Data.
- 10. Diachrony and Corpora.
- Literature
- Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
- http://ufal.mff.cuni.cz/pdt2.0/index-cz.html
- Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
- Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
- Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
- http://ucnk.ff.cuni.cz/
- Encyklopedický slovník češtiny. Edited by Petr Karlík - Marek Nekula - Jana Pleskalová. Praha: Nakladatelství Lidové noviny, 2002, 604 s. ISBN 80-7106-484-X. info
- Studie z korpusové lingvistiky. Edited by František Čermák - Jana Klímová - Vladimír Petkevič. Vyd. 1. V Praze: Karolinum, 2000, 531 s. ISBN 807184893X. info
- MCENERY, Tony and Andrew WILSON. Corpus linguistics. Edinburgh: Edinburgh University Press, 1996, 209 s. ISBN 0-7486-0482-0. info
- BARNBROOK, Geoff. Language and computers :a practical introduction to the computer analysis of language. Edinburgh: Edinburgh University Press, 1996, ix, 209 s. ISBN 0-7486-0785-4. info
- Teaching methods
- A lecture with corpora and corpora tools presentation. Homereading.
- Assessment methods
- Colloquium. Written test: terminology, definitions - (knowledge of texts for homereading). The test will contain ten questions.
- Language of instruction
- Czech
- Follow-Up Courses
- Further Comments
- The course is taught annually.
The course is taught: every week. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics - Lecture
Faculty of ArtsSpring 2019
The course is not taught in Spring 2019
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: zk (examination).
- Teacher(s)
- Mgr. Dana Hlaváčková, Ph.D. (lecturer)
- Guaranteed by
- doc. PhDr. Klára Osolsobě, Dr.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová
Supplier department: Department of Czech Language – Faculty of Arts - Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
The capacity limit for the course is 40 student(s).
Current registration and enrolment status: enrolled: 0/40, only registered: 0/40, only registered with preference (fields directly associated with the programme): 0/40 - fields of study / plans the course is directly associated with
- there are 15 fields of study the course is directly associated with, display
- Course objectives
- The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed: 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU.
- Learning outcomes
- Upon completion of the course the student will be able to:
- understand the issues of corpus linguistics,
- understand the basic terminology of the field and use it,
- orient themself in the corpus typology,
- know the possibilities of using corpora. - Syllabus
- 1. Corpus Linguistics - History (ÚČNK) 2. Building Corpora 3. Corpora of ČNK 4. Automatical Morphological Analysis (tokenization, tagging, disambiguation) 5. Some problems of Automatical Morphological Analysis 6. Spoken Language Corpora 7. Corpus of Private Corespondence 8. Corpus Manager 9. Quantitative Data 10. Diachrony and Corpora
- Literature
- http://ucnk.ff.cuni.cz/
- Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
- http://ufal.mff.cuni.cz/pdt2.0/index-cz.html
- Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
- Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
- Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
- Encyklopedický slovník češtiny. Edited by Petr Karlík - Marek Nekula - Jana Pleskalová. Praha: Nakladatelství Lidové noviny, 2002, 604 s. ISBN 80-7106-484-X. info
- Studie z korpusové lingvistiky. Edited by František Čermák - Jana Klímová - Vladimír Petkevič. Vyd. 1. V Praze: Karolinum, 2000, 531 s. ISBN 807184893X. info
- MCENERY, Tony and Andrew WILSON. Corpus linguistics. Edinburgh: Edinburgh University Press, 1996, 209 s. ISBN 0-7486-0482-0. info
- BARNBROOK, Geoff. Language and computers :a practical introduction to the computer analysis of language. Edinburgh: Edinburgh University Press, 1996, ix, 209 s. ISBN 0-7486-0785-4. info
- Teaching methods
- A lecture with corpora and corpora tools presentation. Homereading.
- Assessment methods
- Colloquium. Written test: terminology, definitions - (knowledge of texts for homereading). The test will contain ten questions.
- Language of instruction
- Czech
- Follow-Up Courses
- Further Comments
- The course is taught each semester.
The course is taught: every week. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics - Lecture
Faculty of ArtsSpring 2018
The course is not taught in Spring 2018
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: zk (examination).
- Teacher(s)
- Mgr. Dana Hlaváčková, Ph.D. (lecturer)
- Guaranteed by
- doc. PhDr. Klára Osolsobě, Dr.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová
Supplier department: Department of Czech Language – Faculty of Arts - Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
The capacity limit for the course is 40 student(s).
Current registration and enrolment status: enrolled: 0/40, only registered: 0/40, only registered with preference (fields directly associated with the programme): 0/40 - fields of study / plans the course is directly associated with
- there are 15 fields of study the course is directly associated with, display
- Course objectives
- The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed: 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU.
- Learning outcomes
- Upon completion of the course the student will be able to:
- understand the issues of corpus linguistics,
- understand the basic terminology of the field and use it,
- orient themself in the corpus typology,
- know the possibilities of using corpora. - Syllabus
- 1. Corpus Linguistics - History (ÚČNK) 2. Building Corpora 3. Corpora of ČNK 4. Automatical Morphological Analysis (tokenization, tagging, disambiguation) 5. Some problems of Automatical Morphological Analysis 6. Spoken Language Corpora 7. Corpus of Private Corespondence 8. Corpus Manager 9. Quantitative Data 10. Diachrony and Corpora
- Literature
- http://ucnk.ff.cuni.cz/
- Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
- http://ufal.mff.cuni.cz/pdt2.0/index-cz.html
- Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
- Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
- Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
- Encyklopedický slovník češtiny. Edited by Petr Karlík - Marek Nekula - Jana Pleskalová. Praha: Nakladatelství Lidové noviny, 2002, 604 s. ISBN 80-7106-484-X. info
- Studie z korpusové lingvistiky. Edited by František Čermák - Jana Klímová - Vladimír Petkevič. Vyd. 1. V Praze: Karolinum, 2000, 531 s. ISBN 807184893X. info
- MCENERY, Tony and Andrew WILSON. Corpus linguistics. Edinburgh: Edinburgh University Press, 1996, 209 s. ISBN 0-7486-0482-0. info
- BARNBROOK, Geoff. Language and computers :a practical introduction to the computer analysis of language. Edinburgh: Edinburgh University Press, 1996, ix, 209 s. ISBN 0-7486-0785-4. info
- Teaching methods
- A lecture with corpora and corpora tools presentation. Homereading.
- Assessment methods
- Colloquium. Written test: terminology, definitions - (knowledge of texts for homereading). The test will contain ten questions.
- Language of instruction
- Czech
- Follow-Up Courses
- Further Comments
- The course is taught each semester.
The course is taught: every week. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics - Lecture
Faculty of ArtsSpring 2017
The course is not taught in Spring 2017
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: zk (examination).
- Teacher(s)
- Mgr. Dana Hlaváčková, Ph.D. (lecturer)
- Guaranteed by
- doc. PhDr. Klára Osolsobě, Dr.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová
Supplier department: Department of Czech Language – Faculty of Arts - Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
The capacity limit for the course is 40 student(s).
Current registration and enrolment status: enrolled: 0/40, only registered: 0/40, only registered with preference (fields directly associated with the programme): 0/40 - fields of study / plans the course is directly associated with
- there are 15 fields of study the course is directly associated with, display
- Course objectives
- The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed: 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU.
- Syllabus
- 1. Corpus Linguistics - History (ÚČNK) 2. Building Corpora 3. Corpora of ČNK 4. Automatical Morphological Analysis (tokenization, tagging, disambiguation) 5. Some problems of Automatical Morphological Analysis 6. Spoken Language Corpora 7. Corpus of Private Corespondence 8. Corpus Manager 9. Quantitative Data 10. Diachrony and Corpora
- Literature
- Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
- Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
- http://ucnk.ff.cuni.cz/
- Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
- Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
- http://ufal.mff.cuni.cz/pdt2.0/index-cz.html
- Encyklopedický slovník češtiny. Edited by Petr Karlík - Marek Nekula - Jana Pleskalová. Praha: Nakladatelství Lidové noviny, 2002, 604 s. ISBN 80-7106-484-X. info
- Studie z korpusové lingvistiky. Edited by František Čermák - Jana Klímová - Vladimír Petkevič. Vyd. 1. V Praze: Karolinum, 2000, 531 s. ISBN 807184893X. info
- MCENERY, Tony and Andrew WILSON. Corpus linguistics. Edinburgh: Edinburgh University Press, 1996, 209 s. ISBN 0-7486-0482-0. info
- BARNBROOK, Geoff. Language and computers :a practical introduction to the computer analysis of language. Edinburgh: Edinburgh University Press, 1996, ix, 209 s. ISBN 0-7486-0785-4. info
- Teaching methods
- A lecture with corpora and corpora tools presentation. Homereading.
- Assessment methods
- Colloquium. Written test: terminology, definitions - (knowledge of texts for homereading). The test will contain ten questions (minimum pass level 66,6 %).
- Language of instruction
- Czech
- Follow-Up Courses
- Further comments (probably available only in Czech)
- The course is taught each semester.
The course is taught: every week. - Listed among pre-requisites of other courses
CJBB105 Introduction in Corpus Linguistics - Lecture
Faculty of ArtsSpring 2016
The course is not taught in Spring 2016
- Extent and Intensity
- 2/0/0. 4 credit(s). Type of Completion: zk (examination).
- Teacher(s)
- Mgr. Dana Hlaváčková, Ph.D. (lecturer)
- Guaranteed by
- doc. PhDr. Klára Osolsobě, Dr.
Department of Czech Language – Faculty of Arts
Contact Person: Jaroslava Vybíralová
Supplier department: Department of Czech Language – Faculty of Arts - Course Enrolment Limitations
- The course is also offered to the students of the fields other than those the course is directly associated with.
The capacity limit for the course is 40 student(s).
Current registration and enrolment status: enrolled: 0/40, only registered: 0/40, only registered with preference (fields directly associated with the programme): 0/40 - fields of study / plans the course is directly associated with
- there are 9 fields of study the course is directly associated with, display
- Course objectives
- The aim of the course is to give the first information about corpus-based approach to language and linguistics. Following issues are to be discussed: 1) Corpus linguistics - history 2) What is a corpus and what is in it? 3) Quantitative data 4) The use of corpora in language studies 5) Corpora and computational linguistics 6) Corpus managers 7) Part of speech analysis and tagging of a corpus 8) Czech national corpus 9) Corpora at MU.
- Syllabus
- 1. Corpus Linguistics - History (ÚČNK) 2. Building Corpora 3. Corpora of ČNK 4. Automatical Morphological Analysis (tokenization, tagging, disambiguation) 5. Some problems of Automatical Morphological Analysis 6. Spoken Language Corpora 7. Corpus of Private Corespondence 8. Corpus Manager 9. Quantitative Data 10. Diachrony and Corpora
- Literature
- Čermák František (1999): Oxfordská lexikografie přechází také plně na korpus. Slovo a slovesnost, 60, s. 136-141.
- Čermák, F.: Jazykový korpus: Prostředek a zdroj poznání. SaS, 56, 1995, s. 119-140.
- http://ucnk.ff.cuni.cz/
- Čermák F, Blatná R. (eds.) (1995): Manuál lexikografie. Jinočany : H&H.
- Čermák F., Králík J., Kučera K. (1997): Recepce současné češtiny a reprezentativnost korpusu (Výsledky a některé souvislosti jedné orientační sondy na pozadí budování Českého národního korpusu). SaS, 58, 2, s. 118-124.
- http://ufal.mff.cuni.cz/pdt2.0/index-cz.html
- Encyklopedický slovník češtiny. Edited by Petr Karlík - Marek Nekula - Jana Pleskalová. Praha: Nakladatelství Lidové noviny, 2002, 604 s. ISBN 80-7106-484-X. info
- Studie z korpusové lingvistiky. Edited by František Čermák - Jana Klímová - Vladimír Petkevič. Vyd. 1. V Praze: Karolinum, 2000, 531 s. ISBN 807184893X. info
- MCENERY, Tony and Andrew WILSON. Corpus linguistics. Edinburgh: Edinburgh University Press, 1996, 209 s. ISBN 0-7486-0482-0. info
- BARNBROOK, Geoff. Language and computers :a practical introduction to the computer analysis of language. Edinburgh: Edinburgh University Press, 1996, ix, 209 s. ISBN 0-7486-0785-4. info
- Teaching methods
- A lecture with corpora and corpora tools presentation. Homereading.
- Assessment methods
- Colloquium. Written test: terminology, definitions - (knowledge of texts for homereading). The test will contain ten questions (minimum pass level 66,6 %).
- Language of instruction
- Czech
- Follow-Up Courses
- Further comments (probably available only in Czech)
- The course is taught each semester.
The course is taught: every week. - Listed among pre-requisites of other courses
- Enrolment Statistics (recent)