Detailed Information on Publication Record
2019
Question and Answer Classification in Czech Question Answering Benchmark Dataset
MEDVEĎ, Marek, Aleš HORÁK and Daša KUŠNIRÁKOVÁBasic information
Original name
Question and Answer Classification in Czech Question Answering Benchmark Dataset
Authors
MEDVEĎ, Marek (703 Slovakia, guarantor, belonging to the institution), Aleš HORÁK (203 Czech Republic) and Daša KUŠNIRÁKOVÁ (703 Slovakia)
Edition
Prague, Czech Republic, Proceedings of the 11th International Conference on Agents and Artificial Intelligence, Volume 2, p. 701-706, 6 pp. 2019
Publisher
SCITEPRESS
Other information
Language
English
Type of outcome
Stať ve sborníku
Field of Study
10200 1.2 Computer and information sciences
Country of publisher
Czech Republic
Confidentiality degree
není předmětem státního či obchodního tajemství
Publication form
electronic version available online
RIV identification code
RIV/00216224:14330/19:00107362
Organization unit
Faculty of Informatics
ISBN
978-989-758-350-6
UT WoS
000671841000075
Keywords in English
Question Answering; Question Classification; Answer Classification; Czech; Simple Question Answering Database; SQAD
Tags
Tags
International impact, Reviewed
Změněno: 14/9/2024 10:29, doc. RNDr. Aleš Horák, Ph.D.
Abstract
V originále
In this paper, we introduce a new updated version of the Czech Question Answering database SQAD v2.1 (Simple Question Answering Database) with the update being devoted to improved question and answer classification. The SQAD v2.1 database contains more than 8,500 question-answer pairs with all appropriate metadata for QA training and evaluation. We present the details and changes in the database structure as well as a new algorithm for detecting the question type and the actual answer type from the text of the question. The algorithm is evaluated with more than 4,000 question answer pairs reaching the F1-measure of 88% for question typed and 85% for answer type detection.
Links
GA18-23891S, research and development project |
| ||
MUNI/A/1018/2018, interní kód MU |
|