D 2016

AQA: Automatic Question Answering System for Czech

MEDVEĎ, Marek and Aleš HORÁK

Basic information

Original name

AQA: Automatic Question Answering System for Czech

Authors

MEDVEĎ, Marek (703 Slovakia, guarantor, belonging to the institution) and Aleš HORÁK (203 Czech Republic)

Edition

Switzerland, Text, Speech, and Dialogue 19th International Conference, TSD 2016 Brno, Czech Republic, September 12–16, 2016 Proceedings, p. 270-278, 9 pp. 2016

Publisher

Springer International Publishing

Other information

Language

English

Type of outcome

Stať ve sborníku

Field of Study

60200 6.2 Languages and Literature

Country of publisher

Switzerland

Confidentiality degree

není předmětem státního či obchodního tajemství

Publication form

printed version "print"

References:

Impact factor

Impact factor: 0.402 in 2005

RIV identification code

RIV/00216224:14330/16:00088123

Organization unit

Faculty of Informatics

ISBN

978-3-319-45510-5

ISSN

UT WoS

000389707400031

Keywords in English

Question Answering; AQA; Simple Question Answering Database; SQAD; Named entity recognition

Tags

Tags

International impact, Reviewed
Změněno: 13/5/2020 19:14, RNDr. Pavel Šmerk, Ph.D.

Abstract

V originále

Question answering (QA) systems have become popular nowadays, however, a majority of them concentrates on the English language and most of them are oriented to a specific limited problem domain. In this paper, we present a new question answering system called AQA (Automatic Question Answering). AQA is an open-domain QA system which allows users to ask all common questions related to a selected text collection. The first version of the AQA system is developed and tested for the Czech language, but we also plan to include more languages in future versions. The AQA strategy consists of three main parts: question processing,answer selection and answer extraction. All modules are syntax-based with advanced scoring obtained by a combination of TF-IDF, tree distance between the question and candidate answers and other selected criteria. The answer extraction module utilizes named entity recognizer which allows the system to catch entities that are most likely to answer the question. Evaluation of the AQA system is performed on a previously published Simple Question-Answering Database, or SQAD, with more than 3,000 question-answer pairs.

Links

GA15-13277S, research and development project
Name: Hyperintensionální logika pro analýzu přirozeného jazyka
Investor: Czech Science Foundation
MUNI/A/0945/2015, interní kód MU
Name: Rozsáhlé výpočetní systémy: modely, aplikace a verifikace V.
Investor: Masaryk University, Category A
7F14047, research and development project
Name: Harvesting big text data for under-resourced languages (Acronym: HaBiT)
Investor: Ministry of Education, Youth and Sports of the CR