ŠČAVNICKÁ, Šárka, Michal ŠTEFÁNIK, Marek KADLČÍK, Martin GELETKA and Petr SOJKA. Towards General Document Understanding through Question Answering. In Recent Advances in Slavonic Natural Language Processing (RASLAN 2022). Recent Advances in Slavonic. Brno: Tribun EU, 2022, p. 181-188. ISBN 978-80-263-1752-4.
Other formats:   BibTeX LaTeX RIS
Basic information
Original name Towards General Document Understanding through Question Answering
Authors ŠČAVNICKÁ, Šárka (703 Slovakia, guarantor, belonging to the institution), Michal ŠTEFÁNIK (703 Slovakia, belonging to the institution), Marek KADLČÍK (203 Czech Republic, belonging to the institution), Martin GELETKA (703 Slovakia, belonging to the institution) and Petr SOJKA (203 Czech Republic, belonging to the institution).
Edition Recent Advances in Slavonic. Brno, Recent Advances in Slavonic Natural Language Processing (RASLAN 2022), p. 181-188, 8 pp. 2022.
Publisher Tribun EU
Other information
Original language English
Type of outcome Proceedings paper
Field of Study 10201 Computer sciences, information science, bioinformatics
Country of publisher Czech Republic
Confidentiality degree is not subject to a state or trade secret
Publication form printed version "print"
WWW fulltext PDF
RIV identification code RIV/00216224:14330/22:00127251
Organization unit Faculty of Informatics
ISBN 978-80-263-1752-4
ISSN 2336-4289
Keywords in English Question Answering; Visual Question Answering; Document Visual Question Answering
Tags International impact
Changed by Changed by: RNDr. Pavel Šmerk, Ph.D., učo 3880. Changed: 15/5/2024 09:25.
Abstract
Document Visual Question Answering is a relatively new extension of Visual Question Answering. The aim is to understand the documents and to be able to obtain information that corresponds to the question that was asked. This proposition aims to approach the problem of the lack of datasets and a model for Slavic languages. Therefore we would like to create a model and dataset for Document VQA suitable for the non-English language. This paper overviews the field of Question Answering and also describes the first Czech Document VQA dataset and model.
Links
CZ.01.1.02/0.0/0.0/21_374/0026711, interní kód MUName: Inteligentní back office
Investor: Ministry of Industry and Trade of the CR
EG21_374/0026711, research and development projectName: Inteligentní back office
MUNI/A/1195/2021, interní kód MUName: Aplikovaný výzkum v oblastech vyhledávání, analýz a vizualizací rozsáhlých dat, zpracování přirozeného jazyka a aplikované umělé inteligence
Investor: Masaryk University
PrintDisplayed: 25/8/2024 16:00