Detailed Information on Publication Record
2023
Calc-X and Calcformers: Empowering Arithmetical Chain-of-Thought through Interaction with Symbolic Systems
KADLČÍK, Marek, Michal ŠTEFÁNIK, Ondřej SOTOLÁŘ and Vlastimil MARTINEK
Basic information
Original name
Calc-X and Calcformers: Empowering Arithmetical Chain-of-Thought through Interaction with Symbolic Systems
Authors
KADLČÍK, Marek (203 Czech Republic, belonging to the institution), Michal ŠTEFÁNIK (703 Slovakia, guarantor, belonging to the institution), Ondřej SOTOLÁŘ (203 Czech Republic) and Vlastimil MARTINEK (203 Czech Republic, belonging to the institution)
Edition
Singapore, Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: Main track, p. 12101-12108, 8 pp. 2023
Publisher
Association for Computational Linguistics
Other information
Language
English
Type of outcome
Proceedings paper
Field of Study
10201 Computer sciences, information science, bioinformatics
Country of publisher
United States of America
Confidentiality degree
is not subject to a state or trade secret
Publication form
electronic version available online
References:
RIV identification code
RIV/00216224:14330/23:00131954
Organization unit
Faculty of Informatics
ISBN
979-8-89176-060-8
Keywords in English
language models; dataset; arithmetic reasoning; multistep reasoning
Tags
International impact, Reviewed
Changed: 21/5/2024 08:54, Mgr. Michal Štefánik
Abstract
In the original
Despite outstanding performance on many generation tasks, language models are notoriously inclined to make factual errors in tasks requiring arithmetic reasoning. To enable language models to circumvent this deficiency and offload critical computation to a symbolic system, we create a collection of Calc-X datasets that demonstrates the appropriate use of a calculator in reasoning chains. We survey and unify several existing chain-of-thoughts datasets into a proposed novel format, resulting in a standard collection of over 300,000 samples requiring arithmetic reasoning. Finally, we use the new collection to train open-source calculator-assisted language models and show that models trained on Calc-X almost double the accuracy of generating correct results compared to baselines. We make all Calc-X datasets and models publicly available.
Links
MUNI/A/1339/2022, internal MU code