SEER: A knapsack approach to exemplar selection for in-context hybridQA
SEER: A knapsack approach to exemplar selection for in-context hybridQA
Question answering over hybrid contexts is a complex task, which requires the combina tion of information extracted from unstructured texts and structured tables in various ways. Re cently, In-Context Learning demonstrated sig nificant performance advances for reasoning tasks. In this paradigm, a large language model performs predictions based on a small set of supporting exemplars. The performance of In-Context Learning depends heavily on the selection procedure of the supporting exem plars, particularly in the case of HybridQA where considering the diversity of reasoning chains and the large size of the hybrid con texts becomes crucial. In this work, we present Selection of ExEmplars for hybrid Reasoning (SEER), a novel method for selecting a set of exemplars that is both representative and di verse. The key novelty of SEER is that it for mulates exemplar selection as a Knapsack Inte ger Linear Program. The Knapsack framework provides the flexibility to incorporate diversity constraints that prioritize exemplars with desir able attributes, and capacity constraints that en sure that the prompt size respects the provided capacity budgets. The effectiveness of SEER is demonstrated on FinQA and TAT-QA, two real-world benchmarks for HybridQA, where it outperforms previous exemplar selection meth ods.
13569-13583
Association for Computational Linguistics (ACL)
Tonglet, Jonathan
4f72888a-9922-41e5-b8c0-c2ad5c68e0df
Reusens, Manon
3dc14c4b-793a-41d6-b7bd-64303cda1c42
Borchert, Philipp
a57c31fb-9fdc-4ca5-bbd1-b6c6fc8a8483
Baesens, Bart
f7c6496b-aa7f-4026-8616-ca61d9e216f0
1 December 2023
Tonglet, Jonathan
4f72888a-9922-41e5-b8c0-c2ad5c68e0df
Reusens, Manon
3dc14c4b-793a-41d6-b7bd-64303cda1c42
Borchert, Philipp
a57c31fb-9fdc-4ca5-bbd1-b6c6fc8a8483
Baesens, Bart
f7c6496b-aa7f-4026-8616-ca61d9e216f0
Tonglet, Jonathan, Reusens, Manon, Borchert, Philipp and Baesens, Bart
(2023)
SEER: A knapsack approach to exemplar selection for in-context hybridQA.
Bouamor, Houda, Pino, Juan and Bali, Kalika
(eds.)
In EMNLP 2023 - 2023 Conference on Empirical Methods in Natural Language Processing, Proceedings.
Association for Computational Linguistics (ACL).
.
(doi:10.18653/v1/2023.emnlp-main.837).
Record type:
Conference or Workshop Item
(Paper)
Abstract
Question answering over hybrid contexts is a complex task, which requires the combina tion of information extracted from unstructured texts and structured tables in various ways. Re cently, In-Context Learning demonstrated sig nificant performance advances for reasoning tasks. In this paradigm, a large language model performs predictions based on a small set of supporting exemplars. The performance of In-Context Learning depends heavily on the selection procedure of the supporting exem plars, particularly in the case of HybridQA where considering the diversity of reasoning chains and the large size of the hybrid con texts becomes crucial. In this work, we present Selection of ExEmplars for hybrid Reasoning (SEER), a novel method for selecting a set of exemplars that is both representative and di verse. The key novelty of SEER is that it for mulates exemplar selection as a Knapsack Inte ger Linear Program. The Knapsack framework provides the flexibility to incorporate diversity constraints that prioritize exemplars with desir able attributes, and capacity constraints that en sure that the prompt size respects the provided capacity budgets. The effectiveness of SEER is demonstrated on FinQA and TAT-QA, two real-world benchmarks for HybridQA, where it outperforms previous exemplar selection meth ods.
This record has no associated files available for download.
More information
Published date: 1 December 2023
Venue - Dates:
2023 Conference on Empirical Methods in Natural Language Processing, EMNLP 2023, , Hybrid, Singapore, Singapore, 2023-12-06 - 2023-12-10
Identifiers
Local EPrints ID: 492062
URI: http://eprints.soton.ac.uk/id/eprint/492062
PURE UUID: 8d29de6e-9fdd-4897-a1f7-bd5266a16fe1
Catalogue record
Date deposited: 15 Jul 2024 16:48
Last modified: 20 Jul 2024 01:40
Export record
Altmetrics
Contributors
Author:
Jonathan Tonglet
Author:
Manon Reusens
Author:
Philipp Borchert
Editor:
Houda Bouamor
Editor:
Juan Pino
Editor:
Kalika Bali
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics