The University of Southampton
University of Southampton Institutional Repository

SEER: A knapsack approach to exemplar selection for in-context hybridQA

SEER: A knapsack approach to exemplar selection for in-context hybridQA
SEER: A knapsack approach to exemplar selection for in-context hybridQA

Question answering over hybrid contexts is a complex task, which requires the combina tion of information extracted from unstructured texts and structured tables in various ways. Re cently, In-Context Learning demonstrated sig nificant performance advances for reasoning tasks. In this paradigm, a large language model performs predictions based on a small set of supporting exemplars. The performance of In-Context Learning depends heavily on the selection procedure of the supporting exem plars, particularly in the case of HybridQA where considering the diversity of reasoning chains and the large size of the hybrid con texts becomes crucial. In this work, we present Selection of ExEmplars for hybrid Reasoning (SEER), a novel method for selecting a set of exemplars that is both representative and di verse. The key novelty of SEER is that it for mulates exemplar selection as a Knapsack Inte ger Linear Program. The Knapsack framework provides the flexibility to incorporate diversity constraints that prioritize exemplars with desir able attributes, and capacity constraints that en sure that the prompt size respects the provided capacity budgets. The effectiveness of SEER is demonstrated on FinQA and TAT-QA, two real-world benchmarks for HybridQA, where it outperforms previous exemplar selection meth ods.

13569-13583
Association for Computational Linguistics (ACL)
Tonglet, Jonathan
4f72888a-9922-41e5-b8c0-c2ad5c68e0df
Reusens, Manon
3dc14c4b-793a-41d6-b7bd-64303cda1c42
Borchert, Philipp
a57c31fb-9fdc-4ca5-bbd1-b6c6fc8a8483
Baesens, Bart
f7c6496b-aa7f-4026-8616-ca61d9e216f0
Bouamor, Houda
Pino, Juan
Bali, Kalika
Tonglet, Jonathan
4f72888a-9922-41e5-b8c0-c2ad5c68e0df
Reusens, Manon
3dc14c4b-793a-41d6-b7bd-64303cda1c42
Borchert, Philipp
a57c31fb-9fdc-4ca5-bbd1-b6c6fc8a8483
Baesens, Bart
f7c6496b-aa7f-4026-8616-ca61d9e216f0
Bouamor, Houda
Pino, Juan
Bali, Kalika

Tonglet, Jonathan, Reusens, Manon, Borchert, Philipp and Baesens, Bart (2023) SEER: A knapsack approach to exemplar selection for in-context hybridQA. Bouamor, Houda, Pino, Juan and Bali, Kalika (eds.) In EMNLP 2023 - 2023 Conference on Empirical Methods in Natural Language Processing, Proceedings. Association for Computational Linguistics (ACL). pp. 13569-13583 . (doi:10.18653/v1/2023.emnlp-main.837).

Record type: Conference or Workshop Item (Paper)

Abstract

Question answering over hybrid contexts is a complex task, which requires the combina tion of information extracted from unstructured texts and structured tables in various ways. Re cently, In-Context Learning demonstrated sig nificant performance advances for reasoning tasks. In this paradigm, a large language model performs predictions based on a small set of supporting exemplars. The performance of In-Context Learning depends heavily on the selection procedure of the supporting exem plars, particularly in the case of HybridQA where considering the diversity of reasoning chains and the large size of the hybrid con texts becomes crucial. In this work, we present Selection of ExEmplars for hybrid Reasoning (SEER), a novel method for selecting a set of exemplars that is both representative and di verse. The key novelty of SEER is that it for mulates exemplar selection as a Knapsack Inte ger Linear Program. The Knapsack framework provides the flexibility to incorporate diversity constraints that prioritize exemplars with desir able attributes, and capacity constraints that en sure that the prompt size respects the provided capacity budgets. The effectiveness of SEER is demonstrated on FinQA and TAT-QA, two real-world benchmarks for HybridQA, where it outperforms previous exemplar selection meth ods.

This record has no associated files available for download.

More information

Published date: 1 December 2023
Venue - Dates: 2023 Conference on Empirical Methods in Natural Language Processing, EMNLP 2023, , Hybrid, Singapore, Singapore, 2023-12-06 - 2023-12-10

Identifiers

Local EPrints ID: 492062
URI: http://eprints.soton.ac.uk/id/eprint/492062
PURE UUID: 8d29de6e-9fdd-4897-a1f7-bd5266a16fe1
ORCID for Bart Baesens: ORCID iD orcid.org/0000-0002-5831-5668

Catalogue record

Date deposited: 15 Jul 2024 16:48
Last modified: 20 Jul 2024 01:40

Export record

Altmetrics

Contributors

Author: Jonathan Tonglet
Author: Manon Reusens
Author: Philipp Borchert
Author: Bart Baesens ORCID iD
Editor: Houda Bouamor
Editor: Juan Pino
Editor: Kalika Bali

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×