Efficient query repair for aggregate constraints
Efficient query repair for aggregate constraints
In many real-world scenarios, query results must satisfy domain-specific constraints such as fairness or financial stability. For example, selecting interview candidates based on their qualifications may require that at least a given percentage be female, or a report on purchase costs may need to ensure the average cost stays below a liability threshold. These requirements can be expressed as constraints over an arithmetic combination of aggregates evaluated on the result of the query.
This thesis studies how to repair a query to fulfill such constraints by modifying the filter predicates of the query. These constraints are non monotone and more complex than those considered in prior work, such as query based explanations for missing answers or fairness enforcement in query results. The constraints considered in this thesis invalidate many existing optimizations considered in prior work. The work in this thesis introduces a novel query repair technique that computes the top-k candidate repairs with respect to their distance to the user query. These techniques leverage materialization and data clustering to avoid unnecessary computation. It also exploits bounds on sets of candidate solutions and interval arithmetic to efficiently prune the search space. Experimental evaluation on real-world and benchmark datasets shows that the proposed pruning technique significantly outperforms baselines that consider a single candidate at a time.
query refinement, query repair, query rewriting
University of Southampton
Algarni, Shatha Saad
127e4ee1-5a46-4d27-86d1-b5d3108866ba
30 December 2025
Algarni, Shatha Saad
127e4ee1-5a46-4d27-86d1-b5d3108866ba
Chapman, Age
721b7321-8904-4be2-9b01-876c430743f1
Staab, Steffen
bf48d51b-bd11-4d58-8e1c-4e6e03b30c49
Algarni, Shatha Saad
(2025)
Efficient query repair for aggregate constraints.
University of Southampton, Doctoral Thesis, 139pp.
Record type:
Thesis
(Doctoral)
Abstract
In many real-world scenarios, query results must satisfy domain-specific constraints such as fairness or financial stability. For example, selecting interview candidates based on their qualifications may require that at least a given percentage be female, or a report on purchase costs may need to ensure the average cost stays below a liability threshold. These requirements can be expressed as constraints over an arithmetic combination of aggregates evaluated on the result of the query.
This thesis studies how to repair a query to fulfill such constraints by modifying the filter predicates of the query. These constraints are non monotone and more complex than those considered in prior work, such as query based explanations for missing answers or fairness enforcement in query results. The constraints considered in this thesis invalidate many existing optimizations considered in prior work. The work in this thesis introduces a novel query repair technique that computes the top-k candidate repairs with respect to their distance to the user query. These techniques leverage materialization and data clustering to avoid unnecessary computation. It also exploits bounds on sets of candidate solutions and interval arithmetic to efficiently prune the search space. Experimental evaluation on real-world and benchmark datasets shows that the proposed pruning technique significantly outperforms baselines that consider a single candidate at a time.
Text
Southampton_PhD_Thesis_PDFA
- Version of Record
Text
Final-thesis-submission-Examination-Mrs-Shatha-Algarni
Restricted to Repository staff only
More information
Published date: 30 December 2025
Keywords:
query refinement, query repair, query rewriting
Identifiers
Local EPrints ID: 508047
URI: http://eprints.soton.ac.uk/id/eprint/508047
PURE UUID: 3beb077a-4f54-417d-8949-4ffb3159ae92
Catalogue record
Date deposited: 12 Jan 2026 17:50
Last modified: 13 Jan 2026 03:02
Export record
Contributors
Author:
Shatha Saad Algarni
Thesis advisor:
Steffen Staab
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics