Boosting credit risk data quality using machine learning and eXplainable AI techniques
Boosting credit risk data quality using machine learning and eXplainable AI techniques
Data play a crucial role in decision making for modern businesses, and the reliability of these decisions highly depends on the data quality. This problem is particularly relevant when we deal with financial data used for risk assessment and reporting. Basel regulations mandate that banks hold a certain amount of capital based on the level of risk in their portfolios expressed in terms of risk-weighted assets (RWAs). The quality of a bank’s risk management is directly impacted by the quality of the asset data used to calculate RWAs. In this study, we present a data quality (DQ) framework and show how machine learning paired with eXplainable Artificial Intelligence (XAI) techniques can be used to perform automatic DQ monitoring using a human-in-the-loop approach in credit risk. By obtaining expert feedback about model output enriched with XAI explanations, we clearly demonstrate the power of ML in terms of enhancing credit risk data quality and the advantages of using XAI to assist experts in analysing model outputs.
Credit risk, Data Quality, Model ensembles, SHAP, XAI
420-429
Tiukhova, Elena
d892421d-5c0a-4091-9af2-a738e71518e7
Salcuni, Adriano
25d3ff55-e6c3-44ab-977e-e36c91f005ed
Oguz, Can
d557a13a-758f-4e65-9135-384111315954
Niglio, Marcella
45756ccf-4e29-4268-bc9f-e4c9c318bb30
Storti, Giuseppe
18824ee4-f42f-4120-9893-b4ff76bc32aa
Forte, Fabio
26f37cd9-1d89-4496-a9c7-bad292c3efee
Baesens, Bart
f7c6496b-aa7f-4026-8616-ca61d9e216f0
Snoeck, Monique
9aee96bc-8a57-4c37-bcd7-e83f0b173ee1
1 January 2025
Tiukhova, Elena
d892421d-5c0a-4091-9af2-a738e71518e7
Salcuni, Adriano
25d3ff55-e6c3-44ab-977e-e36c91f005ed
Oguz, Can
d557a13a-758f-4e65-9135-384111315954
Niglio, Marcella
45756ccf-4e29-4268-bc9f-e4c9c318bb30
Storti, Giuseppe
18824ee4-f42f-4120-9893-b4ff76bc32aa
Forte, Fabio
26f37cd9-1d89-4496-a9c7-bad292c3efee
Baesens, Bart
f7c6496b-aa7f-4026-8616-ca61d9e216f0
Snoeck, Monique
9aee96bc-8a57-4c37-bcd7-e83f0b173ee1
Tiukhova, Elena, Salcuni, Adriano, Oguz, Can, Niglio, Marcella, Storti, Giuseppe, Forte, Fabio, Baesens, Bart and Snoeck, Monique
(2025)
Boosting credit risk data quality using machine learning and eXplainable AI techniques.
In,
Meo, Rosa and Silvestri, Fabrizio
(eds.)
Machine Learning and Principles and Practice of Knowledge Discovery in Databases. ECML PKDD 2023.
(Communications in Computer and Information Science, 2137)
1 ed.
Springer Cham, .
(doi:10.1007/978-3-031-74643-7_30).
Record type:
Book Section
Abstract
Data play a crucial role in decision making for modern businesses, and the reliability of these decisions highly depends on the data quality. This problem is particularly relevant when we deal with financial data used for risk assessment and reporting. Basel regulations mandate that banks hold a certain amount of capital based on the level of risk in their portfolios expressed in terms of risk-weighted assets (RWAs). The quality of a bank’s risk management is directly impacted by the quality of the asset data used to calculate RWAs. In this study, we present a data quality (DQ) framework and show how machine learning paired with eXplainable Artificial Intelligence (XAI) techniques can be used to perform automatic DQ monitoring using a human-in-the-loop approach in credit risk. By obtaining expert feedback about model output enriched with XAI explanations, we clearly demonstrate the power of ML in terms of enhancing credit risk data quality and the advantages of using XAI to assist experts in analysing model outputs.
Text
MIDAS camera-ready
- Accepted Manuscript
Restricted to Repository staff only until 1 January 2026.
Request a copy
More information
Published date: 1 January 2025
Keywords:
Credit risk, Data Quality, Model ensembles, SHAP, XAI
Identifiers
Local EPrints ID: 499416
URI: http://eprints.soton.ac.uk/id/eprint/499416
ISSN: 1865-0929
PURE UUID: a0dddca3-c3e3-4185-9d85-4ee35de1ed8b
Catalogue record
Date deposited: 19 Mar 2025 17:41
Last modified: 20 Mar 2025 02:40
Export record
Altmetrics
Contributors
Author:
Elena Tiukhova
Author:
Adriano Salcuni
Author:
Can Oguz
Author:
Marcella Niglio
Author:
Giuseppe Storti
Author:
Fabio Forte
Author:
Monique Snoeck
Editor:
Rosa Meo
Editor:
Fabrizio Silvestri
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics