The University of Southampton
University of Southampton Institutional Repository

Boosting credit risk data quality using machine learning and eXplainable AI techniques

Boosting credit risk data quality using machine learning and eXplainable AI techniques
Boosting credit risk data quality using machine learning and eXplainable AI techniques

Data play a crucial role in decision making for modern businesses, and the reliability of these decisions highly depends on the data quality. This problem is particularly relevant when we deal with financial data used for risk assessment and reporting. Basel regulations mandate that banks hold a certain amount of capital based on the level of risk in their portfolios expressed in terms of risk-weighted assets (RWAs). The quality of a bank’s risk management is directly impacted by the quality of the asset data used to calculate RWAs. In this study, we present a data quality (DQ) framework and show how machine learning paired with eXplainable Artificial Intelligence (XAI) techniques can be used to perform automatic DQ monitoring using a human-in-the-loop approach in credit risk. By obtaining expert feedback about model output enriched with XAI explanations, we clearly demonstrate the power of ML in terms of enhancing credit risk data quality and the advantages of using XAI to assist experts in analysing model outputs.

Credit risk, Data Quality, Model ensembles, SHAP, XAI
1865-0929
420-429
Springer Cham
Tiukhova, Elena
d892421d-5c0a-4091-9af2-a738e71518e7
Salcuni, Adriano
25d3ff55-e6c3-44ab-977e-e36c91f005ed
Oguz, Can
d557a13a-758f-4e65-9135-384111315954
Niglio, Marcella
45756ccf-4e29-4268-bc9f-e4c9c318bb30
Storti, Giuseppe
18824ee4-f42f-4120-9893-b4ff76bc32aa
Forte, Fabio
26f37cd9-1d89-4496-a9c7-bad292c3efee
Baesens, Bart
f7c6496b-aa7f-4026-8616-ca61d9e216f0
Snoeck, Monique
9aee96bc-8a57-4c37-bcd7-e83f0b173ee1
Meo, Rosa
Silvestri, Fabrizio
Tiukhova, Elena
d892421d-5c0a-4091-9af2-a738e71518e7
Salcuni, Adriano
25d3ff55-e6c3-44ab-977e-e36c91f005ed
Oguz, Can
d557a13a-758f-4e65-9135-384111315954
Niglio, Marcella
45756ccf-4e29-4268-bc9f-e4c9c318bb30
Storti, Giuseppe
18824ee4-f42f-4120-9893-b4ff76bc32aa
Forte, Fabio
26f37cd9-1d89-4496-a9c7-bad292c3efee
Baesens, Bart
f7c6496b-aa7f-4026-8616-ca61d9e216f0
Snoeck, Monique
9aee96bc-8a57-4c37-bcd7-e83f0b173ee1
Meo, Rosa
Silvestri, Fabrizio

Tiukhova, Elena, Salcuni, Adriano, Oguz, Can, Niglio, Marcella, Storti, Giuseppe, Forte, Fabio, Baesens, Bart and Snoeck, Monique (2025) Boosting credit risk data quality using machine learning and eXplainable AI techniques. In, Meo, Rosa and Silvestri, Fabrizio (eds.) Machine Learning and Principles and Practice of Knowledge Discovery in Databases. ECML PKDD 2023. (Communications in Computer and Information Science, 2137) 1 ed. Springer Cham, pp. 420-429. (doi:10.1007/978-3-031-74643-7_30).

Record type: Book Section

Abstract

Data play a crucial role in decision making for modern businesses, and the reliability of these decisions highly depends on the data quality. This problem is particularly relevant when we deal with financial data used for risk assessment and reporting. Basel regulations mandate that banks hold a certain amount of capital based on the level of risk in their portfolios expressed in terms of risk-weighted assets (RWAs). The quality of a bank’s risk management is directly impacted by the quality of the asset data used to calculate RWAs. In this study, we present a data quality (DQ) framework and show how machine learning paired with eXplainable Artificial Intelligence (XAI) techniques can be used to perform automatic DQ monitoring using a human-in-the-loop approach in credit risk. By obtaining expert feedback about model output enriched with XAI explanations, we clearly demonstrate the power of ML in terms of enhancing credit risk data quality and the advantages of using XAI to assist experts in analysing model outputs.

Text
MIDAS camera-ready - Accepted Manuscript
Restricted to Repository staff only until 1 January 2026.
Request a copy

More information

Published date: 1 January 2025
Keywords: Credit risk, Data Quality, Model ensembles, SHAP, XAI

Identifiers

Local EPrints ID: 499416
URI: http://eprints.soton.ac.uk/id/eprint/499416
ISSN: 1865-0929
PURE UUID: a0dddca3-c3e3-4185-9d85-4ee35de1ed8b
ORCID for Bart Baesens: ORCID iD orcid.org/0000-0002-5831-5668

Catalogue record

Date deposited: 19 Mar 2025 17:41
Last modified: 20 Mar 2025 02:40

Export record

Altmetrics

Contributors

Author: Elena Tiukhova
Author: Adriano Salcuni
Author: Can Oguz
Author: Marcella Niglio
Author: Giuseppe Storti
Author: Fabio Forte
Author: Bart Baesens ORCID iD
Author: Monique Snoeck
Editor: Rosa Meo
Editor: Fabrizio Silvestri

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×