The University of Southampton
University of Southampton Institutional Repository

Interpretable and accurate identification of job seekers at risk of long-term unemployment: explainable ML-based profiling

Interpretable and accurate identification of job seekers at risk of long-term unemployment: explainable ML-based profiling
Interpretable and accurate identification of job seekers at risk of long-term unemployment: explainable ML-based profiling
To tackle the societal and person-specific adverse consequences of long-term
unemployment, many public employment services (PES) have implemented datadriven profiling systems to promptly identify vulnerable jobseekers. More recently, PES increasingly rely on more complex machine learning (ML) models due to their enhanced accuracy. However, increasing concerns are raised regarding the algorithmic opacity, which hinders comprehension and trust in the predictions. The current study focuses on the explainability of the ML-based profiling model deployed at the Flemish PES (VDAB), aiming to predict clients’ likelihood of securing sustainable employment. We compare two explainability techniques: (1) TreeSHAP is a state-of the art method grounded in the theoretical properties of the Shapley values, and (2) TreeInterpreter is a computationally feasible approximation that foregoes some of these properties. Leveraging multiple evaluation metrics, our findings suggest that for tree-based models, approximations to the SHAP (SHapley Additive exPlanations) values yield very similar insights and maintain explanatory performance while minimizing computational overhead. This enables institutions with large client bases to generate real-time explanations without being compelled to deteriorate the model’s accuracy. Additionally, our analysis identifies key predictors of job seekers’ employer prospects, offering valuable insights for PES and related agencies striving to improve their support for jobseekers in need. Clients’ online behavior, acting as a proxy for hard-to-measure job search intensity and motivation, emerges as a key component in the profiling model, presenting promising opportunities for future profiling efforts.
2662-995X
Dossche, Wouter
d31e5e3a-5a25-4cd0-b665-214366bb5e77
Vansteenkiste, Sarah
b01dc6fb-109d-44e3-b28d-b7a753afce8f
Baesens, Bart
f7c6496b-aa7f-4026-8616-ca61d9e216f0
Lemahieu, Wilfried
be4bae3f-12b9-417a-91a1-c3c264ffe068
Dossche, Wouter
d31e5e3a-5a25-4cd0-b665-214366bb5e77
Vansteenkiste, Sarah
b01dc6fb-109d-44e3-b28d-b7a753afce8f
Baesens, Bart
f7c6496b-aa7f-4026-8616-ca61d9e216f0
Lemahieu, Wilfried
be4bae3f-12b9-417a-91a1-c3c264ffe068

Dossche, Wouter, Vansteenkiste, Sarah, Baesens, Bart and Lemahieu, Wilfried (2024) Interpretable and accurate identification of job seekers at risk of long-term unemployment: explainable ML-based profiling. SN Computer Science. (In Press)

Record type: Article

Abstract

To tackle the societal and person-specific adverse consequences of long-term
unemployment, many public employment services (PES) have implemented datadriven profiling systems to promptly identify vulnerable jobseekers. More recently, PES increasingly rely on more complex machine learning (ML) models due to their enhanced accuracy. However, increasing concerns are raised regarding the algorithmic opacity, which hinders comprehension and trust in the predictions. The current study focuses on the explainability of the ML-based profiling model deployed at the Flemish PES (VDAB), aiming to predict clients’ likelihood of securing sustainable employment. We compare two explainability techniques: (1) TreeSHAP is a state-of the art method grounded in the theoretical properties of the Shapley values, and (2) TreeInterpreter is a computationally feasible approximation that foregoes some of these properties. Leveraging multiple evaluation metrics, our findings suggest that for tree-based models, approximations to the SHAP (SHapley Additive exPlanations) values yield very similar insights and maintain explanatory performance while minimizing computational overhead. This enables institutions with large client bases to generate real-time explanations without being compelled to deteriorate the model’s accuracy. Additionally, our analysis identifies key predictors of job seekers’ employer prospects, offering valuable insights for PES and related agencies striving to improve their support for jobseekers in need. Clients’ online behavior, acting as a proxy for hard-to-measure job search intensity and motivation, emerges as a key component in the profiling model, presenting promising opportunities for future profiling efforts.

Text
Submission SN Computer Science (SNCS-D-23-04687_R1) - Accepted Manuscript
Restricted to Repository staff only until 9 April 2025.
Available under License Other.
Request a copy

More information

Accepted/In Press date: 9 April 2024

Identifiers

Local EPrints ID: 489282
URI: http://eprints.soton.ac.uk/id/eprint/489282
ISSN: 2662-995X
PURE UUID: e95c3404-1ffd-4237-bdfc-9d6c40c3a085
ORCID for Bart Baesens: ORCID iD orcid.org/0000-0002-5831-5668

Catalogue record

Date deposited: 19 Apr 2024 16:33
Last modified: 20 Apr 2024 01:42

Export record

Contributors

Author: Wouter Dossche
Author: Sarah Vansteenkiste
Author: Bart Baesens ORCID iD
Author: Wilfried Lemahieu

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×