Bilevel hyperparameter optimization for nonlinear support vector machines
Bilevel hyperparameter optimization for nonlinear support vector machines
While the problem of tuning the hyperparameters of a support vector machine (SVM) via cross-validation is easily understood as a bilevel optimization problem, so far, the corresponding literature has mainly focused on the linear-kernel case. In this paper, we establish a theoretical framework for the development of bilevel optimization-based methods for tuning the hyperparameters of an SVM in the case where a nonlinear kernel is adopted, which affords the ability to capture highly-complex relationships between the points in the dataset. By leveraging a Karush-Kuhn-Tucker (KKT)/mathematical program with equilibrium constraints (MPEC) reformulation of the (lower-level) training problem, we develop atheoretical framework for the SVM hyperparameter-tuning problem that established under which assumptions and conditions suitable qualification conditions including the Mangasarian–Fromovitz, the linear-independence, and the strong second order sufficient conditions are satisfied. We then illustrate the need for this theoretical framework in the context of the well-known Scholtes relaxation algorithm for solving the MPEC reformulation of our bilevel hyperparameter problem for SVMs. Numerical experiments are conducted to demonstrate the potential of this algorithm for examples of nonlinear SVM problems.
Zemkoho, Alain
30c79e30-9879-48bd-8d0b-e2fbbc01269e
29 August 2023
Zemkoho, Alain
30c79e30-9879-48bd-8d0b-e2fbbc01269e
[Unknown type: UNSPECIFIED]
Abstract
While the problem of tuning the hyperparameters of a support vector machine (SVM) via cross-validation is easily understood as a bilevel optimization problem, so far, the corresponding literature has mainly focused on the linear-kernel case. In this paper, we establish a theoretical framework for the development of bilevel optimization-based methods for tuning the hyperparameters of an SVM in the case where a nonlinear kernel is adopted, which affords the ability to capture highly-complex relationships between the points in the dataset. By leveraging a Karush-Kuhn-Tucker (KKT)/mathematical program with equilibrium constraints (MPEC) reformulation of the (lower-level) training problem, we develop atheoretical framework for the SVM hyperparameter-tuning problem that established under which assumptions and conditions suitable qualification conditions including the Mangasarian–Fromovitz, the linear-independence, and the strong second order sufficient conditions are satisfied. We then illustrate the need for this theoretical framework in the context of the well-known Scholtes relaxation algorithm for solving the MPEC reformulation of our bilevel hyperparameter problem for SVMs. Numerical experiments are conducted to demonstrate the potential of this algorithm for examples of nonlinear SVM problems.
Text
BHO-PaperV2
- Author's Original
Restricted to Repository staff only
Request a copy
More information
Published date: 29 August 2023
Identifiers
Local EPrints ID: 509412
URI: http://eprints.soton.ac.uk/id/eprint/509412
PURE UUID: fd66ed46-175b-42e0-be2c-1bb016957cb8
Catalogue record
Date deposited: 20 Feb 2026 17:46
Last modified: 21 Feb 2026 02:51
Export record
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics