Playing repeated network interdiction games with semi-bandit feedback

Guo, Qingyu, An, Bo and Tran-Thanh, Long (2017) Playing repeated network interdiction games with semi-bandit feedback. In Twenty-Sixth International Joint Conference on Artificial Intelligence (IJCAI-17). 9 pp. (In Press)

Record type: Conference or Workshop Item (Paper)

Abstract

We study repeated network interdiction games with no prior knowledge of the adversary or the environment, which can model many real-world network security domains. Existing works often require the defender to have extensive information available and neglect the frequent interactions between the two players; these assumptions are unrealistic and impractical, and thus are not suitable for our setting. We therefore provide the first defender strategy that enjoys both theoretical and practical performance guarantees, by applying the adversarial online learning approach. In particular, we model the repeated network interdiction game with no prior knowledge as an online linear optimization problem, for which we propose a novel and efficient online learning algorithm, SBGA, that exploits the unique semi-bandit feedback in network security domains. We prove that SBGA achieves sublinear regret against an adaptive adversary, compared with both the best fixed strategy in hindsight and a near-optimal adaptive strategy. Extensive experiments also show that SBGA significantly outperforms existing approaches and converges quickly.

Text

online learning in network flow interdiction game - Accepted Manuscript

Available under License Creative Commons GNU GPL (Software).
