The University of Southampton
University of Southampton Institutional Repository

A Unifying Framework for Iterative Approximate Best–Response Algorithms for Distributed Constraint Optimisation Problems

A Unifying Framework for Iterative Approximate Best–Response Algorithms for Distributed Constraint Optimisation Problems
A Unifying Framework for Iterative Approximate Best–Response Algorithms for Distributed Constraint Optimisation Problems
Distributed constraint optimisation problems (DCOPs) are important in many areas of computer science and optimisation. In a DCOP, each variable is controlled by one of many autonomous agents, who together have the joint goal of maximising a global objective function. A wide variety of techniques have been explored to solve such problems, and here we focus on one of the main families, namely iterative approximate best–response algorithms used as local search algorithms for DCOPs. We define these algorithms as those in which, at each iteration, agents communicate only the states of the variables under their control to their neighbours on the constraint graph, and that reason about their next state based on the messages received from their neighbours. These algorithms include the distributed stochastic algorithm and stochastic coordination algorithms, the maximum–gain messaging algorithms, the families of fictitious play and adaptive play algorithms, and algorithms that use regret–based heuristics. This family of algorithms is commonly employed in real world systems, as they can be used in domains where communication is difficult or costly, where it is appropriate to trade timeliness off against optimality, or where hardware limitations render complete or more computationally intensive algorithms unusable. However, until now, no overarching framework has existed for analysing this broad family of algorithms, resulting in similar and overlapping work being published independently in several different literatures. The main contribution of this paper, then, is the development of a unified analytical framework for studying such algorithms. This framework is built on our insight that when formulated as noncooperative games, DCOPs form a subset of the class of potential games. This result allows us to prove convergence properties of iterative approximate best–response algorithms developed in the computer science literature using game theoretic methods (which also shows that such algorithms can also be applied to the more general problem of finding Nash equilibria in potential games), and, conversely, also allows us to show that many game–theoretic algorithms can be used to solve DCOPs. By so doing, our framework can assist system designers by making the pros and cons of, and the synergies between, the various iterative approximate best–response DCOP algorithm components clear.
411-444
Chapman, Archie
2eac6920-2aff-49ab-8d8e-a0ea3e39ba60
Rogers, Alex
f9130bc6-da32-474e-9fab-6c6cb8077fdc
Jennings, Nicholas R.
ab3d94cc-247c-4545-9d1e-65873d6cdb30
Leslie, David
e4fa8a59-bf57-41c8-ba80-3cffe7e5076e
Chapman, Archie
2eac6920-2aff-49ab-8d8e-a0ea3e39ba60
Rogers, Alex
f9130bc6-da32-474e-9fab-6c6cb8077fdc
Jennings, Nicholas R.
ab3d94cc-247c-4545-9d1e-65873d6cdb30
Leslie, David
e4fa8a59-bf57-41c8-ba80-3cffe7e5076e

Chapman, Archie, Rogers, Alex, Jennings, Nicholas R. and Leslie, David (2011) A Unifying Framework for Iterative Approximate Best–Response Algorithms for Distributed Constraint Optimisation Problems. The Knowledge Engineering Review, 26 (4), 411-444.

Record type: Article

Abstract

Distributed constraint optimisation problems (DCOPs) are important in many areas of computer science and optimisation. In a DCOP, each variable is controlled by one of many autonomous agents, who together have the joint goal of maximising a global objective function. A wide variety of techniques have been explored to solve such problems, and here we focus on one of the main families, namely iterative approximate best–response algorithms used as local search algorithms for DCOPs. We define these algorithms as those in which, at each iteration, agents communicate only the states of the variables under their control to their neighbours on the constraint graph, and that reason about their next state based on the messages received from their neighbours. These algorithms include the distributed stochastic algorithm and stochastic coordination algorithms, the maximum–gain messaging algorithms, the families of fictitious play and adaptive play algorithms, and algorithms that use regret–based heuristics. This family of algorithms is commonly employed in real world systems, as they can be used in domains where communication is difficult or costly, where it is appropriate to trade timeliness off against optimality, or where hardware limitations render complete or more computationally intensive algorithms unusable. However, until now, no overarching framework has existed for analysing this broad family of algorithms, resulting in similar and overlapping work being published independently in several different literatures. The main contribution of this paper, then, is the development of a unified analytical framework for studying such algorithms. This framework is built on our insight that when formulated as noncooperative games, DCOPs form a subset of the class of potential games. This result allows us to prove convergence properties of iterative approximate best–response algorithms developed in the computer science literature using game theoretic methods (which also shows that such algorithms can also be applied to the more general problem of finding Nash equilibria in potential games), and, conversely, also allows us to show that many game–theoretic algorithms can be used to solve DCOPs. By so doing, our framework can assist system designers by making the pros and cons of, and the synergies between, the various iterative approximate best–response DCOP algorithm components clear.

Text
ChapmanEtalKER09_preprint.pdf - Accepted Manuscript
Download (302kB)
Text
download.php.pdf - Version of Record
Download (352kB)

More information

Published date: 2011
Organisations: Agents, Interactions & Complexity

Identifiers

Local EPrints ID: 268172
URI: http://eprints.soton.ac.uk/id/eprint/268172
PURE UUID: 38ddb1ba-ec54-41ba-9925-476f3c1ec2be

Catalogue record

Date deposited: 03 Nov 2009 11:39
Last modified: 14 Mar 2024 09:04

Export record

Contributors

Author: Archie Chapman
Author: Alex Rogers
Author: Nicholas R. Jennings
Author: David Leslie

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×