Bayesian learning for multi-agent coordination

Multi-agent systems draw together a number of significant trends in modern technology: ubiquity, decentralisation, openness, dynamism and uncertainty. As work in these fields develops, such systems face increasing challenges. Two particular challenges are decision making in uncertain and partially observable environments, and coordination with other agents in such environments. Although uncertainty and coordination have been tackled as separate problems, formal models for an integrated approach are typically restricted to simple classes of problem and do not scale to problems with tens of agents and millions of states.
We improve on these approaches by extending a principled Bayesian model into more challenging domains, using Bayesian networks to visualise specific cases of the model and thus to aid in deriving the update equations for the system. One approach that has been shown to scale well for networked offline problems uses finite state machines to model other agents. We use this insight to develop an approximate, scalable algorithm applicable to our general model, combining it with adaptations of a number of existing approximation techniques, including state clustering.
We examine the performance of this approximate algorithm on several cases of an urban rescue problem with respect to differing problem parameters. Specifically, we first consider scenarios where agents are aware of the complete situation but are not certain about the behaviour of others; that is, our model with all elements but the actions observable. Secondly, we examine the more complex case where agents can see the actions of others but cannot see the full state, and thus are not sure about the beliefs of others. Finally, we look at the performance of the partially observable state model when the system is dynamic or open. We find that our best-response algorithm consistently outperforms a handwritten strategy for the problem, and more noticeably so as the number of agents and the number of states involved in the problem increase.

Allen-Williams, Mair

42b38da6-4f58-48f2-8951-7d2a30664517

March 2009

Jennings, Nick

ab3d94cc-247c-4545-9d1e-65873d6cdb30

Allen-Williams, Mair (2009) Bayesian learning for multi-agent coordination. University of Southampton, School of Electronics and Computer Science, Doctoral Thesis, 223pp.

Record type: Thesis (Doctoral)

Text

thesis-for-binding.pdf - Other

Download (1MB)

More information

Published date: March 2009

Organisations: University of Southampton

Identifiers

Local EPrints ID: 65956

URI: http://eprints.soton.ac.uk/id/eprint/65956

PURE UUID: 38448470-3af4-4d7f-bdbe-ee778ea3ca7b

Catalogue record

Date deposited: 08 Apr 2009

Last modified: 13 Mar 2024 18:02

Contributors

Author: Mair Allen-Williams

Thesis advisor: Nick Jennings
