|
SESSION: Invited talks |
|
|
|
Using agents and autonomic computing to build next generation seamless mobility services |
|
John Strassner
|
|
Article No.: 1 |
|
doi>10.1145/1329125.1329127 |
|
Full text: PDF
|
|
Next generation networks seek to integrate wide-area and local-area wireless and wired systems in order to provide seamless services to the end user, providing freedom of movement between metropolitan/enterprise and indoor/outdoor coverage while maintaining ...
Next generation networks seek to integrate wide-area and local-area wireless and wired systems in order to provide seamless services to the end user, providing freedom of movement between metropolitan/enterprise and indoor/outdoor coverage while maintaining continuity of applications experience. Seamless Mobility, and its vision of seamless service delivery, requires significant changes to existing wired and wireless network management systems. Seamless Mobility is an experiential architecture, predicated on providing mechanisms that enable a user to accomplish his or her tasks without regard to technology. This talk will describe a novel autonomic architecture that uses a variety of different types of agents to synthesize knowledge about the environment, the context of users and their service and resource needs, and the capabilities and constraints placed on the network at any given time. Agents act according to a unique context-aware policy governance approach, and work autonomously to orchestrate system behavior according to business goals. This talk will conclude with a summary of research directions at Motorola Labs and future work to be undertaken. expand
|
|
Multiagent systems for autonomic computing |
|
Jeffrey A. Kephart
|
|
Article No.: 2 |
|
doi>10.1145/1329125.1329128 |
|
Full text: PDF
|
|
Autonomic computing is a perfect application domain for autonomous agents and multiagent systems. Expanding on this theme, I will illustrate it with various prototypes my team has developed. Rather than building on existing agent platforms, we have started ...
Autonomic computing is a perfect application domain for autonomous agents and multiagent systems. Expanding on this theme, I will illustrate it with various prototypes my team has developed. Rather than building on existing agent platforms, we have started from existing products and tried to extend them to have some agent-like properties. Therefore, the agents are somewhat coarse and primitive, and "multi" means 2 or 3 in our prototypes, but these agents have the advantage of being very real and practical. And, even with just two agents, we have run into some nasty issues where the two try to adapt simultaneously and work at cross purposes unless we do something explicit to prevent it. We also have some novel machine learning approaches and applications. I will conclude by describing several research challenges for the agent research community. For close to 15 years, researchers in agents have provided a growing body of evidence that taskwork and teamwork are separable, and have repeatedly demonstrated the benefits of such separation. I will survey my group's role in investigating this hypothesis with physical robots, the lessons learned, and the challenges and opportunities provided by viewing robots as agents, too. expand
|
|
Automated negotiation in open environments: (by ACM/SIGART autonomous agents research award winner) |
|
Sarit Kraus
|
|
Article No.: 3 |
|
doi>10.1145/1329125.1329129 |
|
Full text: PDF
|
|
Agents in open environments lack a central mechanism for controlling their behavior, and they may encounter human decision-makers whose behavior is affected by social and psychological factors. Examples include online markets, patient care-delivery systems; ...
Agents in open environments lack a central mechanism for controlling their behavior, and they may encounter human decision-makers whose behavior is affected by social and psychological factors. Examples include online markets, patient care-delivery systems; virtual reality and simulation systems used for training and IT systems administration. As open environments increase in number and complexity, they pose significant challenges for agent designers. In this talk, I will discuss several research thrusts that address some of the challenges of automated agent design and evaluation in open environments. These agents are self-interested in the sense that they aim to fulfill their own goals as efficiently as possible. However, they may still cooperate if such behavior can serve either their short-or long-term interests or goals. I will present algorithms for representing and learning agents' goals and capabilities that have been evaluated under diverse settings varying in the extent to which agents are cooperative or competitive. Lastly, I will describe Color Trails, a game infrastructure for investigating negotiation strategies in open environments. Color Trails provides a realistic but modeling-tractable setting that facilitates the design and evaluation of automated decision making by agents. expand
|
|
Robots are agents, too! |
|
Gal Kaminka
|
|
Article No.: 4 |
|
doi>10.1145/1329125.1329130 |
|
Full text: PDF
|
|
Recent years are seeing dramatically growing interest in robots, side-by-side, paradoxically, with a worrisome decline in robotics work within the autonomous agent community. This, despite the significant opportunities enabled by viewing robots as agents. ...
Recent years are seeing dramatically growing interest in robots, side-by-side, paradoxically, with a worrisome decline in robotics work within the autonomous agent community. This, despite the significant opportunities enabled by viewing robots as agents. In this talk, I will argue for such an inclusive view, by examining multi-robot teams. From an agent perspective, challenges in building such teams are many: Some are related to a particular task (taskwork), and some are related to the interactions between agents (teamwork). For close to 15 years, researchers in agents have provided a growing body of evidence that taskwork and teamwork are separable, and have repeatedly demonstrated the benefits of such separation. I will survey my group's role in investigating this hypothesis with physical robots, the lessons learned, and the challenges and opportunities provided by viewing robots as agents, too. expand
|
|
SESSION: Embodied agents and architectures: full papers |
|
|
|
What can i do with this?: finding possible interactions between characters and objects |
|
Pedro Sequeira,
Marco Vala,
Ana Paiva
|
|
Article No.: 5 |
|
doi>10.1145/1329125.1329132 |
|
Full text: PDF
|
|
Virtual environments are often populated by autonomous synthetic agents capable of acting and interacting with other agents as well as with humans. These virtual worlds also include objects that may have different uses and types of interactions. As such, ...
Virtual environments are often populated by autonomous synthetic agents capable of acting and interacting with other agents as well as with humans. These virtual worlds also include objects that may have different uses and types of interactions. As such, these agents need to identify possible interactions with the objects in the environment and measure the consequences of these interactions. This is particularly difficult when the agents never interacted with some of the objects beforehand. This paper describes SOTAI - Smart ObjecT-Agent Interaction, a framework that will help agents to identify possible interactions with unknown objects based on their past experiences. In SOTAI, agents can learn world regularities, like object attributes and frequent relations between attributes. They gather qualitative symbolic descriptions from their sensorial data when interacting with objects and perform inductive reasoning to acquire concepts about them. We implemented an initial case study and the results show that our agents are able to acquire valid conceptual knowledge. expand
|
|
Aborting tasks in BDI agents |
|
John Thangarajah,
James Harland,
David Morley,
Neil Yorke-Smith
|
|
Article No.: 6 |
|
doi>10.1145/1329125.1329133 |
|
Full text: PDF
|
|
Intelligent agents that are intended to work in dynamic environments must be able to gracefully handle unsuccessful tasks and plans. In addition, such agents should be able to make rational decisions about an appropriate course of action, which may include ...
Intelligent agents that are intended to work in dynamic environments must be able to gracefully handle unsuccessful tasks and plans. In addition, such agents should be able to make rational decisions about an appropriate course of action, which may include aborting a task or plan, either as a result of the agent's own deliberations, or potentially at the request of another agent. In this paper we investigate the incorporation of aborts into a BDI-style architecture. We discuss some conditions under which aborting a task or plan is appropriate, and how to determine the consequences of such a decision. We augment each plan with an optional abort-method, analogous to the failure method found in some agent programming languages. We provide an operational semantics for the execution cycle in the presence of aborts in the abstract agent language CAN, which enables us to specify a BDI-based execution model without limiting our attention to a particular agent system (such as JACK, Jadex, Jason, or SPARK). A key technical challenge we address is the presence of parallel execution threads and of sub-tasks, which require the agent to ensure that the abort methods for each plan are carried out in an appropriate sequence. expand
|
|
Goals in the context of BDI plan failure and planning |
|
Sebastian Sardina,
Lin Padgham
|
|
Article No.: 7 |
|
doi>10.1145/1329125.1329134 |
|
Full text: PDF
|
|
We develop a Belief-Desire-Intention (BDI) style agent-oriented programming language with special emphasis on the semantics of goals in the presence of the typical BDI failure handling present in many BDI systems and a novel account of hierarchical ...
We develop a Belief-Desire-Intention (BDI) style agent-oriented programming language with special emphasis on the semantics of goals in the presence of the typical BDI failure handling present in many BDI systems and a novel account of hierarchical lookahead planning. The work builds incrementally on two existing languages and accommodates three type of goals: classical BDI-style event goals, declarative goals, and planning goals. We mainly focus on the dynamics of these type of goals and, in particular, on a kind of commitment scheme that brings the new language closer to the solid existing work in agent theory. To that end, we develop a semantics that recognises the usual hierarchical structure of active goals as well as their declarative aspects. In contrast with previous languages, the new language prevents an agent from blindly persisting with a (blocked) subsidiary goal when an alternative strategy for achieving a higher-level motivating goal exists. In addition, the new semantics ensures watchfulness by the agent to ensure that goals that succeed or are deemed impossible are immediately dropped, thus conforming to the requirements of basic rational commitment strategy. Finally, a mechanism for the proactive adoption of new goals, other than the mere reaction to events, and a formal account of interaction with the external environment are provided. We believe that the new language is an important step towards turning practical BDI programming languages more compatible with the established results in the area of agent theory. expand
|
|
Evaluating a conversation-centered interactive drama |
|
Manish Mehta,
Steven Dow,
Michael Mateas,
Blair MacIntyre
|
|
Article No.: 8 |
|
doi>10.1145/1329125.1329135 |
|
Full text: PDF
|
|
There is a growing interest in developing technologies for creating interactive dramas [13, 22]. Evaluating them, however, remains an open research problem. In this paper, we present a method for evaluating the technical and design approaches employed ...
There is a growing interest in developing technologies for creating interactive dramas [13, 22]. Evaluating them, however, remains an open research problem. In this paper, we present a method for evaluating the technical and design approaches employed in a conversation-centered interactive drama. This method correlates players' subjective experience during conversational breakdowns, captured using retrospective protocols, with the corresponding AI processing in the input language understanding and dialog management subsystems. The methodology is employed to analyze conversation breakdowns in the interactive drama Façade. We find that the narrative cues offered by an interactive drama, coupled with believable character performance, can allow players to interpretively bridge system limitations and avoid experiencing a conversation breakdown. Further, we find that, contrary to standard practice for task-oriented conversation systems, using shallowly understood information as part of the system output hampers the player experience in an interactive drama. expand
|
|
SESSION: Embodied agents and architectures: poster papers |
|
|
|
Automatic annotation of team actions in observations of embodied agents |
|
Linus J. Luotsinen,
Hans Fernlund,
Ladislau Bölöni
|
|
Article No.: 9 |
|
doi>10.1145/1329125.1329137 |
|
Full text: PDF
|
|
Recognizing and annotating the occurrence of team actions in observations of embodied agents has applications in surveillance and in training of military or sport teams. We describe the team actions through a spatio-temporal correlated pattern of movement, ...
Recognizing and annotating the occurrence of team actions in observations of embodied agents has applications in surveillance and in training of military or sport teams. We describe the team actions through a spatio-temporal correlated pattern of movement, which can be modeled by a Hidden Markov Model. The hand-crafting of these models is a difficult task of knowledge engineering, even in application domains where explicit, natural language descriptions of the team actions are available. The main contribution of this paper is an approach through which the library of HMM representations can be acquired from a small number of hand annotated, representative samples of the specific movement patterns. A series of experiments, performed on a dataset describing a real-world terrestrial warfare exercise validates our method and shows good recognition accuracy even in the presence of noisy data. The speed of the recognition engine is sufficiently fast to allow real time annotation of incoming observations. expand
|
|
Agents that remember can tell stories: integrating autobiographic memory into emotional agents |
|
Wan Ching Ho,
João Dias,
Rui Figueiredo,
Ana Paiva
|
|
Article No.: 10 |
|
doi>10.1145/1329125.1329138 |
|
Full text: PDF
|
|
For the past few years many new applications are being developed featuring interactive environments populated with autonomous virtual agents capable of acting according to their goals, beliefs and even social relations. Such agents must be able ...
For the past few years many new applications are being developed featuring interactive environments populated with autonomous virtual agents capable of acting according to their goals, beliefs and even social relations. Such agents must be able to interact with each other, and more importantly with the user, thus involving the users in an engaging narrative experience. To achieve these, in this paper we describe the essential event structure in an autobiographic memory, event reconstructions in memory retrieval process and the influences of such past events in interpersonal relations. expand
|
|
EBDI: an architecture for emotional agents |
|
Hong Jiang,
Jose M. Vidal,
Michael N. Huhns
|
|
Article No.: 11 |
|
doi>10.1145/1329125.1329139 |
|
Full text: PDF
|
|
Most of the research on multiagent systems has focused on the development of rational utility-maximizing agents. However, research shows that emotions have a strong effect on peoples' physical states, motivations, beliefs, and desires. By introducing ...
Most of the research on multiagent systems has focused on the development of rational utility-maximizing agents. However, research shows that emotions have a strong effect on peoples' physical states, motivations, beliefs, and desires. By introducing primary and secondary emotion into BDI architecture, we present a generic architecture for an emotional agent, EBDI, which can merge various emotion theories with an agent's reasoning process. It implements practical reasoning techniques separately from the specific emotion mechanism. The separation allows us to plug in emotional models as needed or upgrade the agent's reasoning engine independently. expand
|
|
Extending character-based storytelling with awareness and feelings |
|
David Pizzi,
Marc Cavazza,
Jean-Luc Lugrin
|
|
Article No.: 12 |
|
doi>10.1145/1329125.1329140 |
|
Full text: PDF
|
|
Most Interactive Storytelling systems developed to date have followed a task-based approach to story representation, using planning techniques to drive the story by generating a sequence of actions, which essentially "solve" the task to which the story ...
Most Interactive Storytelling systems developed to date have followed a task-based approach to story representation, using planning techniques to drive the story by generating a sequence of actions, which essentially "solve" the task to which the story is equated. One major limitation of this approach has been that it fails to incorporate characters' psychology, and as a consequence important aesthetic aspects of the narrative cannot be easily captured by Interactive Storytelling. In this paper, we introduce a new approach to Interactive Storytelling, which aims at reconciling narrative actions with the characters' attributed psychology as stated in the narrative. Our long-term goal is to be able to explore Interactive Storytelling for those narrative genres which are based on the characters' psychology rather than solely on their actions. We used as a starting point the formalisation by Flaubert himself of his novel Madame Bovary, which includes a detailed account of characters' desires and feelings. We describe a prototype in which characters' behaviour is driven by a real-time search-based planning system applying operators whose content is based on a specific inventory of feelings. Furthermore, the actual pattern of evolution of the character's plan, as measured through the variation of the search heuristic, is used to confer a sense of awareness to the characters, which can be used to generate feelings about its overall situation, from feelings of boredom to hope. expand
|
|
Approximate state estimation in multiagent settings with continuous or large discrete state spaces |
|
Prashant Doshi
|
|
Article No.: 13 |
|
doi>10.1145/1329125.1329141 |
|
Full text: PDF
|
|
We present a new method for carrying out state estimation in multi-agent settings that are characterized by continuous or large discrete state spaces. State estimation in multiagent settings involves updating an agent's belief over the physical states ...
We present a new method for carrying out state estimation in multi-agent settings that are characterized by continuous or large discrete state spaces. State estimation in multiagent settings involves updating an agent's belief over the physical states and the space of other agents' models. We factor out the models of the other agents and update the agent's belief over these models, as exactly as possible. Simultaneously, we sample particles from the distribution over the large physical state space and project the particles in time. expand
|
|
Dynamic movement and positioning of embodied agents in multiparty conversations |
|
Dusan Jan,
David R. Traum
|
|
Article No.: 14 |
|
doi>10.1145/1329125.1329142 |
|
Full text: PDF
|
|
For embodied agents to engage in realistic multiparty conversation, they must stand in appropriate places with respect to other agents and the environment. When these factors change, such as an agent joining the conversation, the agents must dynamically ...
For embodied agents to engage in realistic multiparty conversation, they must stand in appropriate places with respect to other agents and the environment. When these factors change, such as an agent joining the conversation, the agents must dynamically move to a new location and/or orientation to accommodate. This paper presents an algorithm for simulating movement of agents based on observed human behavior using techniques developed for pedestrian movement in crowd simulations. We extend a previous group conversation simulation to include an agent motion algorithm. We examine several test cases and show how the simulation generates results that mirror real-life conversation settings. expand
|
|
SESSION: Partially cooperative multiagent systems: full papers |
|
|
|
Meta-level coordination for solving negotiation chains in semi-cooperative multi-agent systems |
|
Xiaoqin Zhang,
Victor Lesser
|
|
Article No.: 15 |
|
doi>10.1145/1329125.1329144 |
|
Full text: PDF
|
|
A negotiation chain is formed when multiple related negotiations are spread over multiple agents. In order to appropriately order and structure the negotiations occurring in the chain so as to optimize the expected utility, we present an extension to ...
A negotiation chain is formed when multiple related negotiations are spread over multiple agents. In order to appropriately order and structure the negotiations occurring in the chain so as to optimize the expected utility, we present an extension to a single-agent concurrent negotiation framework. This work is aimed at semi-cooperative multi-agent systems, where each agent has its own goals and works to maximize its local utility; however, the performance of each individual agent is tightly related to other agent's cooperation and the system's overall performance. We introduce a pre-negotiation phase that allows agents to transfer meta-level information. Using this information, the agent can build a more accurate model of the negotiation in terms of modeling the relationship of flexibility and success probability. This more accurate model helps the agent in choosing a better negotiation solution in the global negotiation chain context. The agent can also use this information to allocate appropriate time for each negotiation, hence to find a good ordering of all related negotiations. The experimental data shows that these mechanisms improve the agents' and the system's overall performance significantly. expand
|
|
Policy recognition for multi-player tactical scenarios |
|
Gita Sukthankar,
Katia Sycara
|
|
Article No.: 16 |
|
doi>10.1145/1329125.1329145 |
|
Full text: PDF
|
|
This paper addresses the problem of recognizing policies given logs of battle scenarios from multi-player games. The ability to identify individual and team policies from observations is important for a wide range of applications including automated ...
This paper addresses the problem of recognizing policies given logs of battle scenarios from multi-player games. The ability to identify individual and team policies from observations is important for a wide range of applications including automated commentary generation, game coaching, and opponent modeling. We define a policy as a preference model over possible actions based on the game state, and a team policy as a collection of individual policies along with an assignment of players to policies. This paper explores two promising approaches for policy recognition: (1) a model-based system for combining evidence from observed events using Dempster-Shafer theory, and (2) a data-driven discriminative classifier using support vector machines (SVMs). We evaluate our techniques on logs of real and simulated games played using Open Gaming Foundation d20, the rule system used by many popular tabletop games, including Dungeons and Dragons. expand
|
|
Emergence of global network property based on multi-agent voting model |
|
Kosuke Shinoda,
Yutaka Matsuo,
Hideyuki Nakashima
|
|
Article No.: 17 |
|
doi>10.1145/1329125.1329146 |
|
Full text: PDF
|
|
Recent studies have shown that various models can explain the emergence of complex networks, such as scale-free and small-world networks. This paper presents a different model to generate complex networks using a multi-agent approach. Each node is considered ...
Recent studies have shown that various models can explain the emergence of complex networks, such as scale-free and small-world networks. This paper presents a different model to generate complex networks using a multi-agent approach. Each node is considered as an agent. Based on voting by all agents, edges are added repeatedly. We use four different kinds of centrality measures as a utility functions for agents. Depending on the centrality measure, the resultant networks differ considerably: typically, closeness centrality generates a scale-free network, degree centrality produces a random graph, betweenness centrality favors a regular graph, and eigenvector centrality brings a complete subgraph. The importance of the network structure among agents is widely noted in the multi-agent research literature. This paper contributes new insights into the connection between agents' local behavior and the global property of the network structure. We describe a detailed analysis on why these structures emerge, and present a discussion of the possible expansion and application of the model. expand
|
|
Towards self-organising agent-based resource allocation in a multi-server environment |
|
Tino Schlegel,
Ryszard Kowalczyk
|
|
Article No.: 18 |
|
doi>10.1145/1329125.1329147 |
|
Full text: PDF
|
|
Distributed applications require distributed techniques for efficient resource allocation. These techniques need to take into account the heterogeneity and potential unreliability of resources and resource consumers in a distributed environments. In ...
Distributed applications require distributed techniques for efficient resource allocation. These techniques need to take into account the heterogeneity and potential unreliability of resources and resource consumers in a distributed environments. In this paper we propose a distributed algorithm that solves the resource allocation problem in distributed multi-agent systems. Our solution is based on the self-organisation of agents, which does not require any facilitator or management layer. The resource allocation in the system is a purely emergent effect. We present results of the proposed resource allocation mechanism in the simulated static and dynamic multi-server environment. expand
|
|
SESSION: Partially cooperative multiagent systems: poster papers |
|
|
|
Dynamics of contracts-based organizations: a formal approach based on institutions |
|
Yathiraj B. Udupi,
Munindar P. Singh
|
|
Article No.: 19 |
|
doi>10.1145/1329125.1329149 |
|
Full text: PDF
|
|
An organization comprises a group of collaborating agents (individual agents or nested organizations) that exhibit complex behaviors. Of particular interest are dynamic organizations that form and dissolve as their members' needs change. Such organizations ...
An organization comprises a group of collaborating agents (individual agents or nested organizations) that exhibit complex behaviors. Of particular interest are dynamic organizations that form and dissolve as their members' needs change. Such organizations are important in many applications, including scientific and business computing. Contracts among autonomous agents have long been used to facilitate their collaboration. This paper provides a contracts-based approach for managing organizations. The proposed approach places organizations within institutions, themselves modeled as specialized organizations. Commitments form the basis of contracts and this paper establishes some important dynamic aspects by providing a commitment life cycle analysis. This approach has been applied in a prototype tool to manage organizations. expand
|
|
Arguing for gaining access to information |
|
Sylvie Doutre,
Peter McBurney,
Laurent Perrussel,
Jean-Marc Thevenin
|
|
Article No.: 20 |
|
doi>10.1145/1329125.1329150 |
|
Full text: PDF
|
|
This paper presents a protocol for agents engaged in argumentation over access to information sources. Obtaining relevant information is essential for agents engaged in autonomous, goal-directed behavior, but access to such information is usually controlled ...
This paper presents a protocol for agents engaged in argumentation over access to information sources. Obtaining relevant information is essential for agents engaged in autonomous, goal-directed behavior, but access to such information is usually controlled by other autonomous agents having their own goals. Because these various goals may be in conflict with one another, rational interactions between the two agents may take the form of a dialog, in which requests for information are successively issued, considered, justified and criticized. Even when the agents involved in such discussions agree on all the arguments for and the arguments against granting access to some information source, they may still disagree on their preferences between these arguments. To represent such situations, we design a protocol for dialogs between two autonomous agents for seeking and granting authorization to access some information source. This protocol is based on an argumentation dialog where agents handle specific preferences and acceptability over arguments. expand
|
|
A unified framework for multi-agent agreement |
|
Kiran Lakkaraju,
Les Gasser
|
|
Article No.: 21 |
|
doi>10.1145/1329125.1329151 |
|
Full text: PDF
|
|
Multi-Agent Agreement Problems (MAP) - the ability of a population of agents to search out and converge on a common state - are central issues in many multi-agent settings, from distributed sensor networks, to meeting scheduling, to development of norms, ...
Multi-Agent Agreement Problems (MAP) - the ability of a population of agents to search out and converge on a common state - are central issues in many multi-agent settings, from distributed sensor networks, to meeting scheduling, to development of norms, conventions, and language. While much work has been done on particular agreement problems no unifying framework exists for comparing MAPs that vary in, e.g., strategy space complexity, inter-agent accessibility, and solution type, and understanding their relative complexities. We present such a unification, the Distributed Optimal Agreement (DOA) framework, and show how it captures a wide variety of agreement problems. To demonstrate DOA and its power we apply it to convention evolution. 1 expand
|
|
Autonomous nondeterministic tour guides: improving quality of experience with TTD-MDPs |
|
Andrew S. Cantino,
David L. Roberts,
Charles L. Isbell
|
|
Article No.: 22 |
|
doi>10.1145/1329125.1329152 |
|
Full text: PDF
|
|
In this paper, we address the problem of building a system of autonomous agents for a complex environment, in our case, a museum with many visitors. Visitors may have varying preferences for types of art or may wish to visit different exhibits on multiple ...
In this paper, we address the problem of building a system of autonomous agents for a complex environment, in our case, a museum with many visitors. Visitors may have varying preferences for types of art or may wish to visit different exhibits on multiple visits. Often, these goals conflict. For example, many visitors may wish to see the museum's most popular work, but that could cause congestion, ruining the experience. Thus, our task is to build a set of agents that can satisfy their visitors' goals, while simultaneously providing high quality experiences for all. expand
|
|
Strategic delay in bargaining |
|
Pinata Winoto,
Gordon I. McCalla,
Julita Vassileva
|
|
Article No.: 23 |
|
doi>10.1145/1329125.1329153 |
|
Full text: PDF
|
|
We study strategic delay in bargaining under uncertainty. First, we discuss bargaining situations in which dynamics and uncertainty may induce the use of strategic delay by intelligent agents. Then, we analyze strategic delay under various circumstances. ...
We study strategic delay in bargaining under uncertainty. First, we discuss bargaining situations in which dynamics and uncertainty may induce the use of strategic delay by intelligent agents. Then, we analyze strategic delay under various circumstances. Finally, we propose an agent reasoning mechanism to decide when and how to deploy strategic delay. expand
|
|
When price is not enough: combining logical and numerical issues in bilateral negotiation |
|
Azzurra Ragone,
Tommaso Di Noia,
Eugenio Di Sciascio,
Francesco M. Donini
|
|
Article No.: 24 |
|
doi>10.1145/1329125.1329154 |
|
Full text: PDF
|
|
We present a novel approach to knowledge-based automated one-shot multi-issue bilateral negotiation handling, in a homogeneous setting, both numerical features and non-numerical ones. To this aim we introduce P(N), a propositional logic extended ...
We present a novel approach to knowledge-based automated one-shot multi-issue bilateral negotiation handling, in a homogeneous setting, both numerical features and non-numerical ones. To this aim we introduce P(N), a propositional logic extended with concrete domains, which allows to: model relations among issues (both numerical and not numerical ones) via logical entailment, differently from well-known approaches that describe issues as uncorrelated; represent buyer's request, seller's supply and their respective preferences as formulas endowed with a formal semantics. expand
|
|
SESSION: Communications and commitments: full papers |
|
|
|
Dynamic semantics for agent communication languages |
|
Michael Rovatsos
|
|
Article No.: 25 |
|
doi>10.1145/1329125.1329156 |
|
Full text: PDF
|
|
This paper proposes dynamic semantics for agent communication languages (ACLs) as a method for tackling some of the fundamental problems associated with agent communication in open multiagent systems. Based on the idea of providing alternative semantic ...
This paper proposes dynamic semantics for agent communication languages (ACLs) as a method for tackling some of the fundamental problems associated with agent communication in open multiagent systems. Based on the idea of providing alternative semantic "variants" for speech acts and transition rules between them that are contingent on previous agent behaviour, our framework provides an improved notion of grounding semantics in ongoing interaction, a simple mechanism for distinguishing between compliant and expected behaviour, and a way to specify sanction and reward mechanisms as part of the ACL itself. We extend a common framework for commitment-based ACL semantics to obtain these properties, discuss desiderata for the design of concrete dynamic semantics together with examples, and analyse their properties. expand
|
|
Commitment and extortion |
|
Paul Harrenstein,
Felix Brandt,
Felix Fischer
|
|
Article No.: 26 |
|
doi>10.1145/1329125.1329157 |
|
Full text: PDF
|
|
Making commitments, e.g., through promises and threats, enables a player to exploit the strengths of his own strategic position as well as the weaknesses of that of his opponents. Which commitments a player can make with credibility depends on ...
Making commitments, e.g., through promises and threats, enables a player to exploit the strengths of his own strategic position as well as the weaknesses of that of his opponents. Which commitments a player can make with credibility depends on the circumstances. In some, a player can only commit to the performance of an action, in others, he can commit himself conditionally on the actions of the other players. Some situations even allow for commitments on commitments or for commitments to randomized actions. We explore the formal properties of these types of (conditional) commitment and their interrelationships. So as to preclude inconsistencies among conditional commitments, we assume an order in which the players make their commitments. Central to our analyses is the notion of an extortion, which we define, for a given order of the players, as a profile that contains, for each player, an optimal commitment given the commitments of the players that committed earlier. On this basis, we investigate for different commitment types whether it is advantageous to commit earlier rather than later, and how the outcomes obtained through extortions relate to backward induction and Pareto efficiency. expand
|
|
Enacting protocols by commitment concession |
|
Pinar Yolum,
Munindar P. Singh
|
|
Article No.: 27 |
|
doi>10.1145/1329125.1329158 |
|
Full text: PDF
|
|
Commitment protocols formalize interactions among autonomous, heterogeneous agents, leaving the agents' local policies unspecified. This paper studies the problem of agents enacting commitment protocols, which inherently requires that their policies ...
Commitment protocols formalize interactions among autonomous, heterogeneous agents, leaving the agents' local policies unspecified. This paper studies the problem of agents enacting commitment protocols, which inherently requires that their policies cohere with the given protocols. Specifically, in many important settings, if agents incautiously create and discharge commitments, they can expose themselves to certain risk; conversely, if the agents are (excessively) cautious, a protocol enactment may deadlock. This paper adopts the well-known idea of monotonic concession, but specializes and enhances it with the particular features of commitments. Specifically, this paper formulates inference rules for commitment concession that respect the nature of commitments. Next, it shows how commitments can be systematically revised as the agents incrementally engage each other in enacting their protocol. This paper demonstrates how such rules can be applied in practice, and identifies conditions under which progress and termination of protocol enactment can be guaranteed. expand
|
|
Temporal linear logic as a basis for flexible agent interactions |
|
Duc Q. Pham,
James Harland
|
|
Article No.: 28 |
|
doi>10.1145/1329125.1329159 |
|
Full text: PDF
|
|
Interactions between agents in an open system such as the Internet require a significant degree of flexibility. A crucial aspect of the development of such methods is the notion of commitments, which provides a mechanism for coordinating interactive ...
Interactions between agents in an open system such as the Internet require a significant degree of flexibility. A crucial aspect of the development of such methods is the notion of commitments, which provides a mechanism for coordinating interactive behaviors among agents. In this paper, we investigate an approach to model commitments with tight integration with protocol actions. This means that there is no need to have an explicit mapping from protocols actions to operations on commitments and an external mechanism to process and enforce commitments. We show how agents can reason about commitments and protocol actions to achieve the end results of protocols using a reasoning system based on temporal linear logic, which incorporates both temporal and resource-sensitive reasoning. We also discuss the application of this framework to scenarios such as online commerce. expand
|
|
SESSION: Communications and commitments: poster papers |
|
|
|
Choice and interoperation in protocol enactment |
|
Amit K. Chopra,
Munindar P. Singh
|
|
Article No.: 29 |
|
doi>10.1145/1329125.1329161 |
|
Full text: PDF
|
|
Protocols describe interactions among agents and thus underlie the engineering of multiagent systems. However, protocols are enacted by agents in physical systems. In particular, considerations of communication models and how distributed agents are able ...
Protocols describe interactions among agents and thus underlie the engineering of multiagent systems. However, protocols are enacted by agents in physical systems. In particular, considerations of communication models and how distributed agents are able to make compatible choices would greatly affect whether a protocol may in fact be enacted successfully. The objective of this paper is to study the conceptual underpinnings of protocol enactment in multiagent systems. It seeks to characterize the operationalization of agents so as to determine whether and when agents may be interoperable. expand
|
|
Joint conversation specification and compliance |
|
Shamimabi Paurobally,
Michael J. Wooldridge
|
|
Article No.: 30 |
|
doi>10.1145/1329125.1329162 |
|
Full text: PDF
|
|
Formal specifications of protocol-oriented agent interactions have focused mainly on the semantics of the constituent agent communication language (ACL). We argue that a proper theoretical treatment of conversations cannot be simply derived compositionally ...
Formal specifications of protocol-oriented agent interactions have focused mainly on the semantics of the constituent agent communication language (ACL). We argue that a proper theoretical treatment of conversations cannot be simply derived compositionally from the semantics of individual Communicative Acts (CAs). Accordingly, we develop a theory of joint conversations that is independent of its constituent CAs. We treat the process of a group following an interaction protocol as a persistent joint communicative action (JCA) by the group. We define compliance in a joint conversation and we prove salient properties of joint conversations. expand
|
|
Designing protocols for agent institutions |
|
Huib Aldewereld,
Frank Dignum,
John-Jules Ch. Meyer
|
|
Article No.: 31 |
|
doi>10.1145/1329125.1329163 |
|
Full text: PDF
|
|
Agent-mediated institutions (or e-institutions), introduced in [9], are open agent systems that allow heterogeneous agents to enter and perform tasks. The e-institutions specify the admissible behaviour of the agents by means of norms, which are ...
Agent-mediated institutions (or e-institutions), introduced in [9], are open agent systems that allow heterogeneous agents to enter and perform tasks. The e-institutions specify the admissible behaviour of the agents by means of norms, which are declarative and abstract by nature. On the one hand this allows for a stable specification suitable for almost any conceivable situation that arises in the institution, but in the other hand the norms hardly give any indication which interaction patterns would guarantee satisfaction of the norms. expand
|
|
Deriving agent-centred representations of protocols described using propositional statecharts |
|
Hywel Dunn-Davies,
Jim Cunningham
|
|
Article No.: 32 |
|
doi>10.1145/1329125.1329164 |
|
Full text: PDF
|
|
Diagrammatic methodologies for the representation of agent interaction protocols can be classified as joint representations that describe an interaction in terms of a single sequential process, or agent-centred representations that provide a distinct ...
Diagrammatic methodologies for the representation of agent interaction protocols can be classified as joint representations that describe an interaction in terms of a single sequential process, or agent-centred representations that provide a distinct description of the interaction protocol for each agent (or role) in the interaction. Here we discuss the process of deriving agent centred representations from joint representations and vice versa, using a variant of UML statecharts. expand
|
|
Toward verification of commitment protocols and their compositions |
|
Nirmit Desai,
Zhengang Cheng,
Amit K. Chopra,
Munindar P. Singh
|
|
Article No.: 33 |
|
doi>10.1145/1329125.1329165 |
|
Full text: PDF
|
|
Commitment protocols have been proposed as a basis for modeling and enacting interactions among agents, such as those needed to carry out business processes. A central idea is that protocols would be developed and shared via libraries, and refined and ...
Commitment protocols have been proposed as a basis for modeling and enacting interactions among agents, such as those needed to carry out business processes. A central idea is that protocols would be developed and shared via libraries, and refined and composed to produce protocols that serve specific needs. Success in this program, therefore, presupposes that individual protocols and their compositions can be formally verified with respect to the properties of interest. This paper outlines an approach for verifying the correctness of commitment protocols and their compositions that exploits the well-known software engineering technique of model checking. expand
|
|
Interactive dynamic influence diagrams |
|
Kyle Polich,
Piotr Gmytrasiewicz
|
|
Article No.: 34 |
|
doi>10.1145/1329125.1329166 |
|
Full text: PDF
|
|
Partially Observable Markov Decision Processes (POMDPs) emerged as the primary framework for decision-theoretic planning in single agent settings. Solutions to POMDPs are optimal plans which are conditional on future observations. Dynamic Influence Diagrams ...
Partially Observable Markov Decision Processes (POMDPs) emerged as the primary framework for decision-theoretic planning in single agent settings. Solutions to POMDPs are optimal plans which are conditional on future observations. Dynamic Influence Diagrams (DIDs) are computational representations of POMDPs which compute solutions for finite time horizons in an on-line fashion. Interactive POMDPs (I-POMDPs) [5] generalize POMDPs to multi-agent settings by including models of other agents in the state space. Interactive DIDs (I-DIDs), presented in this paper, are computational representations of I-POMDPs, and thus generalizations of DIDs. DIDs are themselves temporal generalizations of influence diagrams [6]. expand
|
|
A component-based approach to standardising agent communication |
|
Frank Guerin,
Wamberto Vasconcelos
|
|
Article No.: 35 |
|
doi>10.1145/1329125.1329167 |
|
Full text: PDF
|
|
We address the problem of standardising the semantics of agent communication. The diversity of existing approaches suggests that no single agent communication language can satisfactorily cater for all scenarios. However, standardising the way in which ...
We address the problem of standardising the semantics of agent communication. The diversity of existing approaches suggests that no single agent communication language can satisfactorily cater for all scenarios. However, standardising the way in which different languages are specified is a viable alternative. We describe a standard meta-language in which the rules of an arbitrary institution can be specified. In this way different agent communication languages can be given a common grounding. From this starting point, we describe a component-based approach to standardisation, whereby a standard can develop by adding component sets of rules; for example to handle various classes of dialogues and normative relations. Eventually we envisage different agent institutions publishing a specification of their rules by simply specifying the subset of standard components in use in that institution. Agents implementing the meta-language can then interoperate between institutions by downloading the appropriate components. expand
|
|
Computing effective communication policies in multiagent systems |
|
Doran Chakraborty,
Sandip Sen
|
|
Article No.: 36 |
|
doi>10.1145/1329125.1329168 |
|
Full text: PDF
|
|
Communication is a key tool for facilitating multiagent coordination in cooperative and uncertain domains. We focus on a class of multiagent problems modeled as Decentralized Markov Decision Processes with Communication (DEC-MDP-COM) with local observability. ...
Communication is a key tool for facilitating multiagent coordination in cooperative and uncertain domains. We focus on a class of multiagent problems modeled as Decentralized Markov Decision Processes with Communication (DEC-MDP-COM) with local observability. The planning problem for computing the optimal communication strategy in such domains is often formulated with the assumption of the knowledge of optimal domain-level policy. Computing the optimal communication policy is NP-complete. There is a need, then, for heuristic solutions that trade-off performance with efficiency. We present a decision theoretic approach for computing optimal communication policies in stochastic environments which uses a branching future representation and evaluates only those decisions that an agent is likely to encounter. The communication strategy computed off-line is used in the more probable scenarios that the agent would face in future. Our approach also allows agents to compute communication policies at run-time in the unlikely event of the agents facing scenarios that were discarded while computing the off-line policy. expand
|
|
SESSION: Multiagent learning: full papers |
|
|
|
Transfer via inter-task mappings in policy search reinforcement learning |
|
Matthew E. Taylor,
Shimon Whiteson,
Peter Stone
|
|
Article No.: 37 |
|
doi>10.1145/1329125.1329170 |
|
Full text: PDF
|
|
The ambitious goal of transfer learning is to accelerate learning on a target task after training on a different, but related, source task. While many past transfer methods have focused on transferring value-functions, this paper presents a method for ...
The ambitious goal of transfer learning is to accelerate learning on a target task after training on a different, but related, source task. While many past transfer methods have focused on transferring value-functions, this paper presents a method for transferring policies across tasks with different state and action spaces. In particular, this paper utilizes transfer via inter-task mappings for policy search methods (TVITM-PS) to construct a transfer functional that translates a population of neural network policies trained via policy search from a source task to a target task. Empirical results in robot soccer Keepaway and Server Job Scheduling show that TVITM-PS can markedly reduce learning time when full inter-task mappings are available. The results also demonstrate that TVITMPS still succeeds when given only incomplete inter-task mappings. Furthermore, we present a novel method for learning such mappings when they are not available, and give results showing they perform comparably to hand-coded mappings. expand
|
|
SMILE: Sound Multi-agent Incremental LEarning |
|
Gauvain Bourgne,
Amal El Fallah Segrouchni,
Henry Soldano
|
|
Article No.: 38 |
|
doi>10.1145/1329125.1329171 |
|
Full text: PDF
|
|
This article deals with the problem of collaborative learning in a multi-agent system. Here each agent can update incrementally its beliefs B (the concept representation) so that it is in a way kept consistent with the whole set of information ...
This article deals with the problem of collaborative learning in a multi-agent system. Here each agent can update incrementally its beliefs B (the concept representation) so that it is in a way kept consistent with the whole set of information K (the examples) that he has received from the environment or other agents. We extend this notion of consistency (or soundness) to the whole MAS and discuss how to obtain that, at any moment, a same consistent concept representation is present in each agent. The corresponding protocol is applied to supervised concept learning. The resulting method SMILE (standing for Sound Multi-agent Incremental LEarning) is described and experimented here. Surprisingly some difficult boolean formulas are better learned, given the same learning set, by a Multi agent system than by a single agent. expand
|
|
Multiagent reinforcement learning and self-organization in a network of agents |
|
Sherief Abdallah,
Victor Lesser
|
|
Article No.: 39 |
|
doi>10.1145/1329125.1329172 |
|
Full text: PDF
|
|
To cope with large scale, agents are usually organized in a network such that an agent interacts only with its immediate neighbors in the network. Reinforcement learning techniques have been commonly used to optimize agents local policies in such a network ...
To cope with large scale, agents are usually organized in a network such that an agent interacts only with its immediate neighbors in the network. Reinforcement learning techniques have been commonly used to optimize agents local policies in such a network because they require little domain knowledge and can be fully distributed. However, all of the previous work assumed the underlying network was fixed throughout the learning process. This assumption was important because the underlying network defines the learning context of each agent. In particular, the set of actions and the state space for each agent is defined in terms of the agent's neighbors. If agents dynamically change the underlying network structure (also called self-organize) during learning, then one needs a mechanism for transferring what agents have learned so far before (in the old network structure) to their new learning context (in the new network structure). In this work we develop a novel self-organization mechanism that not only allows agents to self-organize the underlying network during the learning process, but also uses information from learning to guide the self-organization process. Consequently, our work is the first to study this interaction between learning and self-organization. Our self-organization mechanism uses heuristics to transfer the learned knowledge across the different steps of self-organization. We also present a more restricted version of our mechanism that is computationally less expensive and still achieve good performance. We use a simplified version of the distributed task allocation domain as our case study. Experimental results verify the stability of our approach and show a monotonic improvement in the performance of the learning process due to self-organization. expand
|
|
Theoretical advantages of lenient Q-learners: an evolutionary game theoretic perspective |
|
Liviu Panait,
Karl Tuyls
|
|
Article No.: 40 |
|
doi>10.1145/1329125.1329173 |
|
Full text: PDF
|
|
This paper presents the dynamics of multiple reinforcement learning agents from an Evolutionary Game Theoretic (EGT) perspective. We provide a Replicator Dynamics model for traditional multiagent Q-learning, and we extend these differential equations ...
This paper presents the dynamics of multiple reinforcement learning agents from an Evolutionary Game Theoretic (EGT) perspective. We provide a Replicator Dynamics model for traditional multiagent Q-learning, and we extend these differential equations to account for lenient learners: agents that forgive possible mistakes of their teammates that resulted in lower rewards. We use this extended formal model to visualize the basins of attraction of both traditional and lenient multiagent Q-learners in two benchmark coordination problems. The results indicate that lenience provides learners with more accurate estimates for the utility of their actions, resulting in higher likelihood of convergence to the globally optimal solution. In addition, our research supports the strength of EGT as a backbone for multiagent reinforcement learning. expand
|
|
Multiagent learning in adaptive dynamic systems |
|
Andriy Burkov,
Brahim Chaib-draa
|
|
Article No.: 41 |
|
doi>10.1145/1329125.1329174 |
|
Full text: PDF
|
|
Classically, an approach to the multiagent policy learning supposed that the agents, via interactions and/or by using preliminary knowledge about the reward functions of all players, would find an interdependent solution called "equilibrium". Recently, ...
Classically, an approach to the multiagent policy learning supposed that the agents, via interactions and/or by using preliminary knowledge about the reward functions of all players, would find an interdependent solution called "equilibrium". Recently, however, certain researchers question the necessity and the validity of the concept of equilibrium as the most important multiagent solution concept. They argue that a "good" learning algorithm is one that is efficient with respect to a certain class of counterparts. Adaptive players is an important class of agents that learn their policies separately from the maintenance of the beliefs about their counterparts' future actions and make their decisions based on that policy and the current belief. In this paper, we propose an efficient learning algorithm in presence of the adaptive counterparts called Adaptive Dynamics Learner (ADL), which is able to learn an efficient policy over the opponents' adaptive dynamics rather than over the simple actions and beliefs and, by so doing, to exploit these dynamics to obtain a higher utility than any equilibrium strategy can provide. We tested our algorithm on a substantial representative set of the most known and demonstrative matrix games and observed that ADL agent is highly efficient against Adaptive Play Q-learning (APQ) agent and Infinitesimal Gradient Ascent (IGA) agent. In self-play, when possible, ADL is able to converge to a Pareto optimal strategy maximizing the welfare of all players. expand
|
|
Regret based dynamics: convergence in weakly acyclic games |
|
Jason R. Marden,
Gürdal Arslan,
Jeff S. Shamma
|
|
Article No.: 42 |
|
doi>10.1145/1329125.1329175 |
|
Full text: PDF
|
|
No-regret algorithms have been proposed to control a wide variety of multi-agent systems. The appeal of no-regret algorithms is that they are easily implementable in large scale multi-agent systems because players make decisions using only retrospective ...
No-regret algorithms have been proposed to control a wide variety of multi-agent systems. The appeal of no-regret algorithms is that they are easily implementable in large scale multi-agent systems because players make decisions using only retrospective or "regret based" information. Furthermore, there are existing results proving that the collective behavior will asymptotically converge to a set of points of "no-regret" in any game. We illustrate, through a simple example, that no-regret points need not reflect desirable operating conditions for a multi-agent system. Multi-agent systems often exhibit an additional structure (i.e. being "weakly acyclic") that has not been exploited in the context of no-regret algorithms. In this paper, we introduce a modification of the traditional no-regret algorithms by (i) exponentially discounting the memory and (ii) bringing in a notion of inertia in players' decision process. We show how these modifications can lead to an entire class of regret based algorithms that provide almost sure convergence to a pure Nash equilibrium in any weakly acyclic game. expand
|
|
Sharing experiences to learn user characteristics in dynamic environments with sparse data |
|
David Sarne,
Barbara J. Grosz
|
|
Article No.: 43 |
|
doi>10.1145/1329125.1329176 |
|
Full text: PDF
|
|
This paper investigates the problem of estimating the value of probabilistic parameters needed for decision making in environments in which an agent, operating within a multi-agent system, has no a priori information about the structure of the distribution ...
This paper investigates the problem of estimating the value of probabilistic parameters needed for decision making in environments in which an agent, operating within a multi-agent system, has no a priori information about the structure of the distribution of parameter values. The agent must be able to produce estimations even when it may have made only a small number of direct observations, and thus it must be able to operate with sparse data. The paper describes a mechanism that enables the agent to significantly improve its estimation by augmenting its direct observations with those obtained by other agents with which it is coordinating. To avoid undesirable bias in relatively heterogeneous environments while effectively using relevant data to improve its estimations, the mechanism weighs the contributions of other agents' observations based on a real-time estimation of the level of similarity between each of these agents and itself. The "coordination autonomy" module of a coordination-manager system provided an empirical setting for evaluation. Simulation-based evaluations demonstrated that the proposed mechanism outperforms estimations based exclusively on an agent's own observations as well as estimations based on an unweighted aggregate of all other agents' observations. expand
|
|
SESSION: Multiagent learning: poster papers |
|
|
|
Reducing the complexity of multiagent reinforcement learning |
|
Andriy Burkov,
Brahim Chaib-draa
|
|
Article No.: 44 |
|
doi>10.1145/1329125.1329178 |
|
Full text: PDF
|
|
It is known that the complexity of the reinforcement learning algorithms, such as Q-learning, may be exponential in the number of environment's states. It was shown, however, that the learning complexity for the goal-directed problems may be substantially ...
It is known that the complexity of the reinforcement learning algorithms, such as Q-learning, may be exponential in the number of environment's states. It was shown, however, that the learning complexity for the goal-directed problems may be substantially reduced by initializing the Q-values with a "good" approximative function. In the multiagent case, there exists such a good approximation for a big class of problems, namely, for goal-directed stochastic games. These games, for example, can reflect coordination and common interest problems of cooperative robotics. The approximative function for these games is nothing but the relaxed, single-agent, problem solution, which can easily be found by each agent individually. In this article, we show that (1) an optimal single-agent solution is a "good" approximation for the goal-directed stochastic games with action-penalty representation and (b) the complexity is reduced when the learning is initialized with this approximative function, as compared to the uninformed case. expand
|
|
Parallel reinforcement learning with linear function approximation |
|
Matthew Grounds,
Daniel Kudenko
|
|
Article No.: 45 |
|
doi>10.1145/1329125.1329179 |
|
Full text: PDF
|
|
In this paper, we investigate the use of parallelization in reinforcement learning (RL), with the goal of learning optimal policies for single-agent RL problems more quickly by using parallel hardware. Our approach is based on agents using the ...
In this paper, we investigate the use of parallelization in reinforcement learning (RL), with the goal of learning optimal policies for single-agent RL problems more quickly by using parallel hardware. Our approach is based on agents using the SARSA(λ) algorithm, with value functions represented using linear function approximators. In our proposed method, each agent learns independently in a separate simulation of the single-agent problem. The agents periodically exchange information extracted from the weights of their approximators, accelerating convergence towards the optimal policy. We present empirical results for an implementation on a Beowulf cluster. expand
|
|
Reinforcement learning in extensive form games with incomplete information: the bargaining case study |
|
Alessandro Lazaric,
Enrique Munoz de Cote,
Nicola Gatti
|
|
Article No.: 46 |
|
doi>10.1145/1329125.1329180 |
|
Full text: PDF
|
|
We consider the problem of playing in repeated extensive form games where agents do not have any prior. In this situation classic game theoretical tools are inapplicable and it is common the resort to learning techniques. In this paper, we present a ...
We consider the problem of playing in repeated extensive form games where agents do not have any prior. In this situation classic game theoretical tools are inapplicable and it is common the resort to learning techniques. In this paper, we present a novel learning principle that aims at avoiding oscillations in the agents' strategies induced by the presence of concurrent learners. We apply our algorithm in bargaining, and we experimentally evaluate it showing that using this principle reinforcement learning algorithms can improve their convergence time. expand
|
|
SESSION: Applications and computational environments: full papers |
|
|
|
A reinforcement learning based distributed search algorithm for hierarchical peer-to-peer information retrieval systems |
|
Haizheng Zhang,
Victor Lesser
|
|
Article No.: 47 |
|
doi>10.1145/1329125.1329182 |
|
Full text: PDF
|
|
The dominant existing routing strategies employed in peer-to-peer(P2P) based information retrieval(IR) systems are similarity-based approaches. In these approaches, agents depend on the content similarity between incoming queries and their direct neighboring ...
The dominant existing routing strategies employed in peer-to-peer(P2P) based information retrieval(IR) systems are similarity-based approaches. In these approaches, agents depend on the content similarity between incoming queries and their direct neighboring agents to direct the distributed search sessions. However, such a heuristic is myopic in that the neighboring agents may not be connected to more relevant agents. In this paper, an online reinforcement-learning based approach is developed to take advantage of the dynamic run-time characteristics of P2P IR systems as represented by information about past search sessions. Specifically, agents maintain estimates on the downstream agents' abilities to provide relevant documents for incoming queries. These estimates are updated gradually by learning from the feedback information returned from previous search sessions. Based on this information, the agents derive corresponding routing policies. Thereafter, these agents route the queries based on the learned policies and update the estimates based on the new routing policies. Experimental results demonstrate that the learning algorithm improves considerably the routing performance on two test collection sets that have been used in a variety of distributed IR studies. expand
|
|
Estimating information value in collaborative multi-agent planning systems |
|
David Sarne,
Barbara J. Grosz
|
|
Article No.: 48 |
|
doi>10.1145/1329125.1329183 |
|
Full text: PDF
|
|
This paper addresses the problem of identifying the value of information held by a teammate on a distributed, multi-agent team. It focuses on a distributed scheduling task in which computer agents support people who are carrying out complex tasks in ...
This paper addresses the problem of identifying the value of information held by a teammate on a distributed, multi-agent team. It focuses on a distributed scheduling task in which computer agents support people who are carrying out complex tasks in a dynamic environment. The paper presents a decision-theoretic algorithm for determining the value of information that is potentially relevant to schedule revisions, but is directly available only to the person and not the computer agent. The design of a "coordination autonomy" (CA) module within a coordination-manager system provided the empirical setting for this work. By design, the CA module depends on an external scheduler module to determine the specific effect of additional information on overall system performance. The paper describes two methods for reducing the number of queries the CA issues to the scheduler, enabling it to satisfy computational resource constraints placed on it. Experimental results indicate the algorithm improves system performance and establish the exceptional efficiency---measured in terms of the number of queries required for estimating the value of information---that can be achieved by the query-reducing methods. expand
|
|
Information searching and sharing in large-scale dynamic networks |
|
George A. Vouros
|
|
Article No.: 49 |
|
doi>10.1145/1329125.1329184 |
|
Full text: PDF
|
|
Finding the right agents in a large and dynamic network to provide the needed resources in a timely fashion, is a long standing problem. This paper presents a method for information searching and sharing that combines routing indices with token-based ...
Finding the right agents in a large and dynamic network to provide the needed resources in a timely fashion, is a long standing problem. This paper presents a method for information searching and sharing that combines routing indices with token-based methods. The proposed method enables agents to search effectively by acquiring their neighbors' interests, advertising their information provision abilities and maintaining indices for routing queries, in an integrated way. Specifically, the paper demonstrates through performance experiments how static and dynamic networks of agents can be 'tuned' to answer queries effectively as they gather evidence for the interests and information provision abilities of others, without altering the topology or imposing an overlay structure to the network of acquaintances. expand
|
|
Modelling the provenance of data in autonomous systems |
|
Simon Miles,
Steve Munroe,
Michael Luck,
Luc Moreau
|
|
Article No.: 50 |
|
doi>10.1145/1329125.1329185 |
|
Full text: PDF
|
|
Determining the provenance of data, i.e. the process that led to that data, is vital in many disciplines. For example, in science, the process that produced a given result must be demonstrably rigorous for the result to be deemed reliable. A provenance ...
Determining the provenance of data, i.e. the process that led to that data, is vital in many disciplines. For example, in science, the process that produced a given result must be demonstrably rigorous for the result to be deemed reliable. A provenance system supports applications in recording adequate documentation about process executions to answer queries regarding provenance, and provides functionality to perform those queries. Several provenance systems are being developed, but all focus on systems in which the components are reactive, for example Web Services that act on the basis of a request, job submission system, etc. This limitation means that questions regarding the motives of autonomous actors, or agents, in such systems remain unanswerable in the general case. Such questions include: who was ultimately responsible for a given effect, what was their reason for initiating the process and does the effect of a process match what was intended to occur by those initiating the process? In this paper, we address this limitation by integrating two solutions: a generic, re-usable framework for representing the provenance of data in service-oriented architectures and a model for describing the goal-oriented delegation and engagement of agents in multi-agent systems. Using these solutions, we present algorithms to answer common questions regarding responsibility and success of a process and evaluate the approach with a simulated healthcare example. expand
|
|
An advanced bidding agent for advertisement selection on public displays |
|
Alex Rogers,
Esther David,
Terry R. Payne,
Nicholas R. Jennings
|
|
Article No.: 51 |
|
doi>10.1145/1329125.1329186 |
|
Full text: PDF
|
|
In this paper we present an advanced bidding agent that participates in first-price sealed bid auctions to allocate advertising space on BluScreen - an experimental public advertisement system that detects users through the presence of their Bluetooth ...
In this paper we present an advanced bidding agent that participates in first-price sealed bid auctions to allocate advertising space on BluScreen - an experimental public advertisement system that detects users through the presence of their Bluetooth enabled devices. Our bidding agent is able to build probabilistic models of both the behaviour of users who view the adverts, and the auctions that it participates within. It then uses these models to maximise the exposure that its adverts receive. We evaluate the effectiveness of this bidding agent through simulation against a range of alternative selection mechanisms including a simple bidding strategy, random allocation, and a centralised optimal allocation with perfect foresight. Our bidding agent significantly outperforms both the simple bidding strategy and the random allocation, and in a mixed population of agents it is able to expose its adverts to 25% more users than the simple bidding strategy. Moreover, its performance is within 7.5% of that of the centralised optimal allocation despite the highly uncertain environment in which it must operate. expand
|
|
An incentive mechanism for message relaying in unstructured peer-to-peer systems |
|
Cuihong Li,
Bin Yu,
Katia Sycara
|
|
Article No.: 52 |
|
doi>10.1145/1329125.1329187 |
|
Full text: PDF
|
|
Distributed message relaying is an important function of a peer-to-peer system to discover service providers. Existing search protocols in unstructured peer-to-peer systems either create huge burden on communications or cause long response time. Moreover, ...
Distributed message relaying is an important function of a peer-to-peer system to discover service providers. Existing search protocols in unstructured peer-to-peer systems either create huge burden on communications or cause long response time. Moreover, these systems are also vulnerable to the free riding problem. In this paper we present an incentive mechanism that not only mitigates the free riding problem, but also achieves good system efficiency in message relaying for peer discovery. In this mechanism promised rewards are passed along the message propagation process. A peer is rewarded if a service provider is found via a relaying path that includes this peer. We provide some analytic insights to the symmetric Nash equilibrium strategies of this game, and an approximate approach to calculate this equilibrium. Experiments show that this incentive mechanism brings a system utility generally higher than breadth-first search and random walks, based on both the estimated utility from our approximate equilibrium and the utility generated from learning in the incentive mechanism. expand
|
|
Bidding optimally in concurrent second-price auctions of perfectly substitutable goods |
|
Enrico H. Gerding,
Rajdeep K. Dash,
David C. K. Yuen,
Nicholas R. Jennings
|
|
Article No.: 53 |
|
doi>10.1145/1329125.1329188 |
|
Full text: PDF
|
|
We derive optimal bidding strategies for a global bidding agent that participates in multiple, simultaneous second-price auctions with perfect substitutes. We first consider a model where all other bidders are local and participate in a single auction. ...
We derive optimal bidding strategies for a global bidding agent that participates in multiple, simultaneous second-price auctions with perfect substitutes. We first consider a model where all other bidders are local and participate in a single auction. For this case, we prove that, assuming free disposal, the global bidder should always place non-zero bids in all available auctions, irrespective of the local bidders' valuation distribution. Furthermore, for non-decreasing valuation distributions, we prove that the problem of finding the optimal bids reduces to two dimensions. These results hold both in the case where the number of local bidders is known and when this number is determined by a Poisson distribution. This analysis extends to online markets where, typically, auctions occur both concurrently and sequentially. In addition, by combining analytical and simulation results, we demonstrate that similar results hold in the case of several global bidders, provided that the market consists of both global and local bidders. Finally, we address the efficiency of the overall market, and show that information about the number of local bidders is an important determinant for the way in which a global bidder affects efficiency. expand
|
|
Collaboration among a satellite swarm |
|
Grégory Bonnet,
Catherine Tessier
|
|
Article No.: 54 |
|
doi>10.1145/1329125.1329189 |
|
Full text: PDF
|
|
The paper deals with on-board planning for a satellite swarm via communication and negotiation. We aim at defining individual behaviours that result in a global behaviour that meets the mission requirements. We will present the formalization of the problem, ...
The paper deals with on-board planning for a satellite swarm via communication and negotiation. We aim at defining individual behaviours that result in a global behaviour that meets the mission requirements. We will present the formalization of the problem, a communication protocol, a solving method based on reactive decision rules, and first results. expand
|
|
Overlay networks for task allocation and coordination in dynamic large-scale networks of cooperative agents |
|
Christina Theocharopoulou,
Ioannis Partsakoulakis,
George A. Vouros,
Kostas Stergiou
|
|
Article No.: 55 |
|
doi>10.1145/1329125.1329190 |
|
Full text: PDF
|
|
This work proposes a method for allocating temporally interdependent tasks to homogeneous or heterogeneous cooperative agents in dynamic large-scale networks. This method views searching, task allocation and scheduling as an integrated problem that has ...
This work proposes a method for allocating temporally interdependent tasks to homogeneous or heterogeneous cooperative agents in dynamic large-scale networks. This method views searching, task allocation and scheduling as an integrated problem that has to be efficiently solved in such networks. Solving the general problem optimally in a decentralized way is very hard and can only be solved by a centralized method, be approximated by means of heuristics, or by relaxations of the original problem. Our method facilitates effective searching through the dynamic assignment of gateway roles to agents and the exploitation of routing indices. In combination to searching, it exploits distributed constraint satisfaction techniques and dynamic re-organization of agent teams to efficiently handle the allocation of complex tasks with interdependent subtasks. expand
|
|
Solving large TÆMS problems efficiently by selective exploration and decomposition |
|
Jianhui Wu,
Edmund H. Durfee
|
|
Article No.: 56 |
|
doi>10.1145/1329125.1329191 |
|
Full text: PDF
|
|
TÆMS is a hierarchical modeling language capable of representing complex task networks with intra-task uncertainties and inter-task dependencies. The uncertainty and complexity of the application domains represented in TÆMS models often lead ...
TÆMS is a hierarchical modeling language capable of representing complex task networks with intra-task uncertainties and inter-task dependencies. The uncertainty and complexity of the application domains represented in TÆMS models often lead to very large state spaces, which push the need to design efficient solution algorithms for TÆMS problems. In this paper, we present a solver that integrates selective state space search techniques with state space decomposition techniques. Our experiments demonstrate that the solver can find an (approximately) optimal solution much faster than prior approaches. expand
|
|
SESSION: Applications and computational environments: poster paper |
|
|
|
A methodology for 3D electronic institutions |
|
A. Bogdanovych,
M. Esteva,
S. Simoff,
C. Sierra,
H. Berger
|
|
Article No.: 57 |
|
doi>10.1145/1329125.1329193 |
|
Full text: PDF
|
|
In this paper we propose a methodology for the construction of 3D electronic institutions. 3D electronic institutions are normative environments where software and human agents can participate and collaborate in a joint 3D Virtual World. The proposed ...
In this paper we propose a methodology for the construction of 3D electronic institutions. 3D electronic institutions are normative environments where software and human agents can participate and collaborate in a joint 3D Virtual World. The proposed methodology covers the specification of the institutional rules, as well as the design and visualization of 3D environments for the specified institution. It is also supplied with a set of graphical tools that facilitate the development process on every level, from specification to deployment. The resulting system facilitates the direct integration of human users into Multi-Agent Systems as they participate by driving an avatar in the generated 3D environment. expand
|
|
SESSION: Cognitive models for agents: full papers |
|
|
|
Cognitive and social simulation of criminal behaviour: the intermittent explosive disorder case |
|
Tibor Bosse,
Charlotte Gerritsen,
Jan Treur
|
|
Article No.: 58 |
|
doi>10.1145/1329125.1329195 |
|
Full text: PDF
|
|
Criminal behaviour often involves a combination of physical, mental, social and environmental (multi-)agent aspects, such as neurological deviations, hormones, arousal, (non)empathy, targets and social control. This paper contributes a dynamical agent-based ...
Criminal behaviour often involves a combination of physical, mental, social and environmental (multi-)agent aspects, such as neurological deviations, hormones, arousal, (non)empathy, targets and social control. This paper contributes a dynamical agent-based approach for analysis and simulation of criminal behaviour, covering the above aspects, illustrated for the case of an Intermittent Explosive Disorder. It involves dynamically generated desires and beliefs in opportunities within the social environment, both based on literature on criminal behaviour. expand
|
|
Interactions between market barriers and communication networks in marketing systems |
|
Ian N. Durbach,
Jan H. Hofmeyr
|
|
Article No.: 59 |
|
doi>10.1145/1329125.1329196 |
|
Full text: PDF
|
|
We investigate a framework where agents search for satisfying products by using referrals from other agents. Our model of a mechanism for transmitting word-of-mouth and the resulting behavioural effects is based on integrating a module governing the ...
We investigate a framework where agents search for satisfying products by using referrals from other agents. Our model of a mechanism for transmitting word-of-mouth and the resulting behavioural effects is based on integrating a module governing the local behaviour of agents with a module governing the structure and function of the underlying network of agents. Local behaviour incorporates a satisficing model of choice, a set of rules governing the interactions between agents, including learning about the trustworthiness of other agents over time, and external constraints on behaviour that may be imposed by market barriers or switching costs. Local behaviour takes place on a network substrate across which agents exchange positive and negative information about products. We use various degree distributions dictating the extent of connectivity, and incorporate both small-world effects and the notion of preferential attachment in our network models. We compare the effectiveness of referral systems over various network structures for easy and hard choice tasks, and evaluate how this effectiveness changes with the imposition of market barriers. expand
|
|
Realistic cognitive load modeling for enhancing shared mental models in human-agent collaboration |
|
Xiaocong Fan,
John Yen
|
|
Article No.: 60 |
|
doi>10.1145/1329125.1329197 |
|
Full text: PDF
|
|
Human team members often develop shared expectations to predict each other's needs and coordinate their behaviors. In this paper the concept "Shared Belief Map" is proposed as a basis for developing realistic shared expectations among a team of Human-Agent-Pairs ...
Human team members often develop shared expectations to predict each other's needs and coordinate their behaviors. In this paper the concept "Shared Belief Map" is proposed as a basis for developing realistic shared expectations among a team of Human-Agent-Pairs (HAPs). The establishment of shared belief maps relies on inter-agent information sharing, the effectiveness of which highly depends on agents' processing loads and the instantaneous cognitive loads of their human partners. We investigate HMM-based cognitive load models to facilitate team members to "share the right information with the right party at the right time". The shared belief map concept and the cognitive/processing load models have been implemented in a cognitive agent architecture---SMMall. A series of experiments were conducted to evaluate the concept, the models, and their impacts on the evolving of shared mental models of HAP teams. expand
|
|
SESSION: Cognitive models for agents: poster papers |
|
|
|
A computational model of human interaction and planning for heterogeneous multi-agent systems |
|
Alexandre Pauchet,
Nathalie Chaignaud,
Amal El Fallah Seghrouchni
|
|
Article No.: 61 |
|
doi>10.1145/1329125.1329199 |
|
Full text: PDF
|
|
In the framework of heterogeneous multi-agent systems, this paper presents an implemented cognitive model of cooperative problem-solving, based on a psychological experiment. We are interested in simulating how human subjects elaborate plans in situations ...
In the framework of heterogeneous multi-agent systems, this paper presents an implemented cognitive model of cooperative problem-solving, based on a psychological experiment. We are interested in simulating how human subjects elaborate plans in situations where knowledge is incomplete and how they interact to obtain missing information. The system BDIGGY, a concurrent implementation of a planning model and an interaction model, is used to simulate the human processes during cooperative problem solving. expand
|
|
Forming and scaffolding human coalitions with a multi-agent framework |
|
Leen-Kiat Soh,
Nobel Khandaker
|
|
Article No.: 62 |
|
doi>10.1145/1329125.1329200 |
|
Full text: PDF
|
|
With the advancement of teleconferencing technologies, human users are collaborating online more than ever today. To improve the efficiency and effectiveness of online human coalitions, one needs to support and facilitate collaborations among human users ...
With the advancement of teleconferencing technologies, human users are collaborating online more than ever today. To improve the efficiency and effectiveness of online human coalitions, one needs to support and facilitate collaborations among human users who may or may not know of each other well and of how to work well together as a team or in a team. Here we propose the Integrated Human Coalition Formation and Scaffolding (iHUCOFS) framework. This multiagent framework considers the roles of an agent as both an advisor and a representative to a human user, the tradeoffs between forming and scaffolding human coalitions, and how scaffolding could impact human behaviors for future coalitions. Based on the axioms and design principles of iHUCOFS, we have developed VALCAM---an iterative auction based coalition formation algorithm. To investigate the feasibility and impact of VALCAM, we have conducted an experiment in a computer-supported collaborative learning environment and obtained promising results. expand
|
|
Multiagent based construction for human-like architecture |
|
Yifeng Zeng,
Dennis Plougman Buus,
Jorge Cordero H
|
|
Article No.: 63 |
|
doi>10.1145/1329125.1329201 |
|
Full text: PDF
|
|
Collaborative construction is a main application in the field of autonomous systems. An interesting subject in the area is the construction of realistic human-like architecture. However, the task of building a human-like architecture is non-trivial since ...
Collaborative construction is a main application in the field of autonomous systems. An interesting subject in the area is the construction of realistic human-like architecture. However, the task of building a human-like architecture is non-trivial since the construction is a real time process without human supervision. In this paper, we present a collective building algorithm based on stigmergy. A swarm of virtual agents construct edifications which resemble basic features in human-like architecture. The algorithm maps sensory information to appropriate building actions. expand
|
|
SESSION: Mechanism design: full papers |
|
|
|
Coalition formation under uncertainty: bargaining equilibria and the Bayesian core stability concept |
|
Georgios Chalkiadakis,
Evangelos Markakis,
Craig Boutilier
|
|
Article No.: 64 |
|
doi>10.1145/1329125.1329203 |
|
Full text: PDF
|
|
Coalition formation is a problem of great interest in AI, allowing groups of autonomous, rational agents to form stable teams. Furthermore, the study of coalitional stability concepts and their relation to equilibria that guide the strategic interactions ...
Coalition formation is a problem of great interest in AI, allowing groups of autonomous, rational agents to form stable teams. Furthermore, the study of coalitional stability concepts and their relation to equilibria that guide the strategic interactions of agents during bargaining has lately attracted much attention. However, research to date in both AI and economics has largely ignored the potential presence of uncertainty when studying either coalitional stability or coalitional bargaining. This paper is the first to relate a (cooperative) stability concept under uncertainty, the Bayesian core (BC), with (non-cooperative) equilibrium concepts of coalitional bargaining games. We prove that if the BC of a coalitional game (and of each subgame) is non-empty, then there exists an equilibrium of the corresponding bargaining game that produces a BC element; and conversely, if there exists a coalitional bargaining equilibrium (with certain properties), then it induces a BC configuration. We thus provide a non-cooperative justification of the BC stability concept. As a corollary, we establish a sufficient condition for the existence of the BC. Finally, for small games, we provide an algorithm to decide whether the BC is non-empty. expand
|
|
Eliciting single-peaked preferences using comparison queries |
|
Vincent Conitzer
|
|
Article No.: 65 |
|
doi>10.1145/1329125.1329204 |
|
Full text: PDF
|
|
Voting is a general method for aggregating the preferences of multiple agents. Each agent ranks all the possible alternatives, and based on this, an aggregate ranking of the alternatives (or at least a winning alternative) is produced. However, when ...
Voting is a general method for aggregating the preferences of multiple agents. Each agent ranks all the possible alternatives, and based on this, an aggregate ranking of the alternatives (or at least a winning alternative) is produced. However, when there are many alternatives, it is impractical to simply ask agents to report their complete preferences. Rather, the agents' preferences, or at least the relevant parts thereof, need to be elicited. This is done by asking the agents a (hopefully small) number of simple queries about their preferences, such as comparison queries, which ask an agent to compare two of the alternatives. Prior work on preference elicitation in voting has focused on the case of unrestricted preferences. It has been shown that in this setting, it is sometimes necessary to ask each agent (almost) as many queries as would be required to determine an arbitrary ranking of the alternatives. By contrast, in this paper, we focus on single-peaked preferences. We show that such preferences can be elicited using only a linear number of comparison queries, if either the order with respect to which preferences are single-peaked is known, or at least one other agent's complete preferences are known. We also show that using a sublinear number of queries will not suffice. Finally, we present experimental results. expand
|
|
On the robustness of preference aggregation in noisy environments |
|
Ariel D. Procaccia,
Jeffrey S. Rosenschein,
Gal A. Kaminka
|
|
Article No.: 66 |
|
doi>10.1145/1329125.1329205 |
|
Full text: PDF
|
|
In an election held in a noisy environment, agents may unintentionally perturb the outcome by communicating faulty preferences. We investigate this setting by introducing a theoretical model of noisy preference aggregation and formally defining the (worst-case) ...
In an election held in a noisy environment, agents may unintentionally perturb the outcome by communicating faulty preferences. We investigate this setting by introducing a theoretical model of noisy preference aggregation and formally defining the (worst-case) robustness of a voting rule. We use our model to analytically bound the robustness of various prominent rules. The results show that the robustness of voting rules is diverse, with different rules positioned at either end of the spectrum. These results allow selection of voting rules that support preference aggregation in the face of noise. expand
|
|
Constraint satisfaction algorithms for graphical games |
|
Vishal Soni,
Satinder Singh,
Michael P. Wellman
|
|
Article No.: 67 |
|
doi>10.1145/1329125.1329206 |
|
Full text: PDF
|
|
We formulate the problem of computing equilibria in multi-player games represented by arbitrary undirected graphs as a constraint satisfaction problem and present two algorithms. The first is PureProp: an algorithm for computing approximate Nash equilibria ...
We formulate the problem of computing equilibria in multi-player games represented by arbitrary undirected graphs as a constraint satisfaction problem and present two algorithms. The first is PureProp: an algorithm for computing approximate Nash equilibria in complete information one-shot games and approximate Bayes-Nash equilibria in one-shot games of incomplete information. PureProp unifies existing message-passing based algorithms for solving these classes of games. We also address repeated graphical games, and present a second algorithm, PureProp-R, for computing approximate Nash equilibria in these games. expand
|
|
Factoring games to isolate strategic interactions |
|
George B. Davis,
Michael Benisch,
Kathleen M. Carley,
Norman M. Sadeh
|
|
Article No.: 68 |
|
doi>10.1145/1329125.1329207 |
|
Full text: PDF
|
|
Game theoretic reasoning about multi-agent systems has been made more tractable by algorithms that exploit various types of independence in agents' utilities. However, previous work has assumed that a game's precise independence structure is given in ...
Game theoretic reasoning about multi-agent systems has been made more tractable by algorithms that exploit various types of independence in agents' utilities. However, previous work has assumed that a game's precise independence structure is given in advance. This paper addresses the problem of finding independence structure in a general form game when it is not known ahead of time, or of finding an approximation when full independence does not exist. We give an expected polynomial time algorithm to divide a bounded-payout normal form game into factor games that isolate independent strategic interactions. We also show that the best approximate factoring can be found in polynomial time for a specific interaction that is not fully independent. Once known, factors aide computation of game theoretic solution concepts, including Nash equilibria (or ε-equilibria for approximate factors), in some cases guaranteeing polynomial complexity where previous bounds were exponential. expand
|
|
Sequential decision making in parallel two-sided economic search |
|
David Sarne,
Teijo Arponen
|
|
Article No.: 69 |
|
doi>10.1145/1329125.1329208 |
|
Full text: PDF
|
|
This paper presents a two-sided economic search model in which agents are searching for beneficial pairwise partnerships. In each search stage, each of the agents is randomly matched with several other agents in parallel, and makes a decision whether ...
This paper presents a two-sided economic search model in which agents are searching for beneficial pairwise partnerships. In each search stage, each of the agents is randomly matched with several other agents in parallel, and makes a decision whether to accept a potential partnership with one of them. The distinguishing feature of the proposed model is that the agents are not restricted to maintaining a synchronized (instantaneous) decision protocol and can sequentially accept and reject partnerships within the same search stage. We analyze the dynamics which drive the agents' strategies towards a stable equilibrium in the new model and show that the proposed search strategy weakly dominates the one currently in use for the two-sided parallel economic search model. By identifying several unique characteristics of the equilibrium we manage to efficiently bound the strategy space that needs to be explored by the agents and propose an efficient means for extracting the distributed equilibrium strategies in common environments. expand
|
|
SESSION: Mechanism design: poster paper |
|
|
|
Requirements driven agent collaboration |
|
Liwei Zheng,
Zhi Jin
|
|
Article No.: 70 |
|
doi>10.1145/1329125.1329210 |
|
Full text: PDF
|
|
This paper proposes the requirements driven agent collaboration. This proposal assumes that there are plenty different service agents distributed in Internet. When a request for accomplishing a particular task occurs, these autonomous agents can recognize ...
This paper proposes the requirements driven agent collaboration. This proposal assumes that there are plenty different service agents distributed in Internet. When a request for accomplishing a particular task occurs, these autonomous agents can recognize the newly emergent requirements and dynamically aggregate together to compete with others for fulfilling the requirements. This paper presents a preliminary framework for the requirement driven agent collaboration based on the automated mechanism design. expand
|
|
SESSION: Cooperation, coordination, and teamwork: full papers |
|
|
|
Determining confidence when integrating contributions from multiple agents |
|
Raphen Becker,
Daniel D. Corkill
|
|
Article No.: 71 |
|
doi>10.1145/1329125.1329212 |
|
Full text: PDF
|
|
Integrating contributions received from other agents is an essential activity in multi-agent systems (MASs). Not only must related contributions be integrated together, but the confidence in each integrated contribution must be determined. In this paper ...
Integrating contributions received from other agents is an essential activity in multi-agent systems (MASs). Not only must related contributions be integrated together, but the confidence in each integrated contribution must be determined. In this paper we look specifically at the issue of confidence determination and its effect on developing "principled," highly collaborating MASs. Confidence determination is often masked by ad hoc contribution-integration techniques, viewed as being addressed by agent trust and reputation models, or simply assumed away. We present a domain-independent analysis model that can be used to measure the sensitivity of a collaborative problem-solving system to potentially incorrect confidence-integration assumptions. In analyses performed using our model, we focus on the typical assumption of independence among contributions and the effect that unaccounted-for dependencies have on the expected error in the confidence that the answers produced by the MAS are correct. We then demonstrate how the analysis model can be used to determine confidence bounds on integrated contributions and to identify where efforts to improve contribution-dependency estimates lead to the greatest improvement in solution-confidence accuracy. expand
|
|
Exploiting factored representations for decentralized execution in multiagent teams |
|
Maayan Roth,
Reid Simmons,
Manuela Veloso
|
|
Article No.: 72 |
|
doi>10.1145/1329125.1329213 |
|
Full text: PDF
|
|
In many cooperative multiagent domains, there exist some states in which the agents can act independently and others in which they need to coordinate with their teammates. In this paper, we explore how factored representations of state can be used to ...
In many cooperative multiagent domains, there exist some states in which the agents can act independently and others in which they need to coordinate with their teammates. In this paper, we explore how factored representations of state can be used to generate factored policies that can, with minimal communication, be executed distributedly by a multiagent team. The factored policies indicate those portions of the state where no coordination is necessary, automatically alert the agents when they reach a state in which they do need to coordinate, and determine what the agents should communicate in order to achieve this coordination. We evaluate the success of our approach experimentally by comparing the amount of communication needed by a team executing a factored policy to a team that needs to communicate in every timestep. expand
|
|
Towards collaborative task and team maintenance |
|
Gal A. Kaminka,
Ari Yakir,
Dan Erusalimchik,
Nirom Cohen-Nov
|
|
Article No.: 73 |
|
doi>10.1145/1329125.1329214 |
|
Full text: PDF
|
|
There is significant interest in modeling teamwork in agents. In recent years, it has become widely accepted that it is possible to separate teamwork from taskwork, providing support for domain-independent teamwork at an architectural level, using teamwork ...
There is significant interest in modeling teamwork in agents. In recent years, it has become widely accepted that it is possible to separate teamwork from taskwork, providing support for domain-independent teamwork at an architectural level, using teamwork models. However, existing teamwork models (both in theory and practice) focus almost exclusively on achievement goals, and ignore maintenance goals, where the value of a proposition is to be maintained over time. Such maintenance goals exist both in taskwork (i.e., agents take actions to maintain a condition while a task is executing), as well as in teamwork (i.e., agents take actions to maintain the team). This paper presents mechanisms for collaborative maintenance in both taskwork and teamwork, allowing for flexible selection of the maintenance protocol. The mechanism is integrated and evaluated in two teamwork architectures for situated agent teams: DIESEL, an implemented teamwork and taskwork architecture, built on top of Soar, and BITE, an architecture for physical behavior-based robots. We provide details of these implementations, and the results from experiments demonstrating the benefits of support for collaborative maintenance processes, in several dynamic rich domains. We show that the use of collaborative maintenance leads to significant improvement in task performance in all domains. expand
|
|
Distributed management of flexible times schedules |
|
Stephen F. Smith,
Anthony Gallagher,
Terry Zimmerman
|
|
Article No.: 74 |
|
doi>10.1145/1329125.1329215 |
|
Full text: PDF
|
|
We consider the problem of managing schedules in an uncertain, distributed environment. We assume a team of collaborative agents, each responsible for executing a portion of a globally pre-established schedule, but none possessing a global view of either ...
We consider the problem of managing schedules in an uncertain, distributed environment. We assume a team of collaborative agents, each responsible for executing a portion of a globally pre-established schedule, but none possessing a global view of either the problem or solution. The goal is to maximize the joint quality obtained from the activities executed by all agents, given that, during execution, unexpected events will force changes to some prescribed activities and reduce the utility of executing others. We describe an agent architecture for solving this problem that couples two basic mechanisms: (1) a "flexible times" representation of the agent's schedule (using a Simple Temporal Network) and (2) an incremental rescheduling procedure. The former hedges against temporal uncertainty by allowing execution to proceed from a set of feasible solutions, and the latter acts to revise the agent's schedule when execution is forced outside of this set of solutions or when execution events reduce the expected value of this feasible solution set. Basic coordination with other agents is achieved simply by communicating schedule changes to those agents with inter-dependent activities. Then, as time permits, the core local problem solving infra-structure is used to drive an inter-agent option generation and query process, aimed at identifying opportunities for solution improvement through joint change. Using a simulator to model the environment, we compare the performance of our multiagent system with that of an expected optimal (but non-scalable) centralized MDP solver. expand
|
|
Commitment-driven distributed joint policy search |
|
Stefan Witwicki,
Edmund Durfee
|
|
Article No.: 75 |
|
doi>10.1145/1329125.1329216 |
|
Full text: PDF
|
|
Decentralized MDPs provide powerful models of interactions in multiagent environments, but are often very difficult or even computationally infeasible to solve optimally. Here we develop a hierarchical approach to solving a restricted set of decentralized ...
Decentralized MDPs provide powerful models of interactions in multiagent environments, but are often very difficult or even computationally infeasible to solve optimally. Here we develop a hierarchical approach to solving a restricted set of decentralized MDPs. By forming commitments with other agents and modeling these concisely in their local MDPs, agents effectively, efficiently, and distributively formulate co-ordinated local policies. We introduce a novel construction that captures commitments as constraints on local policies and show how Linear Programming can be used to achieve local optimality subject to these constraints. In contrast to other commitment enforcement approaches, we show ours to be more robust in capturing the intended commitment semantics while maximizing local utility. We also describe a commitment-space heuristic search algorithm that can be used to approximate optimal joint policies. A preliminary empirical evaluation suggests that our approach yields faster approximate solutions than the conventional encoding of the problem as a multiagent MDP would allow and, when wrapped in an exhaustive commitment-space search, will find the optimal global solution. expand
|
|
Distributed task allocation in social networks |
|
Mathijs de Weerdt,
Yingqian Zhang,
Tomas Klos
|
|
Article No.: 76 |
|
doi>10.1145/1329125.1329217 |
|
Full text: PDF
|
|
This paper proposes a new variant of the task allocation problem, where the agents are connected in a social network and tasks arrive at the agents distributed over the network. We show that the complexity of this problem remains NP-hard. Moreover, it ...
This paper proposes a new variant of the task allocation problem, where the agents are connected in a social network and tasks arrive at the agents distributed over the network. We show that the complexity of this problem remains NP-hard. Moreover, it is not approximable within some factor. We develop an algorithm based on the contract-net protocol. Our algorithm is completely distributed, and it assumes that agents have only local knowledge about tasks and resources. We conduct a set of experiments to evaluate the performance and scalability of the proposed algorithm in terms of solution quality and computation time. Three different types of networks, namely small-world, random and scale-free networks, are used to represent various social relationships among agents in realistic applications. The results demonstrate that our algorithm works well and that it scales well to large-scale applications. expand
|
|
SESSION: Cooperation, coordination, and teamwork: poster papers |
|
|
|
Agent coordination by trade-off between locally diffusion effects and socially structural influences |
|
Yichuan Jiang,
Jiuchuan Jiang,
Toru Ishida
|
|
Article No.: 77 |
|
doi>10.1145/1329125.1329219 |
|
Full text: PDF
|
|
There were always two separated methods to make agent coordination: individual-local balance perspective and individual-society balance perspective. The first method only considered the balance between individual agents and their local neighbors; the ...
There were always two separated methods to make agent coordination: individual-local balance perspective and individual-society balance perspective. The first method only considered the balance between individual agents and their local neighbors; the second method only considered the balance between individual agents and the whole multiagent society. However, in reality, the agents will be diffused by their local neighbors as well as influenced by their social contexts simultaneously; therefore, it is necessary to deal with the social performance as well as local performance. To address such problem this paper presents an agent coordination method in an integrative model where we combine the two perspectives together and make trade-off between them. With our presented model, the individual, local and social concerns can be balanced well in a unified and flexible manner. Moreover, the experimental results show that there are often situations in which the two coordination perspectives aren't conflictive but often bring out the better in each other. expand
|
|
Provisioning heterogeneous and unreliable providers for service workflows |
|
Sebastian Stein,
Nicholas R. Jennings,
Terry R. Payne
|
|
Article No.: 78 |
|
doi>10.1145/1329125.1329220 |
|
Full text: PDF
|
|
In this paper, we address the problem of provisioning unreliable and heterogeneous service providers for the constituent tasks of abstract workflows. Specifically, we deal with unreliable providers by provisioning multiple service providers redundantly ...
In this paper, we address the problem of provisioning unreliable and heterogeneous service providers for the constituent tasks of abstract workflows. Specifically, we deal with unreliable providers by provisioning multiple service providers redundantly for specific tasks, and we employ a local search mechanism to choose among many heterogeneous providers that offer the same type of service. We empirically show that our strategy can achieve significant improvements over current approaches, and we demonstrate that it works well over a range of environments. expand
|
|
The effect of task and environment factors on M.A.S. coordination and reorganization |
|
Mattijs Ghijsen,
Wouter Jansweijer,
Bob Wielinga
|
|
Article No.: 79 |
|
doi>10.1145/1329125.1329221 |
|
Full text: PDF
|
|
Research on organization of Multiagent Systems (M.A.S.) has shown that by adapting its organization, a M.A.S. is better able to operate in dynamic environments. In this paper we describe an experiment with a M.A.S. that consists of agents where the capability ...
Research on organization of Multiagent Systems (M.A.S.) has shown that by adapting its organization, a M.A.S. is better able to operate in dynamic environments. In this paper we describe an experiment with a M.A.S. that consists of agents where the capability to reorganize is integrated in their coordination mechanism. In the RoboCupRescue simulator we have implemented a M.A.S. where work can be coordinated according to three different coordination styles; direct supervision and standardization of skills with and without a reorganization extension. An experiment shows the effects of unknown workload distribution and incomplete information on the performance of the three styles. Results show significant interaction effects between both workload distribution and coordination mechanism, and completeness of information and coordination mechanism. Furthermore, results show that standardization of skills with reorganization performs better and is more robust to heterogeneous workload distribution and incompleteness of information. expand
|
|
ARTS: agent-oriented robust transactional system |
|
Mingzhong Wang,
Amy Unruh,
Kotagiri Ramamohanarao
|
|
Article No.: 80 |
|
doi>10.1145/1329125.1329222 |
|
Full text: PDF
|
|
This paper presents the ARTS (Agent-oriented Robust Transactional System) model, which applies transaction concepts to provide agent developers with high-level support for agent system robustness and reliability. ARTS abstractly considers agents as executors ...
This paper presents the ARTS (Agent-oriented Robust Transactional System) model, which applies transaction concepts to provide agent developers with high-level support for agent system robustness and reliability. ARTS abstractly considers agents as executors of encapsulated task entities which comply with a set of execution constraints on both normative execution and compensation (repair) semantics. ARTS then defines the task interface in terms of predictable terminating states to support a contract-like interaction among agents. In conjunction with this encapsulation of task semantics, ARTS defines a model for specifying scoped compensation and exception-handling plans for a given task, and for systematically selecting and executing these plans --- triggered by subtask events --- so that the enclosing task semantics are enforced. These capabilities together define a model that reduces design complexity while increasing system robustness, by allowing an agent developer to compose recursively-defined, atomically-handled tasks. expand
|
|
Towards a logical theory of coordination and joint ability |
|
Hojjat Ghaderi,
Hector Levesque,
Yves Lespérance
|
|
Article No.: 81 |
|
doi>10.1145/1329125.1329223 |
|
Full text: PDF
|
|
The coordination of teams of cooperating but autonomous agents is a core problem in multiagent systems research. A team of agents is jointly able to achieve a goal if despite any incomplete knowledge or even false beliefs that they may have about ...
The coordination of teams of cooperating but autonomous agents is a core problem in multiagent systems research. A team of agents is jointly able to achieve a goal if despite any incomplete knowledge or even false beliefs that they may have about the world or each other, they still know enough to be able to get to a goal state, should they choose to do so. Unlike in the single-agent case, the mere existence of a working plan is not sufficient since there may be several incompatible working plans and the agents may not be able to choose a share that coordinates with the others'. expand
|
|
Implementing the maximum of monotone algorithms |
|
Liad Blumrosen
|
|
Article No.: 82 |
|
doi>10.1145/1329125.1329224 |
|
Full text: PDF
|
|
Running several sub-optimal algorithms and choosing-the optimal one is a common procedure in computer science, most notably in the design of approximation algorithms. This paper deals with one significant flaw of this technique in environments where ...
Running several sub-optimal algorithms and choosing-the optimal one is a common procedure in computer science, most notably in the design of approximation algorithms. This paper deals with one significant flaw of this technique in environments where the inputs are provided by rational agents: such protocols are not necessarily incentive compatible even when the underlying algorithms are. We characterize sufficient and necessary conditions for such best-outcome protocols to be incentive compatible in a general model for agents with one-dimensional private data. We show how our techniques apply in several settings. expand
|
|
SESSION: Formal models of agency: full papers |
|
|
|
An adversarial environment model for bounded rational agents in zero-sum interactions |
|
Inon Zuckerman,
Sarit Kraus,
Jeffrey S. Rosenschein,
Gal Kaminka
|
|
Article No.: 83 |
|
doi>10.1145/1329125.1329226 |
|
Full text: PDF
|
|
Multiagent environments are often not cooperative nor collaborative; in many cases, agents have conflicting interests, leading to adversarial interactions. This paper presents a formal Adversarial Environment model for bounded rational agents ...
Multiagent environments are often not cooperative nor collaborative; in many cases, agents have conflicting interests, leading to adversarial interactions. This paper presents a formal Adversarial Environment model for bounded rational agents operating in a zero-sum environment. In such environments, attempts to use classical utility-based search methods can raise a variety of difficulties (e.g., implicitly modeling the opponent as an omniscient utility maximizer, rather than leveraging a more nuanced, explicit opponent model). We define an Adversarial Environment by describing the mental states of an agent in such an environment. We then present behavioral axioms that are intended to serve as design principles for building such adversarial agents. We explore the application of our approach by analyzing log files of completed Connect-Four games, and present an empirical analysis of the axioms' appropriateness. expand
|
|
Incentive compatible ranking systems |
|
Alon Altman,
Moshe Tennenholtz
|
|
Article No.: 84 |
|
doi>10.1145/1329125.1329227 |
|
Full text: PDF
|
|
Ranking systems are a fundamental ingredient of multiagent environments and Internet Technologies. These settings can be viewed as social choice settings with two distinguished properties: the set of agents and the set of alternatives coincide, and the ...
Ranking systems are a fundamental ingredient of multiagent environments and Internet Technologies. These settings can be viewed as social choice settings with two distinguished properties: the set of agents and the set of alternatives coincide, and the agents' preferences are dichotomous, and therefore classical impossibility results do not apply. In this paper we initiate the study of incentives in ranking systems, where agents act in order to maximize their position in the ranking, rather than to obtain a correct outcome. We consider several basic properties of ranking systems, and fully characterize the conditions under which incentive compatible ranking systems exist, demonstrating that in general no such system satisfying all the properties exists. expand
|
|
Reasoning about judgment and preference aggregation |
|
Thomas Ågotnes,
Wiebe van der Hoek,
Michael Wooldridge
|
|
Article No.: 85 |
|
doi>10.1145/1329125.1329228 |
|
Full text: PDF
|
|
Agents that must reach agreements with other agents need to reason about how their preferences, judgments, and beliefs might be aggregated with those of others by the social choice mechanisms that govern their interactions. The recently emerging field ...
Agents that must reach agreements with other agents need to reason about how their preferences, judgments, and beliefs might be aggregated with those of others by the social choice mechanisms that govern their interactions. The recently emerging field of judgment aggregation studies aggregation from a logical perspective, and considers how multiple sets of logical formulae can be aggregated to a single consistent set. As a special case, judgment aggregation can be seen to subsume classical preference aggregation. We present a modal logic that is intended to support reasoning about judgment aggregation scenarios (and hence, as a special case, about preference aggregation): the logical language is interpreted directly in judgment aggregation rules. We present a sound and complete axiomatisation of such rules. We show that the logic can express aggregation rules such as majority voting; rule properties such as independence; and results such as the discursive paradox, Arrow's theorem and Condorcet's paradox - which are derivable as formal theorems of the logic. The logic is parameterised in such a way that it can be used as a general framework for comparing the logical properties of different types of aggregation - including classical preference aggregation. expand
|
|
SESSION: Formal models of agency: poster papers |
|
|
|
Programming and simulation of quantum search agents |
|
Matthias Klusch,
René Schubotz
|
|
Article No.: 86 |
|
doi>10.1145/1329125.1329230 |
|
Full text: PDF
|
|
Key idea of this work is to appropriately extend one prominent generic agent architecture, namely InteRRap [8], to the case of a quantum pattern matching (QPM) based type-I quantum search agent (QSA) that is supposed to run on a hybrid quantum computer, ...
Key idea of this work is to appropriately extend one prominent generic agent architecture, namely InteRRap [8], to the case of a quantum pattern matching (QPM) based type-I quantum search agent (QSA) that is supposed to run on a hybrid quantum computer, and to show it's feasibility by instantiating the respective QuantumInteRRap architecture. For a comprehensive and in-depth introduction to quantum computation (QC) we refer the interested reader to [10]. An extended version of this work can be found at [6]. expand
|
|
A quantified epistemic logic for reasoning about multiagent systems |
|
F. Belardinelli,
A. Lomuscio
|
|
Article No.: 87 |
|
doi>10.1145/1329125.1329231 |
|
Full text: PDF
|
|
We investigate quantified interpreted systems, a semantics for multiagent systems in which agents can reason about individuals, their properties, and the relationships among them. We analyse a first-order epistemic language interpreted on this semantics ...
We investigate quantified interpreted systems, a semantics for multiagent systems in which agents can reason about individuals, their properties, and the relationships among them. We analyse a first-order epistemic language interpreted on this semantics and show soundness and completeness of Q.S5n, an axiomatisation for these structures. expand
|
|
A framework for reasoning about rational agents |
|
Wojciech Jamroga,
Nils Bulling
|
|
Article No.: 88 |
|
doi>10.1145/1329125.1329232 |
|
Full text: PDF
|
|
We propose an extension of alternating-time temporal logic, that can be used for reasoning about the behavior and abilities of agents under various rationality assumptions.
We propose an extension of alternating-time temporal logic, that can be used for reasoning about the behavior and abilities of agents under various rationality assumptions. expand
|
|
SESSION: Societal aspects: full papers |
|
|
|
A formal road from institutional norms to organizational structures |
|
Davide Grossi,
Frank Dignum,
John-Jules Ch. Meyer
|
|
Article No.: 89 |
|
doi>10.1145/1329125.1329234 |
|
Full text: PDF
|
|
Up to now, the way institutions and organizations have been used in the development of open systems has not often gone further than a useful heuristics. In order to develop systems actually implementing institutions and organizations, formal methods ...
Up to now, the way institutions and organizations have been used in the development of open systems has not often gone further than a useful heuristics. In order to develop systems actually implementing institutions and organizations, formal methods should take the place of heuristic ones. The paper presents a formal semantics for the notion of institution and its components (abstract and concrete norms, empowerment of agents, roles) and defines a formal relation between institutions and organizational structures. As a result, it is shown how institutional norms can be refined to constructs---organizational structures---which are closer to an implemented system. It is also shown how such a refinement process can be fully formalized and it is therefore amenable to rigorous verification. expand
|
|
Distributed norm management in regulated multiagent systems |
|
Dorian Gaertner,
Andres Garcia-Camino,
Pablo Noriega,
J.-A. Rodriguez-Aguilar,
Wamberto Vasconcelos
|
|
Article No.: 90 |
|
doi>10.1145/1329125.1329235 |
|
Full text: PDF
|
|
Norms are widely recognised as a means of coordinating multiagent systems. The distributed management of norms is a challenging issue and we observe a lack of truly distributed computational realisations of normative models. In order to regulate ...
Norms are widely recognised as a means of coordinating multiagent systems. The distributed management of norms is a challenging issue and we observe a lack of truly distributed computational realisations of normative models. In order to regulate the behaviour of autonomous agents that take part in multiple, related activities, we propose a normative model, the Normative Structure (NS), an artifact that is based on the propagation of normative positions (obligations, prohibitions, permissions), as consequences of agents' actions. Within a NS, conflicts may arise due to the dynamic nature of the MAS and the concurrency of agents' actions. However, ensuring conflict-freedom of a NS at design time is computationally intractable. We show this by formalising the notion of conflict, providing a mapping of NSs into Coloured Petri Nets and borrowing well-known theoretical results from that field. Since online conflict resolution is required, we present a tractable algorithm to be employed distributedly. We then demonstrate that this algorithm is paramount for the distributed enactment of a NS. expand
|
|
Resolving conflict and inconsistency in norm-regulated virtual organizations |
|
Wamberto Vasconcelos,
Martin J. Kollingbaum,
Timothy J. Norman
|
|
Article No.: 91 |
|
doi>10.1145/1329125.1329236 |
|
Full text: PDF
|
|
Norm-governed virtual organizations define, govern and facilitate coordinated resource sharing and problem solving in societies of agents. With an explicit account of norms, openness in virtual organizations can be achieved: new components, designed ...
Norm-governed virtual organizations define, govern and facilitate coordinated resource sharing and problem solving in societies of agents. With an explicit account of norms, openness in virtual organizations can be achieved: new components, designed by various parties, can be seamlessly accommodated. We focus on virtual organizations realised as multiagent systems, in which human and software agents interact to achieve individual and global goals. However, any realistic account of norms should address their dynamic nature: norms will change as agents interact with each other and their environment. Due to the changing nature of norms or due to norms stemming from different virtual organizations, there will be situations when an action is simultaneously permitted and prohibited, that is, a conflict arises. Likewise, there will be situations when an action is both obliged and prohibited, that is, an inconsistency arises. We introduce an approach, based on first-order unification, to detect and resolve such conflicts and inconsistencies. In our proposed solution, we annotate a norm with the set of values their variables should not have in order to avoid a conflict or an inconsistency with another norm. Our approach neatly accommodates the domain-dependent interrelations among actions and the indirect conflicts/inconsistencies these may cause. More generally, we can capture a useful notion of inter-agent (and inter-role) delegation of actions and norms associated to them, and use it to address conflicts/inconsistencies caused by action delegation. We illustrate our approach with an e-Science example in which agents support Grid services. expand
|
|
SESSION: Societal aspects: poster paper |
|
|
|
Friends no more: norm enforcement in multiagent systems |
|
Adrian Perreau de Pinninck,
Carles Sierra,
Marco Schorlemmer
|
|
Article No.: 92 |
|
doi>10.1145/1329125.1329238 |
|
Full text: PDF
|
|
We propose a new distributed mechanism to enforce norms by ostracizing agents that do not abide by them. Our simulations have shown that, although complete ostracism is not always possible, the mechanism substantially reduces the number of norm violations.
We propose a new distributed mechanism to enforce norms by ostracizing agents that do not abide by them. Our simulations have shown that, although complete ostracism is not always possible, the mechanism substantially reduces the number of norm violations. expand
|
|
SESSION: Learning: full papers |
|
|
|
Automatic feature extraction for autonomous general game playing agents |
|
David M. Kaiser
|
|
Article No.: 93 |
|
doi>10.1145/1329125.1329240 |
|
Full text: PDF
|
|
The General Game Playing (GGP) problem is concerned with developing systems capable of playing many different games, even games the system has never encountered before. Successful GGP agents must be able to extract relevant features from the formal game ...
The General Game Playing (GGP) problem is concerned with developing systems capable of playing many different games, even games the system has never encountered before. Successful GGP agents must be able to extract relevant features from the formal game description and construct effective search heuristics. In this article, we present a procedure by which autonomous General Game Playing agents can generate effective and efficient search heuristics from the formal game description. The major aspect of our approach is an innovative technique to automatically extract critical features from the game structure. Our method has been incorporated into a fully implemented system that came in fourth place at the second General Game Playing Competition held at AAAI-06. expand
|
|
Batch reinforcement learning in a complex domain |
|
Shivaram Kalyanakrishnan,
Peter Stone
|
|
Article No.: 94 |
|
doi>10.1145/1329125.1329241 |
|
Full text: PDF
|
|
Temporal difference reinforcement learning algorithms are perfectly suited to autonomous agents because they learn directly from an agent's experience based on sequential actions in the environment. However, their most common algorithmic variants are ...
Temporal difference reinforcement learning algorithms are perfectly suited to autonomous agents because they learn directly from an agent's experience based on sequential actions in the environment. However, their most common algorithmic variants are relatively inefficient in their use of experience data, which in many agent-based settings can be scarce. In particular, they make just one learning "update" for each atomic experience. Batch reinforcement learning algorithms, on the other hand, aim to achieve greater data efficiency by saving experience data and using it in aggregate to make updates to the learned policy. Their success has been demonstrated in the past on simple domains like grid worlds and low-dimensional control applications like pole balancing. In this paper, we compare and contrast batch reinforcement learning algorithms with on-line algorithms based on their empirical performance in a complex, continuous, noisy, multiagent domain, namely RoboCup soccer Keepaway. We find that the two batch methods we consider, Experience Replay and Fitted Q Iteration, both yield significant gains in sample complexity, while achieving high asymptotic performance. expand
|
|
Model-based function approximation in reinforcement learning |
|
Nicholas K. Jong,
Peter Stone
|
|
Article No.: 95 |
|
doi>10.1145/1329125.1329242 |
|
Full text: PDF
|
|
Reinforcement learning promises a generic method for adapting agents to arbitrary tasks in arbitrary stochastic environments, but applying it to new real-world problems remains difficult, a few impressive success stories notwithstanding. Most interesting ...
Reinforcement learning promises a generic method for adapting agents to arbitrary tasks in arbitrary stochastic environments, but applying it to new real-world problems remains difficult, a few impressive success stories notwithstanding. Most interesting agent-environment systems have large state spaces, so performance depends crucially on efficient generalization from a small amount of experience. Current algorithms rely on model-free function approximation, which estimates the long-term values of states and actions directly from data and assumes that actions have similar values in similar states. This paper proposes model-based function approximation, which combines two forms of generalization by assuming that in addition to having similar values in similar states, actions also have similar effects. For one family of generalization schemes known as averagers, computation of an approximate value function from an approximate model is shown to be equivalent to the computation of the exact value function for a finite model derived from data. This derivation both integrates two independent sources of generalization and permits the extension of model-based techniques developed for finite problems. Preliminary experiments with a novel algorithm, AMBI (Approximate Models Based on Instances), demonstrate that this approach yields faster learning on some standard benchmark problems than many contemporary algorithms. expand
|
|
A framework for agent-based distributed machine learning and data mining |
|
Jan Tozicka,
Michael Rovatsos,
Michal Pechoucek
|
|
Article No.: 96 |
|
doi>10.1145/1329125.1329243 |
|
Full text: PDF
|
|
This paper proposes a framework for agent-based distributed machine learning and data mining based on (i) the exchange of meta-level descriptions of individual learning processes among agents and (ii) online reasoning about learning success and learning ...
This paper proposes a framework for agent-based distributed machine learning and data mining based on (i) the exchange of meta-level descriptions of individual learning processes among agents and (ii) online reasoning about learning success and learning progress by learning agents. We present an abstract architecture that enables agents to exchange models of their local learning processes and introduces a number of different methods for integrating these processes. This allows us to apply existing agent interaction mechanisms to distributed machine learning tasks, thus leveraging the powerful coordination methods available in agent-based computing, and enables agents to engage in meta-reasoning about their own learning decisions. We apply this architecture to a real-world distributed clustering application to illustrate how the conceptual framework can be used in practical systems in which different learners may be using different datasets, hypotheses and learning algorithms. We report on experimental results obtained using this system, review related work on the subject, and discuss potential future extensions to the framework. expand
|
|
SESSION: Learning: poster papers |
|
|
|
Aggregation in multiagent systems and the problem of truth-tracking |
|
Gabriella Pigozzi,
Stephan Hartmann
|
|
Article No.: 97 |
|
doi>10.1145/1329125.1329245 |
|
Full text: PDF
|
|
One of the major problems that artificial intelligence needs to tackle is the combination of different and potentially conflicting sources of information. Examples are multi-sensor fusion, database integration and expert systems development. In this ...
One of the major problems that artificial intelligence needs to tackle is the combination of different and potentially conflicting sources of information. Examples are multi-sensor fusion, database integration and expert systems development. In this paper we are interested in the aggregation of propositional logic-based information, a problem recently addressed in the literature on information fusion. It has applications in multiagent systems that aim at aggregating the distributed agent-based knowledge into an (ideally) unique set of propositions. We consider a group of autonomous agents who individually hold a logically consistent set of propositions. Each set of propositions represents an agent's beliefs on issues on which the group has to make a collective decision. To make the collective decision, several aggregation procedures have been proposed in the literature. Assuming that all propositions in question are factually right or wrong, we ask how good belief fusion is as a truth tracker. Will it single out the true set of propositions? And how does information fusion compare with other aggregation procedures? We address these questions in a probabilistic framework and show that information fusion does especially well for agents with a middling competence of hitting the truth of an individual proposition. expand
|
|
Adapting in agent-based markets: a study from TAC SCM |
|
David Pardoe,
Peter Stone
|
|
Article No.: 98 |
|
doi>10.1145/1329125.1329246 |
|
Full text: PDF
|
|
An agent attempting to model market conditions may benefit from considering how various combinations of competitor strategies would impact these conditions. We give an illustration using a prediction task faced by our agent for the Supply Chain Management ...
An agent attempting to model market conditions may benefit from considering how various combinations of competitor strategies would impact these conditions. We give an illustration using a prediction task faced by our agent for the Supply Chain Management scenario of the Trading Agent Competition (TAC SCM). We present the learning approach taken, evaluate its effectiveness, and then explore methods of improving predictions through combining multiple sources of data reflecting various combinations of competitor behaviors. expand
|
|
Reinforcement learning with utility-aware agents for market-based resource allocation |
|
Eduardo Rodrigues Gomes,
Ryszard Kowalczyk
|
|
Article No.: 99 |
|
doi>10.1145/1329125.1329247 |
|
Full text: PDF
|
|
In this paper we propose and investigate the use of Reinforcement Learning in a market-based resource allocation mechanism called Iterative Price Adjustment. Under standard assumptions, this mechanism uses demand functions that do not allow the agents ...
In this paper we propose and investigate the use of Reinforcement Learning in a market-based resource allocation mechanism called Iterative Price Adjustment. Under standard assumptions, this mechanism uses demand functions that do not allow the agents to have preferences over the attributes of the allocation, e.g. the price of the resources. To address this limitation, we study the case where the agent's preferences in the resource allocation are described by utility functions and they learn the demand functions given their utility functions. The approach has been evaluated with extensive experiments. expand
|
|
Towards reinforcement learning representation transfer |
|
Matthew E. Taylor,
Peter Stone
|
|
Article No.: 100 |
|
doi>10.1145/1329125.1329248 |
|
Full text: PDF
|
|
Transfer learning problems are typically framed as leveraging knowledge learned on a source task to improve learning on a related, but different, target task. Current transfer methods are able to successfully transfer knowledge between agents in different ...
Transfer learning problems are typically framed as leveraging knowledge learned on a source task to improve learning on a related, but different, target task. Current transfer methods are able to successfully transfer knowledge between agents in different reinforcement learning tasks, reducing the time needed to learn the target. However, the complimentary task of representation transfer, i.e. transferring knowledge between agents with different internal representations, has not been well explored. The goal in both types of transfer problems is the same: reduce the time needed to learn the target with transfer, relative to learning the target without transfer. This work introduces one such representation transfer algorithm which is implemented in a complex multiagent domain. Experiments demonstrate that transferring the learned knowledge between different representations is both possible and beneficial. expand
|
|
SESSION: Auctions and electronic markets: full papers |
|
|
|
A winner determination algorithm for auction-based decentralized scheduling |
|
Chun Wang,
Hamada H. Ghenniwa,
Weiming Shen
|
|
Article No.: 101 |
|
doi>10.1145/1329125.1329250 |
|
Full text: PDF
|
|
This paper presents a formulation and an algorithm for the winner determination problem in auction-based decentralized scheduling. Without imposing a time line discretization, the proposed approach allows bidders to bid for the processing of a set of ...
This paper presents a formulation and an algorithm for the winner determination problem in auction-based decentralized scheduling. Without imposing a time line discretization, the proposed approach allows bidders to bid for the processing of a set of tasks under release time and due date constraints using an expressive bidding language designed for decentralized scheduling. The proposed winner determination algorithm uses a depth first branch and bound search. The search branches on bids and a constraint directed scheduling procedure is used at each node to verify the feasibility of the allocation. Experiments against a commercial optimization package, CPLEX 10.0, show that the proposed algorithm is more than an order of magnitude faster on average over a set of winner determination problems of decentralized scheduling generated based on a suite of job shop constraint satisfaction benchmark problems previously developed in the literature. expand
|
|
Bidding algorithms for a distributed combinatorial auction |
|
Benito Mendoza,
José M. Vidal
|
|
Article No.: 102 |
|
doi>10.1145/1329125.1329251 |
|
Full text: PDF
|
|
Distributed allocation and multiagent coordination problems can be solved through combinatorial auctions. However, most of the existing winner determination algorithms for combinatorial auctions are centralized. The PAUSE auction is one of a few efforts ...
Distributed allocation and multiagent coordination problems can be solved through combinatorial auctions. However, most of the existing winner determination algorithms for combinatorial auctions are centralized. The PAUSE auction is one of a few efforts to release the auctioneer from having to do all the work (it might even be possible to get rid of the auctioneer). It is an increasing price combinatorial auction that naturally distributes the problem of winner determination amongst the bidders in such a way that they have an incentive to perform the calculation. It can be used when we wish to distribute the computational load among the bidders or when the bidders do not wish to reveal their true valuations unless necessary. PAUSE establishes the rules the bidders must obey. However, it does not tell us how the bidders should calculate their bids. We have developed a couple of bidding algorithms for the bidders in a PAUSE auction. Our algorithms always return the set of bids that maximizes the bidder's utility. Since the problem is NP-Hard, run time remains exponential on the number of items, but it is remarkably better than an exhaustive search. In this paper we present our bidding algorithms, discuss their virtues and drawbacks, and compare the solutions obtained by them to the revenue-maximizing solution found by a centralized winner determination algorithm. expand
|
|
Outperforming the competition in multi-unit sealed bid auctions |
|
Ioannis A. Vetsikas,
Nicholas R. Jennings
|
|
Article No.: 103 |
|
doi>10.1145/1329125.1329252 |
|
Full text: PDF
|
|
In this paper, we examine the behavior of bidding agents that are in direct competition with the other participants in an auction setting. Thus the agents are not simply trying to maximize their own utility, rather they wish to maximize a weighted difference ...
In this paper, we examine the behavior of bidding agents that are in direct competition with the other participants in an auction setting. Thus the agents are not simply trying to maximize their own utility, rather they wish to maximize a weighted difference of their own gain to that of their competitors. By so doing, this work significantly extends the existing state-of-the-art results on single unit auctions, by generalizing to the multi-unit case. Specifically, our main result is the derivation of symmetric Bayes-Nash equilibria for these agents in both mth and (m + 1)th price sealed bid auctions. Subsequently, we use these equilibria to examine the profits of different agents and show that aiming to beat the competition is more effective than pure self interest in any competitive setting. Finally, we examine how the auctioneer's revenue is affected and find that the weight that agents place in minimizing the opponents' profit determines whether the mth or the (m + 1)th price auction yields a higher revenue. expand
|
|
Winner determination for mixed multi-unit combinatorial auctions via petri nets |
|
Andrea Giovannucci,
J. A. Rodriguez-Aguilar,
Jesus Cerquides,
Ulle Endriss
|
|
Article No.: 104 |
|
doi>10.1145/1329125.1329253 |
|
Full text: PDF
|
|
Mixed Multi-Unit Combinatorial Auctions (MMUCAs) allow agents to bid for bundles of goods to buy, goods to sell, and transformations of goods. In particular, MMUCAs offer a high potential to be employed for the automated assembly of supply chains of ...
Mixed Multi-Unit Combinatorial Auctions (MMUCAs) allow agents to bid for bundles of goods to buy, goods to sell, and transformations of goods. In particular, MMUCAs offer a high potential to be employed for the automated assembly of supply chains of agents offering goods and services, and in general MMUCAs extend and generalise several types of combinatorial auctions. Here we provide a formalism, based on an extension of Petri Nets, with which MMUCAs, and therefore all auction types subsumed by MMUCAs --- and in particular combinatorial auctions for supply chain formation (SCF)---, can be formally analysed. As a second direct benefit, consequence of the provided mapping to Petri Nets, we manage to dramatically reduce the number of decision variables involved in the optimisation problem posed by MMUCAs from quadratic to linear for a wide class of MMUCA Winner Determination Problems (WDPs). Hence, we also make headway in the practical application of MMUCAs, and in particular to SCF. expand
|
|
SESSION: Auctions and electronic markets: poster papers |
|
|
|
Average-case tractability of manipulation in voting via the fraction of manipulators |
|
Ariel D. Procaccia,
Jeffrey S. Rosenschein
|
|
Article No.: 105 |
|
doi>10.1145/1329125.1329255 |
|
|
|
Market-driven agents with uncertain and dynamic outside options |
|
Fenghui Ren,
Kwang Mong Sim,
Minjie Zhang
|
|
Article No.: 106 |
|
doi>10.1145/1329125.1329256 |
|
Full text: PDF
|
|
One of the most crucial criterion in automated negotiation is how to reach a consensus agreement for all negotiators under any negotiation environment. Currently, most negotiation strategies can work under the static environment only. This paper presents ...
One of the most crucial criterion in automated negotiation is how to reach a consensus agreement for all negotiators under any negotiation environment. Currently, most negotiation strategies can work under the static environment only. This paper presents a model for designing negotiation agents that makes adjustable rates of concession by reacting to changing market situations with uncertain and dynamic outside options. This work is based on the model of market-driven agents (MDAs). To determine the amount of the concession for each trading cycle, these market-driven agents are guided by four mathematical functions of trading opportunity, trading competition, trading time and strategy and trading eagerness. The contribution of this paper is designing and developing an extended MDA model with the flexibility to respond to uncertain and dynamic outside options, so as to increase problem solving ability for agent negotiation in broad application domains. expand
|
|
Infinitesimal nash transfers for resource allocation in strong social alliances |
|
Paul-Amaury Matt,
Francesca Toni
|
|
Article No.: 107 |
|
doi>10.1145/1329125.1329257 |
|
Full text: PDF
|
|
We introduce a distributed and tractable mechanism for the allocation of continuously divisible resources to agents, that locally maximises the (Nash) product of their individual welfare. The mechanism involves specific m-resources-at-a-time multilateral ...
We introduce a distributed and tractable mechanism for the allocation of continuously divisible resources to agents, that locally maximises the (Nash) product of their individual welfare. The mechanism involves specific m-resources-at-a-time multilateral deals over bits of resources, termed infinitesimal Nash transfers. It provides an effective way of building "strong social alliances", where in a social alliance agents fully cooperate for the global interest of society, and a strong social alliance has near-optimal utilitarian and egalitarian social welfare, as understood in social choice and welfare economics. The mechanism is scalable, can be distributed amongst agents and can be used to support, e.g., fair trade. expand
|
|
An incentive mechanism for eliciting fair ratings of sellers in e-marketplaces |
|
Jie Zhang,
Robin Cohen
|
|
Article No.: 108 |
|
doi>10.1145/1329125.1329258 |
|
Full text: PDF
|
|
In this paper, we propose a novel incentive mechanism for eliciting fair ratings of selling agents from buying agents. In our mechanism, buyers model other buyers and select the most trustworthy ones as their neighbors from whom they can ask advice about ...
In this paper, we propose a novel incentive mechanism for eliciting fair ratings of selling agents from buying agents. In our mechanism, buyers model other buyers and select the most trustworthy ones as their neighbors from whom they can ask advice about sellers. In addition, however, sellers model the reputation of buyers. Reputable buyers always provide fair ratings of sellers, and are likely to be neighbors of many other buyers. In marketplaces operating with our mechanism, sellers will increase quality and decrease prices of products to satisfy reputable buyers. In consequence, our mechanism creates incentives for buyers to provide fair ratings of sellers. expand
|
|
A service oriented marketplace for next generation networks |
|
Donna Griffin,
Dirk Pesch
|
|
Article No.: 109 |
|
doi>10.1145/1329125.1329259 |
|
Full text: PDF
|
|
During the past few decades, the field of telecommunications has been the subject to a continuous evolution. Kridel [1] associates this evolution to three interrelated phenomena: shifts in regulations, increased competition and technological progress. ...
During the past few decades, the field of telecommunications has been the subject to a continuous evolution. Kridel [1] associates this evolution to three interrelated phenomena: shifts in regulations, increased competition and technological progress. In addition to Kridels' phenomena, we see higher end user expectations with regards to service quality and price as another dimension to this evolution. Here, we propose a market place approach, called Agent Grid Service Marketplace (AGSM), that addresses those evolutionary factors [2]. expand
|
|
Online auctions for bidders with interdependent values |
|
Florin Constantin,
Takayuki Ito,
David C. Parkes
|
|
Article No.: 110 |
|
doi>10.1145/1329125.1329260 |
|
Full text: PDF
|
|
Interdependent values (IDV) is a valuation model allowing bidders in an auction to express their value for the item(s) to sell as a function of the other bidders' information. We investigate the incentive compatibility (IC) of single-item auctions for ...
Interdependent values (IDV) is a valuation model allowing bidders in an auction to express their value for the item(s) to sell as a function of the other bidders' information. We investigate the incentive compatibility (IC) of single-item auctions for IDV bidders in dynamic environments. We provide a necessary and sufficient characterization for IC in this setting. We show that if bidders can misreport departure times and private signals, no reasonable auction can be IC. We present a reasonable IC auction for the case where bidders cannot misreport departures. expand
|
|
SESSION: Distributed constraint processing: full papers |
|
|
|
A complete distributed constraint optimization method for non-traditional pseudotree arrangements |
|
James Atlas,
Keith Decker
|
|
Article No.: 111 |
|
doi>10.1145/1329125.1329262 |
|
Full text: PDF
|
|
Distributed Constraint Optimization (DCOP) is a general framework that can model complex problems in multiagent systems. Several current algorithms that solve general DCOP instances, including ADOPT and DPOP, arrange agents into a traditional pseudotree ...
Distributed Constraint Optimization (DCOP) is a general framework that can model complex problems in multiagent systems. Several current algorithms that solve general DCOP instances, including ADOPT and DPOP, arrange agents into a traditional pseudotree structure. We introduce an extension to the DPOP algorithm that handles an extended set of pseudotree arrangements. Our algorithm correctly solves DCOP instances for pseudotrees that include edges between nodes in separate branches. The algorithm also solves instances with traditional pseudotree arrangements using the same procedure as DPOP. We compare our algorithm with DPOP using several metrics including the induced width of the pseudotrees, the maximum dimensionality of messages and computation, and the maximum sequential path cost through the algorithm. We prove that for some problem instances it is not possible to generate a traditional pseudotree using edge-traversal heuristics that will outperform a cross-edged pseudotree. We use multiple heuristics to generate pseudotrees and choose the best pseudotree in linear space-time complexity. For some problem instances we observe significant improvements in message and computation sizes compared to DPOP. expand
|
|
Optimal on-line scheduling in stochastic multiagent systems in continuous space-time |
|
Wim Wiegerinck,
Bart van den Broek,
Bert Kappen
|
|
Article No.: 112 |
|
doi>10.1145/1329125.1329263 |
|
Full text: PDF
|
|
We consider multiagent systems with stochastic non-linear dynamics in continuous space-time. We focus on systems of agents that aim to visit a number of given target locations at given points in time at minimal control cost. The online optimization of ...
We consider multiagent systems with stochastic non-linear dynamics in continuous space-time. We focus on systems of agents that aim to visit a number of given target locations at given points in time at minimal control cost. The online optimization of which agent has to visit which target requires the solution of the Hamilton-Jacobi-Bellman (HJB) equation, which is a non-linear partial differential equation (PDE). Under some conditions, the log-transform can be applied to turn the HJB equation into a linear PDE. We then show that the optimal solution in the multiagent scheduling problem can be expressed in closed form as a sum of single schedule solutions. expand
|
|
Robust coordination to sustain throughput of an unstable agent network |
|
Rajesh Gautam,
Kazuo Miyashita
|
|
Article No.: 113 |
|
doi>10.1145/1329125.1329264 |
|
Full text: PDF
|
|
We present a multiagent coordination technique to maintain throughput of a large-scale agent network system in the face of failures of agents. Failures do not just deteriorate throughput of the system but also create and change bottlenecks in the system. ...
We present a multiagent coordination technique to maintain throughput of a large-scale agent network system in the face of failures of agents. Failures do not just deteriorate throughput of the system but also create and change bottlenecks in the system. Since loss of bottleneck's capacity degrades the overall system performance, the system should identify bottlenecks dynamically and keep their utilization at a high level. In our system, CABS, information about an agent's urgency of jobs to fulfill demanded throughput and maintain its utilization is passed to upstream agents in the network. Upstream agents utilize this information to identify bottleneck agents and coordinate their actions to provide the bottlenecks with necessary and sufficient jobs for preventing their starvation and congestion. We empirically evaluate CABS using a benchmark problem of the semiconductor fabrication process, which is a good example of a large-scale network system, in comparison with a well-known traditional manufacturing control method. expand
|
|
Sequential resource allocation in multiagent systems with uncertainties |
|
Jianhui Wu,
Edmund H. Durfee
|
|
Article No.: 114 |
|
doi>10.1145/1329125.1329265 |
|
Full text: PDF
|
|
Exchanging scarce resources during execution among a group of agents is one way to improve the overall performance in multiagent systems with limited shared resources, but implementing optimal sequential resource allocation is often a nontrivial problem ...
Exchanging scarce resources during execution among a group of agents is one way to improve the overall performance in multiagent systems with limited shared resources, but implementing optimal sequential resource allocation is often a nontrivial problem in complex systems with uncertainties. In this paper, we present an MILP-based algorithm that can automatically break a large mission into multiple phases and make optimal resource (re)allocations at the entry of each phase. We illustrate our algorithms through several increasingly complex classes of sequential resource allocation problems, and show through experiments that our techniques can increase agents' rewards for varying levels of constraints on resources and constraints on exchanging resources. expand
|
|
SESSION: Distributed constraint processing: poster papers |
|
|
|
Periodic real-time resource allocation for teams of progressive processing agents |
|
Gilles Steeve Dibangoye,
Abdel-Illah Mouaddib,
Brahim Chaib-Draa
|
|
Article No.: 115 |
|
doi>10.1145/1329125.1329267 |
|
Full text: PDF
|
|
In this paper, we focus on the problem of finding a periodic allocation strategy for teams of resource-bounded agents. We propose a real-time dynamic algorithm RTDA*, that exploits two key properties to avoid the exponential increase in the state and ...
In this paper, we focus on the problem of finding a periodic allocation strategy for teams of resource-bounded agents. We propose a real-time dynamic algorithm RTDA*, that exploits two key properties to avoid the exponential increase in the state and action spaces associated with multiagent systems. First, resource allocation at each time period follows an earliest deadline first order (EDF) over agents. Second, the resources are undivided, i.e., the resources allocated to an agent restrict their availability to others over time. We can therefore view each incoming agent as a cyclic individual resource-bounded processing, namely "cyclic progressive reasoning unit" (C-PRU), and solve, off-line, the single agent resource allocation problem. In the on-line phase, our algorithm exploits pre-compiled policies, as heuristic metrics, to build near-optimal joint decisions at each time period. expand
|
|
Reciprocal negotiation over shared resources in agent societies |
|
Sabyasachi Saha,
Sandip Sen
|
|
Article No.: 116 |
|
doi>10.1145/1329125.1329268 |
|
Full text: PDF
|
|
We are interested in domains where an agent repeatedly negotiates with other agents over shared resources where the demand or utility to the agent for the shared resources vary over time. We propose a protocol that will maximize social welfare if agents ...
We are interested in domains where an agent repeatedly negotiates with other agents over shared resources where the demand or utility to the agent for the shared resources vary over time. We propose a protocol that will maximize social welfare if agents reveal their true preferences in every negotiation. The protocol, however, is not truth-revealing and selfish agents have the incentive to artificially inflate preferences. We use a probabilistic reciprocative behavior that discourages the reporting of false preferences. This reciprocative behavior promotes cooperation in repeated negotiations and improves both individual and group long-term payoff. We characterize environmental conditions under which agents can develop and sustain mutually beneficial relationships with similar agents and avoid exploitation by different types of selfish agents. expand
|
|
Unifying distributed constraint algorithms in a BDI negotiation framework |
|
Bao Chau Le Dinh,
Kiam Tian Seow
|
|
Article No.: 117 |
|
doi>10.1145/1329125.1329269 |
|
Full text: PDF
|
|
This paper presents a novel, unified distributed constraint satisfaction framework based on automated negotiation. The Distributed Constraint Satisfaction Problem (DCSP) is one that entails several agents to search for an agreement, which is a consistent ...
This paper presents a novel, unified distributed constraint satisfaction framework based on automated negotiation. The Distributed Constraint Satisfaction Problem (DCSP) is one that entails several agents to search for an agreement, which is a consistent combination of actions that satisfies their mutual constraints in a shared environment. By anchoring the DCSP search on automated negotiation, we show that several well-known DCSP algorithms are actually mechanisms that can reach agreements through a common Belief-Desire-Intention (BDI) protocol, but using different strategies. A major motivation for this BDI framework is that it not only provides a conceptually clearer understanding of existing DCSP algorithms from an agent model perspective, but also opens up the opportunities to extend and develop new strategies for DCSP. To this end, a new strategy called Unsolicited Mutual Advice (UMA) is proposed. Performance evaluation shows that the UMA strategy can outperform some existing mechanisms in terms of computational cycles. expand
|
|
Matrix-based representation for coordination fault detection: a formal approach |
|
Meir Kalech,
Michael Lindner,
Gal A. Kaminka
|
|
Article No.: 118 |
|
doi>10.1145/1329125.1329270 |
|
Full text: PDF
|
|
Teamwork requires that team members coordinate their actions. The representation of the coordination is a key requirement since it influences the complexity and flexibility of reasoning team-members. One aspect of this requirement is detecting coordination ...
Teamwork requires that team members coordinate their actions. The representation of the coordination is a key requirement since it influences the complexity and flexibility of reasoning team-members. One aspect of this requirement is detecting coordination faults as a result of intermittent failures of sensors, communication failures, etc. Detection of such failures, based on observations of the behavior of agents, is of prime importance. Though different solutions have been presented thus far, none has presented a comprehensive and formal resolution to this problem. This paper presents a formal approach to representing multiagent coordination, and multiagent observations, using matrix structures. This representation facilitates easy representation of coordination requirements, modularity, flexibility and reuse of existing systems. Based on this representation we present a novel solution for fault detection that is both generic and efficient for large-scale teams. expand
|
|
Distributed constraint propagation for diagnosis of faults in physical processes |
|
Ana L. C. Bazzan,
Bruno C. da Silva
|
|
Article No.: 119 |
|
doi>10.1145/1329125.1329271 |
|
Full text: PDF
|
|
Most of the current research on distributed diagnosis in and for multiagent systems focuses on diagnosis of coordination failures. Proposed approaches for this problem are not efficient for diagnosing failures in physical devices. This paper proposes ...
Most of the current research on distributed diagnosis in and for multiagent systems focuses on diagnosis of coordination failures. Proposed approaches for this problem are not efficient for diagnosing failures in physical devices. This paper proposes algorithms for distributed troubleshooting of physical devices and processes. The consequences of using distributed representation of the knowledge, ATMS, and distributed reasoning are discussed and algorithms are proposed to deal with the occurrence of conflicts and the computation of the set of candidate diagnosis. expand
|
|
SESSION: Multiagent planning: full papers |
|
|
|
Distributed path planning for mobile robots using a swarm of interacting reinforcement learners |
|
Christopher M. Vigorito
|
|
Article No.: 120 |
|
doi>10.1145/1329125.1329273 |
|
Full text: PDF
|
|
Path planning for mobile robots in stochastic, dynamic environments is a difficult problem and the subject of much research in the field of robotics. While many approaches to solving this problem put the computational burden of path planning on the robot, ...
Path planning for mobile robots in stochastic, dynamic environments is a difficult problem and the subject of much research in the field of robotics. While many approaches to solving this problem put the computational burden of path planning on the robot, physical path planning methods place this burden on a set of sensor nodes distributed throughout the environment that can communicate information to each other about path costs. Previous approaches to physical path planning have looked at the performance of such networks in regular environments (e.g., office buildings) using highly structured, uniform deployments of networks (e.g., grids). Additionally, these networks do not make use of real experience obtained from the robots they assist in guiding. We extend previous work in this area by incorporating reinforcement learning techniques into these methods and show improved performance in simulated, rough terrain environments. We also show that these networks, which we term SWIRLs (Swarms of Interacting Reinforcement Learners), can perform well with deployment distributions that are not as highly structured as in previous approaches. expand
|
|
Dynamics based control with an application to area-sweeping problems |
|
Zinovi Rabinovich,
Jeffrey S. Rosenschein,
Gal A. Kaminka
|
|
Article No.: 121 |
|
doi>10.1145/1329125.1329274 |
|
Full text: PDF
|
|
In this paper we introduce Dynamics Based Control (DBC), an approach to planning and control of an agent in stochastic environments. Unlike existing approaches, which seek to optimize expected rewards (e.g., in Partially Observable Markov Decision Problems ...
In this paper we introduce Dynamics Based Control (DBC), an approach to planning and control of an agent in stochastic environments. Unlike existing approaches, which seek to optimize expected rewards (e.g., in Partially Observable Markov Decision Problems (POMDPs)), DBC optimizes system behavior towards specified system dynamics. We show that a recently developed planning and control approach, Extended Markov Tracking (EMT) is an instantiation of DBC. EMT employs greedy action selection to provide an efficient control algorithm in Markovian environments. We exploit this efficiency in a set of experiments that applied multi-target EMT to a class of area-sweeping problems (searching for moving targets). We show that such problems can be naturally defined and efficiently solved using the DBC framework, and its EMT instantiation. expand
|
|
Graph-based multiagent replanning algorithm |
|
Jian Feng Zhang,
Xuan Thang Nguyen,
Ryszard Kowalczyk
|
|
Article No.: 122 |
|
doi>10.1145/1329125.1329275 |
|
Full text: PDF
|
|
The paper presents a new approach for multiagent replanning based on Distributed Constraint Satisfaction (DisCSP) and Graph planning techniques. In this approach, a new distributed refinement strategy is proposed to construct a graph plan for fixing ...
The paper presents a new approach for multiagent replanning based on Distributed Constraint Satisfaction (DisCSP) and Graph planning techniques. In this approach, a new distributed refinement strategy is proposed to construct a graph plan for fixing errors occurred during the plan execution. The strategy employs an "max-branching" heuristic that can reduce the final graph plan size and allow faster completion time for the graph construction. The graph plan is then compiled into a DisCSP problem and solved using a multi-variable version of the Asynchronous Backtracking Algorithm. The approach is demonstrated with experiments which show that distributed planning graph and CSP can practically solve the replanning problems in a multiagent environment. expand
|
|
Towards a formal framework for multi-objective multiagent planning |
|
Abdel-Illah Mouaddib,
Mathieu Boussard,
Maroua Bouzid
|
|
Article No.: 123 |
|
doi>10.1145/1329125.1329276 |
|
Full text: PDF
|
|
Multi-Objective Multiagent Planning (MOMAP) addresses the problem of resolving conflicts between individual agent interests and the group interests. In this paper, we address this problem by presenting a formal framework to represent objective relationships, ...
Multi-Objective Multiagent Planning (MOMAP) addresses the problem of resolving conflicts between individual agent interests and the group interests. In this paper, we address this problem by presenting a formal framework to represent objective relationships, a decision model using a Vector-Valued Decentralized Markov Decision Process (2V-DEC-MDP) and an algorithm to solve the resulting 2V-DEC-MDP. The formal framework of a Vector-Valued MDP considered uses the value function which returns a vector representing the individual and the group interests. An optimal policy in such contexts is not clear but in this approach we develop a regret-based technique to find a good tradeoff between the group and individual interests. To do that, the approach we present uses Egalitarian Social Welfare orderings that allow an agent to consider during its local optimization the satisfaction of all criteria and reducing their differences. The obtained result is a good balance between individual and group satisfactions where the local policies can lead to more global satisfying behaviors in some settings. This result is illustrated in many examples and compared to alternate local policies. expand
|
|
SESSION: Multiagent planning: poster papers |
|
|
|
Knowledge and observations in the situation calculus |
|
Ryan F. Kelly,
Adrian R. Pearce
|
|
Article No.: 124 |
|
doi>10.1145/1329125.1329278 |
|
Full text: PDF
|
|
We present a powerful new account of multiagent knowledge in the situation calculus and an effective reasoning procedure for handling knowledge queries. Our approach generalizes existing work by reifying the observations made by each agent as the world ...
We present a powerful new account of multiagent knowledge in the situation calculus and an effective reasoning procedure for handling knowledge queries. Our approach generalizes existing work by reifying the observations made by each agent as the world evolves, allowing for agents that are partially or completely unaware of some of the actions that have occurred. This also enables agents to reason effectively about knowledge using only their internal history of observations, rather than requiring a full history of the world. The result is a more robust and flexible account of knowledge suitable for use in partially-observable multiagent domains. expand
|
|
Conflict estimation of abstract plans for multiagent systems |
|
Toshiharu Sugawara,
Satoshi Kurihara,
Toshio Hirotsu,
Kensuke Fukuda,
Toshihiro Takada
|
|
Article No.: 125 |
|
doi>10.1145/1329125.1329279 |
|
Full text: PDF
|
|
In hierarchical planning, selecting a plan at an abstract level affects planning performance because an abstract plan restricts the scope of primitive plans. However, if all primitive plans under the selected abstract plan have difficult-to-resolve conflicts ...
In hierarchical planning, selecting a plan at an abstract level affects planning performance because an abstract plan restricts the scope of primitive plans. However, if all primitive plans under the selected abstract plan have difficult-to-resolve conflicts with the plans of other agents, the final plan after conflict resolution will be inefficient or of low quality. In this paper, we propose a conflict estimation method to generate quality plans efficiently for multiagent systems by appropriately selecting abstract plans in hierarchical planning. This method enables agents to learn which abstract plans are less likely to cause conflicts or which conflicts will be easy to resolve. expand
|
|
Diagnosis of plan step errors and plan structure violations |
|
Cees Witteveen,
Nico Roos,
Adriaan ter Mors,
Xiaoyu Mao
|
|
Article No.: 126 |
|
doi>10.1145/1329125.1329280 |
|
Full text: PDF
|
|
Failures in plan execution can be attributed to errors in the execution of plan steps or violations of the plan structure. While in previous work we have concentrated on the first type of failures, in this paper we introduce the idea of diagnosing violations ...
Failures in plan execution can be attributed to errors in the execution of plan steps or violations of the plan structure. While in previous work we have concentrated on the first type of failures, in this paper we introduce the idea of diagnosing violations in the plan structure. The structure of a plan prescribes which actions have to be performed and which precedence constraints between them have to be respected. Especially in multiagent environments violations of plan structure might easily occur as the consequence of synchronization errors. Using a formal framework for plan diagnosis, we show how Model-Based Diagnosis can applied to identify these violations of plan structure specifications and we analyze the computational complexity of the associated diagnostic problems. expand
|
|
Integrating motivations with planning |
|
Alexandra Coddington
|
|
Article No.: 127 |
|
doi>10.1145/1329125.1329281 |
|
Full text: PDF
|
|
This paper presents two models of goal generation which enable a motivated autonomous agent to generate goals in response to changes in its underlying drives or motivations, while it is both planning and executing. A Mars rover domain is used ...
This paper presents two models of goal generation which enable a motivated autonomous agent to generate goals in response to changes in its underlying drives or motivations, while it is both planning and executing. A Mars rover domain is used to illustrate the two models: the first model involves goals being generated explicitly in response to changes to the agent's motivations where such goals are then provided to a planner, while the second model involves encoding the motivations and the goals that they may generate as part of the planner's domain model. Results from experiments on integrating the models with different planners suggest that while they may bring the benefits of autonomy that we seek, they also introduce more complexity into the planning problem. expand
|
|
SESSION: Environments and implementation techniques: full papers |
|
|
|
Implementing commitment-based interactions |
|
Michael Winikoff
|
|
Article No.: 128 |
|
doi>10.1145/1329125.1329283 |
|
Full text: PDF
|
|
Although agent interaction plays a vital role in MAS, and message-centric approaches to agent interaction have their drawbacks, present agent-oriented programming languages do not provide support for implementing agent interaction that is flexible and ...
Although agent interaction plays a vital role in MAS, and message-centric approaches to agent interaction have their drawbacks, present agent-oriented programming languages do not provide support for implementing agent interaction that is flexible and robust. Instead, messages are provided as a primitive building block. In this paper we consider one approach for modelling agent interactions: the commitment machines framework. This framework supports modelling interactions at a higher level (using social commitments), resulting in more flexible interactions. We investigate how commitment-based interactions can be implemented in conventional agent-oriented programming languages. The contributions of this paper are: a mapping from a commitment machine to a collection of BDI-style plans; extensions to the semantics of BDI programming languages; and an examination of two issues that arise when distributing commitment machines (turn management and race conditions) and solutions to these problems. expand
|
|
Normative system games |
|
Thomas Ågotnes,
Wiebe van der Hoek,
Michael Wooldridge
|
|
Article No.: 129 |
|
doi>10.1145/1329125.1329284 |
|
Full text: PDF
|
|
We develop a model of normative systems in which agents are assumed to have multiple goals of increasing priority, and investigate the computational complexity and game theoretic properties of this model. In the underlying model of normative systems, ...
We develop a model of normative systems in which agents are assumed to have multiple goals of increasing priority, and investigate the computational complexity and game theoretic properties of this model. In the underlying model of normative systems, we use Kripke structures to represent the possible transitions of a multiagent system. A normative system is then simply a subset of the Kripke structure, which contains the arcs that are forbidden by the normative system. We specify an agent's goals as a hierarchy of formulae of Computation Tree Logic (CTL), a widely used logic for representing the properties of Kripke structures: the intuition is that goals further up the hierarchy are preferred by the agent over those that appear further down the hierarchy. Using this scheme, we define a model of ordinal utility, which in turn allows us to interpret our Kripke-based normative systems as games, in which agents must determine whether to comply with the normative system or not. We then characterise the computational complexity of a number of decision problems associated with these Kripke-based normative system games; for example, we show that the complexity of checking whether there exists a normative system which has the property of being a Nash implementation is NP-complete. expand
|
|
Operational semantics of multiagent interactions |
|
Juan M. Serrano,
Sergio Saugar
|
|
Article No.: 130 |
|
doi>10.1145/1329125.1329285 |
|
Full text: PDF
|
|
The social stance advocated by institutional frameworks and most multiagent system methodologies has resulted in a wide spectrum of organizational and communicative abstractions which have found currency in several programming frameworks and software ...
The social stance advocated by institutional frameworks and most multiagent system methodologies has resulted in a wide spectrum of organizational and communicative abstractions which have found currency in several programming frameworks and software platforms. Still, these tools and frameworks are designed to support a limited range of interaction capabilities that constrain developers to a fixed set of particular, pre-defined abstractions. The main hypothesis motivating this paper is that the variety of multiagent interaction mechanisms -- both, organizational and communicative, share a common semantic core. In the realm of software architectures, the paper proposes a connector-based model of multiagent interactions which attempts to identify the essential structure underlying multiagent interactions. Furthermore, the paper also provides this model with a formal execution semantics which describes the dynamics of social interactions. The proposed model is intended as the abstract machine of an organizational programming language which allows programmers to accommodate an open set of interaction mechanisms. expand
|
|
Modular interpreted systems |
|
Wojciech Jamroga,
Thomas Ågotnes
|
|
Article No.: 131 |
|
doi>10.1145/1329125.1329286 |
|
Full text: PDF
|
|
We propose a new class of representations that can be used for modeling (and model checking) temporal, strategic and epistemic properties of agents and their teams. Our representations borrow the main ideas from interpreted systems of Halpern, ...
We propose a new class of representations that can be used for modeling (and model checking) temporal, strategic and epistemic properties of agents and their teams. Our representations borrow the main ideas from interpreted systems of Halpern, Fagin et al.; however, they are also modular and compact in the way concurrent programs are. We also mention preliminary results on model checking alternating-time temporal logic for this natural class of models. expand
|
|
SESSION: Environments and implementation techniques: poster papers |
|
|
|
A platform for massive agent-based simulation and its evaluation |
|
Gaku Yamamoto,
Hideki Tai,
Hideyuki Mizuta
|
|
Article No.: 132 |
|
doi>10.1145/1329125.1329288 |
|
Full text: PDF
|
|
There are many studies on ABS and several frameworks for ABS have already been published. However, there are few frameworks that can enable agent-based simulation using large numbers of agents. We have been developing a Java-based platform for Massive ...
There are many studies on ABS and several frameworks for ABS have already been published. However, there are few frameworks that can enable agent-based simulation using large numbers of agents. We have been developing a Java-based platform for Massive Agent-Based Simulation (MABS) called "Zillions of Agents-based Simulation Environment" or ZASE. The purpose of ZASE is to develop MABS applications on multiple computers. ZASE is designed to host over millions of agents. We introduce ZASE in this paper. We evaluated ZASE for the agent-based auction simulation where the number of agents varied from ten to a million. The results indicate that the number of agents affects the final bid prices and their distributions. Performance measurement results on both an SMP computer environment and a multiple-computer environment are shown. expand
|
|
The IRM4S model: the influence/reaction principle for multiagent based simulation |
|
Fabien Michel
|
|
Article No.: 133 |
|
doi>10.1145/1329125.1329289 |
|
Full text: PDF
|
|
The IRM4S model (Influence Reaction Model for Simulation) is an adaptation of the formalism of [2] for multiagent based simulations (MABS). The goal of IRM4S is to provide a framework that eases the use of the Influence/Reaction principle within MABS.
The IRM4S model (Influence Reaction Model for Simulation) is an adaptation of the formalism of [2] for multiagent based simulations (MABS). The goal of IRM4S is to provide a framework that eases the use of the Influence/Reaction principle within MABS. expand
|
|
A WSA-based architecture for building multiagent systems |
|
Aluizio Haendchen Filho,
Hércules Antônio do Prado,
Carlos José Pereira de Lucena
|
|
Article No.: 134 |
|
doi>10.1145/1329125.1329290 |
|
Full text: PDF
|
|
This paper discusses some advantages of applying the SOA paradigm for the MAS development, showing a framework whose architecture follows the WSA reference model. We believe that service-oriented paradigm can simplify the MAS development because it demands ...
This paper discusses some advantages of applying the SOA paradigm for the MAS development, showing a framework whose architecture follows the WSA reference model. We believe that service-oriented paradigm can simplify the MAS development because it demands a much simpler coordination level than the traditional approaches focused on the message-oriented model. expand
|
|
Mass programmed agents for simulating human strategies in large scale systems |
|
Michal Chalamish,
David Sarne,
Sarit Kraus
|
|
Article No.: 135 |
|
doi>10.1145/1329125.1329291 |
|
Full text: PDF
|
|
Simulation is an important tool for studying systems' behavior under specific conditions. In recent years, agent technology has been recognized as a promising new approach for developing simulation systems, in particular, Multiagent Systems (MAS) based ...
Simulation is an important tool for studying systems' behavior under specific conditions. In recent years, agent technology has been recognized as a promising new approach for developing simulation systems, in particular, Multiagent Systems (MAS) based simulations. expand
|
|
Breaking into industry: tool support for multiagent systems |
|
Simon Lynch,
Keerthi Rajendran
|
|
Article No.: 136 |
|
doi>10.1145/1329125.1329292 |
|
Full text: PDF
|
|
Multiagent systems (MAS) research is well established yet there is little development of agent technology in industry. It has been suggested that this is due, in part, to the unavailability of support tools for Agent Oriented Software Engineering (AOSE). ...
Multiagent systems (MAS) research is well established yet there is little development of agent technology in industry. It has been suggested that this is due, in part, to the unavailability of support tools for Agent Oriented Software Engineering (AOSE). This paper suggests requirements for Integrated Development Environments (IDEs) to support MAS construction. We suggest that an IDE can be built as its own MAS which allows it to be decoupled from any particular agent framework thereby allowing it to be platform independent. expand
|
|
Deterministic nonlinear modeling of ant algorithm with logistic multiagent system |
|
Rodolphe Charrier,
Christine Bourjot,
Francois Charpillet
|
|
Article No.: 137 |
|
doi>10.1145/1329125.1329293 |
|
Full text: PDF
|
|
Ant algorithms are one of the main programming paradigms in swarm intelligence. They are built on stochastic decision functions, which can also be found in other types of bio-inspired algorithms with the same mathematical form. However, though this modeling ...
Ant algorithms are one of the main programming paradigms in swarm intelligence. They are built on stochastic decision functions, which can also be found in other types of bio-inspired algorithms with the same mathematical form. However, though this modeling leads to high-performance algorithms, some phenomena, like symmetry break, are still not well understood or modeled at the ant level. This paper proposes an original analysis of the problem : we establish a reactive multiagent system based on logistic nonlinear decision maps, and designed according to the influence-reaction scheme. Our proposition is an entirely novel approach to the mathematical foundations of ant algorithms : contrary to the current stochastic approaches, we show that an alternative deterministic model exists, which has its origin in deterministic chaos theory. The rewriting of the decision functions leads to a new way of understanding and visualizing the convergence behavior of ant algorithms. We apply our approach on a concrete example, namely the binary bridge problem. expand
|
|
Practical extensions in agent programming languages |
|
Mehdi Dastani,
Dirk Hobo,
John-Jules Ch. Meyer
|
|
Article No.: 138 |
|
doi>10.1145/1329125.1329294 |
|
Full text: PDF
|
|
This paper proposes programming constructs to improve the practical application of existing BDI-based agent-oriented programming languages that have formal semantics. The proposed programming constructs include operations such as testing, adopting and ...
This paper proposes programming constructs to improve the practical application of existing BDI-based agent-oriented programming languages that have formal semantics. The proposed programming constructs include operations such as testing, adopting and dropping declarative goals, different execution modes for plans, repairing plans when their execution fail, event and exception handling mechanisms, and interfaces to existing imperative and declarative programming languages. expand
|
|
Predictive fault tolerance in multiagent systems: a plan-based replication approach |
|
Alessandro de Luna Almeida,
Samir Aknine,
Jean-Pierre Briot,
Jacques Malenfant
|
|
Article No.: 139 |
|
doi>10.1145/1329125.1329295 |
|
Full text: PDF
|
|
The fact that multiagent applications are prone to the same faults that any distributed system is susceptible to and the need for a higher quality of service in these systems justify the increasing interest in fault-tolerant multiagent systems. In this ...
The fact that multiagent applications are prone to the same faults that any distributed system is susceptible to and the need for a higher quality of service in these systems justify the increasing interest in fault-tolerant multiagent systems. In this article, we propose an original method for providing dependability in multiagent systems through replication. Our method is different from other works because our research focuses on building an automatic, adaptive and predictive replication policy where critical agents are replicated to minimize the impact of failures. This policy is determined by taking into account the criticality of the plans of the agents, which contain the collective and individual behaviors of the agents in the application. The set of replication strategies applied at a given moment to an agent is then fine-tuned gradually by the replication system so as to reflect the dynamicity of the multiagent system. Some preliminary measurements were made to assess the efficiency of our approach and future directions are presented. expand
|
|
Filters for semantic service composition in service-oriented multiagent systems |
|
Alberto Fernández,
Sascha Ossowski
|
|
Article No.: 140 |
|
doi>10.1145/1329125.1329296 |
|
Full text: PDF
|
|
In Service-Oriented MAS middle-agents provide different kinds of matchmaking functionalities. If no adequate services are available for a specific request, a planning functionality can be used to build up composite services. In order to take advantage ...
In Service-Oriented MAS middle-agents provide different kinds of matchmaking functionalities. If no adequate services are available for a specific request, a planning functionality can be used to build up composite services. In order to take advantage of recent advances in the field of AI planning for this purpose, we propose exploiting organisational information of Service-Oriented MAS to heuristically filter out those services that are probably irrelevant to the planning process. We present a novel framework for service-class based filtering and show how it can be instantiated to a particular MAS domain based on organisational information. expand
|
|
Towards reflective mobile agents for resource-constrained mobile devices |
|
Conor Muldoon,
Gregory M. P. O'Hare,
John F. Bradley
|
|
Article No.: 141 |
|
doi>10.1145/1329125.1329297 |
|
Full text: PDF
|
|
The vision of ubiquitous computing is one in which resource constrained mobile devices form ad-hoc networks to enable the delivery of services that are sensitive and responsive to people. Such networks are dynamic and must be capable of dealing with ...
The vision of ubiquitous computing is one in which resource constrained mobile devices form ad-hoc networks to enable the delivery of services that are sensitive and responsive to people. Such networks are dynamic and must be capable of dealing with uncertain information. As emergent system behaviour begins to evolve the complexity of pervasive systems increases. The agent development community have been addressing issues of complexity and uncertainty, within distributed computing, for several years. This paper details the migration process of Agent Factory Micro Edition (AFME), a minimised footprint agent platform for resource constrained mobile devices. The process enables agents to migrate from a resource rich environment to a resource constrained mobile device and vice versa. It requires agents to dynamically alter their form and behaviour so as to adapt to their context. expand
|
|
Prediction horizons in polyagent models |
|
H. Van Dyke Parunak,
Theodore C. Belding,
Sven Brueckner
|
|
Article No.: 142 |
|
doi>10.1145/1329125.1329298 |
|
Full text: PDF
|
|
Many agent-based models predict the future. Nonlinear interactions in most non-trivial domains make predictions useless beyond a certain point (the "prediction horizon"), as agent trajectories diverge. We exhibit this behavior in a simple agent-based ...
Many agent-based models predict the future. Nonlinear interactions in most non-trivial domains make predictions useless beyond a certain point (the "prediction horizon"), as agent trajectories diverge. We exhibit this behavior in a simple agent-based model, and discuss how a single agent in such a model can estimate the prediction horizon locally and use this estimate to modulate dynamically how far it gazes into the future. expand
|
|
Towards simulating billions of agents in thousands of seconds |
|
I. V. Aprameya Rao,
Manish Jain,
Kamalakar Karlapalem
|
|
Article No.: 143 |
|
doi>10.1145/1329125.1329299 |
|
Full text: PDF
|
|
Building multiagent systems that can scale up to very large number of agents is a challenging research problem. In this paper, we present Distributed Multi Agent System Framework (DMASF), a system which can simulate billions of agents in thousands of ...
Building multiagent systems that can scale up to very large number of agents is a challenging research problem. In this paper, we present Distributed Multi Agent System Framework (DMASF), a system which can simulate billions of agents in thousands of seconds. DMASF utilizes distributed computation to gain performance as well as a database to manage the agent and environment state. We briefly present the design and implementation of DMASF and present experimental results. DMASF is a generic and versatile tool that can be used for building massive multi agent system applications. expand
|
|
Building small worlds in unstructured P2P networks using a multiagent Bayesian inference mechanism |
|
Prithviraj Dasgupta
|
|
Article No.: 144 |
|
doi>10.1145/1329125.1329300 |
|
Full text: PDF
|
|
Over the past few years, peer-to-peer(p2p) unstructured networks have emerged as an attractive paradigm for enabling online interactions between a large number of users in a decentralized manner. However, the decentralized nature of unstructured p2p ...
Over the past few years, peer-to-peer(p2p) unstructured networks have emerged as an attractive paradigm for enabling online interactions between a large number of users in a decentralized manner. However, the decentralized nature of unstructured p2p networks makes load balancing a challenging problem. Specifically, the self-interested nature of users on the nodes of a p2p network and dynamic changes in network topology give rise to an unbalanced distribution of nodes across an unstructured p2p network. This results in network congestion and significant search latencies for all nodes. In this paper, we describe a small-world network model and a Bayesian inference mechanism within a multiagent setting to address these issues. Simulation results for a file sharing p2p application show that our algorithm achieves an exponential reduction in number of messages exchanged and improves load-balancing across the network. expand
|
|
SESSION: Formal models of agency: full papers |
|
|
|
A temporal epistemic logic with a reset operation |
|
Alessio Lomuscio,
Bożena Woźna
|
|
Article No.: 145 |
|
doi>10.1145/1329125.1329302 |
|
Full text: PDF
|
|
We present an axiomatisation for an extension of a temporal epistemic logic with an epistemic "reset" operator defined on the intersection between epistemic and temporal relations. Additionally we show the logic has the finite model property, hence it ...
We present an axiomatisation for an extension of a temporal epistemic logic with an epistemic "reset" operator defined on the intersection between epistemic and temporal relations. Additionally we show the logic has the finite model property, hence it is decidable. expand
|
|
Agents, beliefs, and plausible behavior in a temporal setting |
|
Nils Bulling,
Wojciech Jamroga
|
|
Article No.: 146 |
|
doi>10.1145/1329125.1329303 |
|
Full text: PDF
|
|
Logics of knowledge and belief are often too static and inflexible to be used on real-world problems. In particular, they usually offer no concept for expressing that some course of events is more likely to happen than another. We address this ...
Logics of knowledge and belief are often too static and inflexible to be used on real-world problems. In particular, they usually offer no concept for expressing that some course of events is more likely to happen than another. We address this problem and extend CTLK (computation tree logic with knowledge) with a notion of plausibility, which allows for practical and counterfactual reasoning. The new logic CTLKP (CTLK with plausibility) includes also a particular notion of belief. A plausibility update operator is added to this logic in order to change plausibility assumptions dynamically. Furthermore, we examine some important properties of these concepts. In particular, we show that, for a natural class of models, belief is a KD45 modality. We also show that model checking CTLKP is PTIME-complete and can be done in time linear with respect to the size of models and formulae. expand
|
|
A grounded specification language for agent programs |
|
Mehdi Dastani,
M. Birna van Riemsdijk,
John-Jules Ch. Meyer
|
|
Article No.: 147 |
|
doi>10.1145/1329125.1329304 |
|
Full text: PDF
|
|
This paper studies the relation between agent specification and agent programming languages. In particular, it shows that an agent programming language obeys some desirable properties expressed in an agent specification language, i.e., that any agent ...
This paper studies the relation between agent specification and agent programming languages. In particular, it shows that an agent programming language obeys some desirable properties expressed in an agent specification language, i.e., that any agent implemented by the programming language satisfies the desirable property expressed in the specification language. We study this relation by defining and aligning the semantics of an agent specification language and implementation language, and prove that certain properties expressed in the specification language are satisfied by the implementation language. expand
|
|
SESSION: Formal models of agency: poster papers |
|
|
|
Contextual deliberation of cognitive agents in defeasible logic |
|
M. Dastani,
G. Governatori,
A. Rotolo,
I. Song,
L. van der Torre
|
|
Article No.: 148 |
|
doi>10.1145/1329125.1329306 |
|
Full text: PDF
|
|
Cognitive agents often deliberate about preferences among rules. Consider, for example, an agent with the obligation to travel to Paris next week leading to a desire to travel by train, with the preferences that if the desire to travel by train cannot ...
Cognitive agents often deliberate about preferences among rules. Consider, for example, an agent with the obligation to travel to Paris next week leading to a desire to travel by train, with the preferences that if the desire to travel by train cannot be met, then there is a desire to travel by plane. Such an agent may reason about preferences among rules as follows: • The rule leading to a desire to travel by train is preferred to the rule leading to a desire to travel by plane, maybe as a second alternative. expand
|
|
A design framework for generating BDI-agents from goal models |
|
Loris Penserini,
Anna Perini,
Angelo Susi,
Mirko Morandini,
John Mylopoulos
|
|
Article No.: 149 |
|
doi>10.1145/1329125.1329307 |
|
Full text: PDF
|
|
We define a tool-supported design framework that allows to specify an agent goal model and to automatically generate fragments of a BDI agent from it. We devise the design process as a transformation process from platform-independent design models to ...
We define a tool-supported design framework that allows to specify an agent goal model and to automatically generate fragments of a BDI agent from it. We devise the design process as a transformation process from platform-independent design models to platform-specific models and then to code. The design framework is demonstrated by referring to the Tropos methodology and to the JADE/Jadex platform. In this short paper, key steps in the process are illustrated through an example. expand
|
|
Give agents their artifacts: the A&A approach for engineering working environments in MAS |
|
Alessandro Ricci,
Mirko Viroli,
Andrea Omicini
|
|
Article No.: 150 |
|
doi>10.1145/1329125.1329308 |
|
Full text: PDF
|
|
In human society, almost any cooperative working context accounts for different kinds of object, tool, artifacts in general, that humans adopt, share and intelligently exploit so as to support their working activities, in particular social ones. According ...
In human society, almost any cooperative working context accounts for different kinds of object, tool, artifacts in general, that humans adopt, share and intelligently exploit so as to support their working activities, in particular social ones. According to theories in human sciences--Activity Theory and Distributed Cognition are two main examples [5, 4]--and to related disciplines in computer science--such as Computer Supported Cooperative Work (CSCW) and Human-Computer Interaction (HCI)--such entities have a key role in determining the success or failure of the activities, playing an essential function in simplifying complex tasks and--more generally--in designing solutions that scale with activity complexity. Such a perspective can be found also in some works in the context of Distributed Artificial Intelligence [1, 2]. Analogously to the human case, we claim that also (cognitive) multiagent systems (MAS) could greatly benefit from the definition and systematic exploitation of a suitable notion of working environment, composed by different sorts of artifacts, dynamically constructed, shared and used by agents to support their working activities. Along this line, in this paper first we introduce a conceptual framework called A&A (Agents and Artifacts) which aims at directly modelling and engineering working environments in the context of cognitive multiagent systems; then, we provide a brief overview of the basic technologies that support such an approach, CARTAGO in particular--a Java-based framework for engineering working environments to be integrated with heterogeneous agent platforms. Such a perspective is strenghtened by recent efforts in AOSE (Agent-Oriented Software Engineering) that remark the fundamental role of the environment for the engineering of MAS [8]. The A&A framework can be considered an instance of such approaches, with some specific peculiarity: (i) abstractions and generality---the aim is to find a basic set of conceptual abstractions and related theory which, analogously to the agent abstraction, could be general enough to be the basis to define concrete architectures and programming environments, but specific enough to capture the essential properties of systems; (ii) cognitive---analogous to designed environment in human society, the properties of such environment abstractions should be conceived to be suitably and effectively exploited by cognitive agents, as intelligent constructors / users / manipulators of the environment. The work presented in this paper generalises and extends our previous work focussed on coordination artifacts presented at [6], and more recent works about the notion of artifact [7]. expand
|
|
From desires to intentions through dialectical analysis |
|
Nicolas D. Rotstein,
Alejandro J. Garcia,
Guillermo R. Simari
|
|
Article No.: 151 |
|
doi>10.1145/1329125.1329309 |
|
Full text: PDF
|
|
In this work, we introduce a framework where defeasible argumentation is used for reasoning about beliefs, desires and intentions. A dialectical filtering process is introduced in order to obtain a subset of the agent's desires containing only those ...
In this work, we introduce a framework where defeasible argumentation is used for reasoning about beliefs, desires and intentions. A dialectical filtering process is introduced in order to obtain a subset of the agent's desires containing only those that are actually achievable in the current situation. In our framework, different agents types can be defined and this will affect the way in which current desires are obtained. Finally, intentions will be current desires that the agent may commit to pursue. expand
|
|
Convergence and rate of convergence of a simple ant model |
|
Amine Boumaza,
Bruno Scherrer
|
|
Article No.: 152 |
|
doi>10.1145/1329125.1329310 |
|
Full text: PDF
|
|
We present a simple ant model that solves a discrete foraging problem. We provide simulations and a convergence analysis. We argue that the ant population computes the solutions of some optimal control problems and converges in some well defined sense. ...
We present a simple ant model that solves a discrete foraging problem. We provide simulations and a convergence analysis. We argue that the ant population computes the solutions of some optimal control problems and converges in some well defined sense. We also discuss the rate of convergence with respect to the number of ants: we give experimental and theoretical arguments that suggest that this rate is superlinear with respect to the number of agents. expand
|
|
Delegation and mental states |
|
Emiliano Lorini,
Nicolas Troquard,
Andreas Herzig,
Cristiano Castelfranchi
|
|
Article No.: 153 |
|
doi>10.1145/1329125.1329311 |
|
Full text: PDF
|
|
In the recent literature on multiagent systems there have been several proposals of formal systems for reasoning about delegation. Most of these approaches have dealt with the concept of delegation leaving mental states such as beliefs, goals and intentions ...
In the recent literature on multiagent systems there have been several proposals of formal systems for reasoning about delegation. Most of these approaches have dealt with the concept of delegation leaving mental states such as beliefs, goals and intentions out of consideration. The aim of this paper is to develop a formal approach for reasoning about delegation by modeling intentions and beliefs of the delegating agent in an explicit way. We present a logic where it is possible to investigate the relations between the concept of Intention to be and the concept of Delegation. expand
|
|
Model-based belief merging without distance measures |
|
Verónica Borja Macías,
Pilar Pozos Parra
|
|
Article No.: 154 |
|
doi>10.1145/1329125.1329312 |
|
Full text: PDF
|
|
Merging operators try to define the beliefs of a group of agents according to the beliefs of each member of the group. Several model-based propositional belief merging operators have been proposed which use distance measures and aggregation functions. ...
Merging operators try to define the beliefs of a group of agents according to the beliefs of each member of the group. Several model-based propositional belief merging operators have been proposed which use distance measures and aggregation functions. This paper introduces the notion of Partial Satisfiability which is an alternative way of measuring the satisfaction of a formula since this notion lets us have satisfaction values in the interval [0,1]. Partial Satisfiability allows us to define model-based merging operator. The proposal produces similar results to other merging approaches, but while other approaches require many merging operators in order to achieve satisfactory results for different scenarios this proposal obtains similar results for all these different scenarios with a unique operator. Moreover, unlike most of model-based approaches, this approach considers the case where the belief bases are inconsistent. The framework presented is in a preliminary state and further analysis of its properties is needed in order to characterize the proposed merging operator in terms of postulates. expand
|
|
SESSION: Argumentation and negotiation: full papers |
|
|
|
A multilateral multi-issue negotiation protocol |
|
Miniar Hemaissia,
Amal El Fallah Seghrouchni,
Christophe Labreuche,
Juliette Mattioli
|
|
Article No.: 155 |
|
doi>10.1145/1329125.1329314 |
|
Full text: PDF
|
|
In this paper, we present a new protocol to address multilateral multi-issue negotiation in a cooperative context. We consider complex dependencies between multiple issues by modelling the preferences of the agents with a multi-criteria decision aid ...
In this paper, we present a new protocol to address multilateral multi-issue negotiation in a cooperative context. We consider complex dependencies between multiple issues by modelling the preferences of the agents with a multi-criteria decision aid tool, also enabling us to extract relevant information on a proposal assessment. This information is used in the protocol to help in accelerating the search for a consensus between the cooperative agents. In addition, the negotiation procedure is defined in a crisis management context where the common objective of our agents is also considered in the preferences of a mediator agent. expand
|
|
Approximate and online multi-issue negotiation |
|
Shaheen S. Fatima,
Michael Wooldridge,
Nicholas R. Jennings
|
|
Article No.: 156 |
|
doi>10.1145/1329125.1329315 |
|
Full text: PDF
|
|
This paper analyzes bilateral multi-issue negotiation between self-interested autonomous agents. The agents have time constraints in the form of both deadlines and discount factors. There are m > 1 issues for negotiation where each issue is ...
This paper analyzes bilateral multi-issue negotiation between self-interested autonomous agents. The agents have time constraints in the form of both deadlines and discount factors. There are m > 1 issues for negotiation where each issue is viewed as a pie of size one. The issues are "indivisible" (i.e., individual issues cannot be split between the parties; each issue must be allocated in its entirety to either agent). Here different agents value different issues differently. Thus, the problem is for the agents to decide how to allocate the issues between themselves so as to maximize their individual utilities. For such negotiations, we first obtain the equilibrium strategies for the case where the issues for negotiation are known a priori to the parties. Then, we analyse their time complexity and show that finding the equilibrium offers is an NP-hard problem, even in a complete information setting. In order to overcome this computational complexity, we then present negotiation strategies that are approximately optimal but computationally efficient, and show that they form an equilibrium. We also analyze the relative error (i.e., the difference between the true optimum and the approximate). The time complexity of the approximate equilibrium strategies is O(nm/ε2) where n is the negotiation deadline and ε the relative error. Finally, we extend the analysis to online negotiation where different issues become available at different time points and the agents are uncertain about their valuations for these issues. Specifically, we show that an approximate equilibrium exists for online negotiation and show that the expected difference between the optimum and the approximate is O(√m). These approximate strategies also have polynomial time complexity. expand
|
|
A randomized method for the shapley value for the voting game |
|
Shaheen S. Fatima,
Michael Wooldridge,
Nicholas R. Jennings
|
|
Article No.: 157 |
|
doi>10.1145/1329125.1329316 |
|
Full text: PDF
|
|
The Shapley value is one of the key solution concepts for coalition games. Its main advantage is that it provides a unique and fair solution, but its main problem is that, for many coalition games, the Shapley value cannot be determined in polynomial ...
The Shapley value is one of the key solution concepts for coalition games. Its main advantage is that it provides a unique and fair solution, but its main problem is that, for many coalition games, the Shapley value cannot be determined in polynomial time. In particular, the problem of finding this value for the voting game is known to be #P-complete in the general case. However, in this paper, we show that there are some specific voting games for which the problem is computationally tractable. For other general voting games, we overcome the problem of computational complexity by presenting a new randomized method for determining the approximate Shapley value. The time complexity of this method is linear in the number of players. We also show, through empirical studies, that the percentage error for the proposed method is always less than 20% and, in most cases, less than 5%. expand
|
|
A unified and general framework for argumentation-based negotiation |
|
Leila Amgoud,
Yannis Dimopoulos,
Pavlos Moraitis
|
|
Article No.: 158 |
|
doi>10.1145/1329125.1329317 |
|
Full text: PDF
|
|
This paper proposes a unified and general framework for argumentation-based negotiation, in which the role of argumentation is formally analyzed. The framework makes it possible to study the outcomes of an argumentation-based negotiation. ...
This paper proposes a unified and general framework for argumentation-based negotiation, in which the role of argumentation is formally analyzed. The framework makes it possible to study the outcomes of an argumentation-based negotiation. It shows what an agreement is, how it is related to the theories of the agents, when it is possible, and how this can be attained by the negotiating agents in this case. It defines also the notion of concession, and shows in which situation an agent will make one, as well as how it influences the evolution of the dialogue. expand
|
|
Learning and joint deliberation through argumentation in multiagent systems |
|
Santi Ontañón,
Enric Plaza
|
|
Article No.: 159 |
|
doi>10.1145/1329125.1329318 |
|
Full text: PDF
|
|
In this paper we will present an argumentation framework for learning agents (AMAL) designed for two purposes: (1) for joint deliberation, and (2) for learning from communication. The AMAL framework is completely based on learning from examples: the ...
In this paper we will present an argumentation framework for learning agents (AMAL) designed for two purposes: (1) for joint deliberation, and (2) for learning from communication. The AMAL framework is completely based on learning from examples: the argument preference relation, the argument generation policy, and the counterargument generation policy are case-based techniques. For join deliberation, learning agents share their experience by forming a committee to decide upon some joint decision. We experimentally show that the argumentation among committees of agents improves both the individual and joint performance. For learning from communication, an agent engages into arguing with other agents in order to contrast its individual hypotheses and receive counterexamples; the argumentation process improves their learning scope and individual performance. expand
|
|
Arguing and explaining classifications |
|
Leila Amgoud,
Mathieu Serrurier
|
|
Article No.: 160 |
|
doi>10.1145/1329125.1329319 |
|
Full text: PDF
|
|
Argumentation is a promising approach used by autonomous agents for reasoning about inconsistent knowledge, based on the construction and the comparison of arguments. In this paper, we apply this approach to the classification problem, whose purpose ...
Argumentation is a promising approach used by autonomous agents for reasoning about inconsistent knowledge, based on the construction and the comparison of arguments. In this paper, we apply this approach to the classification problem, whose purpose is to construct from a set of training examples a model (or hypothesis) that assigns a class to any new example. We propose a general formal argumentation-based model that constructs arguments for/against each possible classification of an example, evaluates them, and determines among the conflicting arguments the acceptable ones. Finally, a "valid" classification of the example is suggested. Thus, not only the class of the example is given, but also the reasons behind that classification are provided to the user as well in a form that is easy to grasp. We show that such an argumentation-based approach for classification offers other advantages, like for instance classifying examples even when the set of training examples is inconsistent, and considering more general preference relations between hypotheses. Moreover, we show that in the particular case of concept learning, the results of version space theory are retrieved in an elegant way in our argumentation framework. expand
|
|
SESSION: Cooperation, coordination, and teamwork: full papers |
|
|
|
Searching for joint gains in automated negotiations based on multi-criteria decision making theory |
|
Quoc Bao Vo,
Lin Padgham
|
|
Article No.: 161 |
|
doi>10.1145/1329125.1329321 |
|
Full text: PDF
|
|
It is well established by conflict theorists and others that successful negotiation should incorporate "creating value" as well as "claiming value." Joint improvements that bring benefits to all parties can be realised by (i) identifying attributes that ...
It is well established by conflict theorists and others that successful negotiation should incorporate "creating value" as well as "claiming value." Joint improvements that bring benefits to all parties can be realised by (i) identifying attributes that are not of direct conflict between the parties, (ii) tradeoffs on attributes that are valued differently by different parties, and (iii) searching for values within attributes that could bring more gains to one party while not incurring too much loss on the other party. In this paper we propose an approach for maximising joint gains in automated negotiations by formulating the negotiation problem as a multi-criteria decision making problem and taking advantage of several optimisation techniques introduced by operations researchers and conflict theorists. We use a mediator to protect the negotiating parties from unnecessary disclosure of information to their opponent, while also allowing an objective calculation of maximum joint gains. We separate out attributes that take a finite set of values (simple attributes) from those with continuous values, and we show that for simple attributes, the mediator can determine the Pareto-optimal values. In addition we show that if none of the simple attributes strongly dominates the other simple attributes, then truth telling is an equilibrium strategy for negotiators during the optimisation of simple attributes. We also describe an approach for improving joint gains on non-simple attributes, by moving the parties in a series of steps, towards the Pareto-optimal frontier. expand
|
|
Matrix-based representation for coordination fault detection: a formal approach |
|
Meir Kalech,
Michael Lindner,
Gal A. Kaminka
|
|
Article No.: 162 |
|
doi>10.1145/1329125.1329322 |
|
Full text: PDF
|
|
Teamwork requires that team members coordinate their actions. The representation of the coordination is a key requirement since it influences the complexity and flexibility of reasoning team---members. One aspect of this requirement is detecting coordination ...
Teamwork requires that team members coordinate their actions. The representation of the coordination is a key requirement since it influences the complexity and flexibility of reasoning team---members. One aspect of this requirement is detecting coordination faults as a result of intermittent failures of sensors, communication failures, etc. Detection of such failures, based on observations of the behavior of agents, is of prime importance. Though different solutions have been presented thus far, none has presented a comprehensive and formal resolution to this problem. This paper presents a formal approach to representing multiagent coordination, and multiagent observations, using matrix structures. This representation facilitates easy representation of coordination requirements, modularity, flexibility and reuse of existing systems. Based on this representation we present a novel solution for fault-detection that is both generic and efficient for large-scale teams. expand
|
|
Unifying distributed constraint algorithms in a BDI negotiation framework |
|
Bao Chau Le Dinh,
Kiam Tian Seow
|
|
Article No.: 163 |
|
doi>10.1145/1329125.1329323 |
|
Full text: PDF
|
|
This paper presents a novel, unified distributed constraint satisfaction framework based on automated negotiation. The Distributed Constraint Satisfaction Problem (DCSP) is one that entails several agents to search for an agreement, which is a consistent ...
This paper presents a novel, unified distributed constraint satisfaction framework based on automated negotiation. The Distributed Constraint Satisfaction Problem (DCSP) is one that entails several agents to search for an agreement, which is a consistent combination of actions that satisfies their mutual constraints in a shared environment. By anchoring the DCSP search on automated negotiation, we show that several well-known DCSP algorithms are actually mechanisms that can reach agreements through a common Belief-Desire-Intention (BDI) protocol, but using different strategies. A major motivation for this BDI framework is that it not only provides a conceptually clearer understanding of existing DCSP algorithms from an agent model perspective, but also opens up the opportunities to extend and develop new strategies for DCSP. To this end, a new strategy called Unsolicited Mutual Advice (UMA) is proposed. Performance evaluation shows that the UMA strategy can outperform some existing mechanisms in terms of computational cycles. expand
|
|
SESSION: Trust and reputation: full papers |
|
|
|
Dynamically learning sources of trust information: experience vs. reputation |
|
Karen K. Fullam,
K. Suzanne Barber
|
|
Article No.: 164 |
|
doi>10.1145/1329125.1329325 |
|
Full text: PDF
|
|
Trust is essential when an agent must rely on others to provide resources for accomplishing its goals. When deciding whether to trust, an agent may rely on, among other types of trust information, its past experience with the trustee or on reputations ...
Trust is essential when an agent must rely on others to provide resources for accomplishing its goals. When deciding whether to trust, an agent may rely on, among other types of trust information, its past experience with the trustee or on reputations provided by third-party agents. However, each type of trust information has strengths and weaknesses: trust models based on past experience are more certain, yet require numerous transactions to build, while reputations provide a quick source of trust information, but may be inaccurate due to unreliable reputation providers. This research examines how the accuracy of experience- and reputation-based trust models is influenced by parameters such as: frequency of transactions with the trustee, trustworthiness of the trustee, and accuracy of provided reputations. More importantly, this research presents a technique for dynamically learning the best source of trust information given these parameters. The demonstrated learning technique achieves payoffs equal to those achieved by the best single trust information source (experience or reputation) in nearly every scenario examined. expand
|
|
Rumours and reputation: evaluating multi-dimensional trust within a decentralised reputation system |
|
Steven Reece,
Alex Rogers,
Stephen Roberts,
Nicholas R. Jennings
|
|
Article No.: 165 |
|
doi>10.1145/1329125.1329326 |
|
Full text: PDF
|
|
In this paper we develop a novel probabilistic model of computational trust that explicitly deals with correlated multi-dimensional contracts. Our starting point is to consider an agent attempting to estimate the utility of a contract, and we show that ...
In this paper we develop a novel probabilistic model of computational trust that explicitly deals with correlated multi-dimensional contracts. Our starting point is to consider an agent attempting to estimate the utility of a contract, and we show that this leads to a model of computational trust whereby an agent must determine a vector of estimates that represent the probability that any dimension of the contract will be successfully fulfilled, and a covariance matrix that describes the uncertainty and correlations in these probabilities. We present a formalism based on the Dirichlet distribution that allows an agent to calculate these probabilities and correlations from their direct experience of contract outcomes, and we show that this leads to superior estimates compared to an alternative approach using multiple independent beta distributions. We then show how agents may use the sufficient statistics of this Dirichlet distribution to communicate and fuse reputation within a decentralised reputation system. Finally, we present a novel solution to the problem of rumour propagation within such systems. This solution uses the notion of private and shared information, and provides estimates consistent with a centralised reputation system, whilst maintaining the anonymity of the agents, and avoiding bias and overconfidence. expand
|
|
Presumptive selection of trust evidence |
|
Pierpaolo Dondio,
Stephen Barrett
|
|
Article No.: 166 |
|
doi>10.1145/1329125.1329327 |
|
Full text: PDF
|
|
1. This paper proposes a generic method for identifying elements in a domain that can be used as trust evidences. As an alternative to external infrastructured approaches based on certificates or user recommendations we propose a computation based on ...
1. This paper proposes a generic method for identifying elements in a domain that can be used as trust evidences. As an alternative to external infrastructured approaches based on certificates or user recommendations we propose a computation based on evidences gathered directly from application elements that have been recognized to have a trust meaning. However, when the selection of evidences is done using a dedicated infrastructure or user's collaboration it remains a well-bounded problem. Instead, when evidences must be selected directly from domain activity selection is generally unsystematic and subjective, typically resulting in an unbounded problem. To address these issues, our paper proposes a general methodology for selecting trust evidences among elements of the domain under analysis. The method uses presumptive reasoning combined with a human-based and intuitive notion of Trust. Using the method the problem of evidence selection becomes the critical analysis of identified evidences plausibility against the situation and their logical consistency. We present an evaluation, in the context of the Wikipedia project, in which trust predictions based on evidences identified by our method are compared to a computation based on domain-specific expertise. expand
|
|
SESSION: Trust and reputation: poster papers |
|
|
|
Agent-based model of impact of socioeconomic stressors: a dynamic network perspective |
|
Shah Jamal Alam,
Ruth Meyer,
Emma Norling
|
|
Article No.: 167 |
|
doi>10.1145/1329125.1329329 |
|
Full text: PDF
|
|
We have developed an agent-based simulation model based on a real case study in the Sekhukhune district of the Limpopo province in South Africa. The work reported here is part of an ongoing project that deals with social complexity emerging from bottom-up ...
We have developed an agent-based simulation model based on a real case study in the Sekhukhune district of the Limpopo province in South Africa. The work reported here is part of an ongoing project that deals with social complexity emerging from bottom-up as a result of individuals' interaction. We model these interactions at the individual and the household level. Networks that result from the social processes in this simulation are dynamic and co-evolving. We outline a possible way of comparing simulation snapshots when the population size does not depend on a network's global characteristic. expand
|
|
Reducing the complexity of logics for multiagent systems |
|
Marcin Dziubiński,
Rineke Verbrugge,
Barbara Dunin-Kȩplicz
|
|
Article No.: 168 |
|
doi>10.1145/1329125.1329330 |
|
Full text: PDF
|
|
Theories of multiagent systems (MAS), in particular those based on modal logics, often suffer from a high computational complexity. This is due in part to the combination of agents' individual attitudes (beliefs, goals and intentions), and even more ...
Theories of multiagent systems (MAS), in particular those based on modal logics, often suffer from a high computational complexity. This is due in part to the combination of agents' individual attitudes (beliefs, goals and intentions), and even more importantly to the presence of group attitudes, such as common belief and collective intention. expand
|
|
Strategy recommender agents (ALEX) - the methodology |
|
Ronald Uriel Ruiz Ordóñez,
Josep Lluis de la Rosa i Esteva,
Javier Guzmán-Obando
|
|
Article No.: 169 |
|
doi>10.1145/1329125.1329331 |
|
Full text: PDF
|
|
Agents for Alignment into strategy Experience (ALEX agents), a type of recommender agent (RA), are proposed here as a means of helping employees to perform tasks in line with the strategy adopted by their particular organization. ...
Agents for Alignment into strategy Experience (ALEX agents), a type of recommender agent (RA), are proposed here as a means of helping employees to perform tasks in line with the strategy adopted by their particular organization. Such an organization is represented in this approach by Fuzzified Strategic Maps (FSMs). Collaborative filtering (CF), opinion based filtering (OBF) and a Human Values Scale (HVS) are used in the ALEX-based approach to interpret the organization's strategy and actions aligned with it. ALEX agents can send messages to their users (the employees) to guide them towards meeting organizational aims aligned with that strategy and with the behavior of the organization's other employees or recommender agents. A real-life example using members of a research group in a European university is presented. expand
|
|
InterPol: a policy framework for managing trust and privacy in referral networks |
|
Yathiraj B. Udupi,
Munindar P. Singh
|
|
Article No.: 170 |
|
doi>10.1145/1329125.1329332 |
|
Full text: PDF
|
|
Referral networks are a kind of P2P system consisting of autonomous agents who seek, provide services, or refer other service providers. Key applications include service discovery and selection, and knowledge sharing. This use of referrals is inspired ...
Referral networks are a kind of P2P system consisting of autonomous agents who seek, provide services, or refer other service providers. Key applications include service discovery and selection, and knowledge sharing. This use of referrals is inspired by human interactions, where referrals are a key basis for judging the trustworthiness of a given service. The use of referrals enable an agent to control how its request is processed, it also provides an architectural basis for four kinds of interaction policies. InterPol is a language and framework supporting such policies. InterPol has been implemented using a Datalog-based policy engine for each agent. It has been applied on scenarios from a (multinational) health care project. The contribution of this paper is in a general referrals-based framework for privacy and trust management, which is shown to effectively capture a variety of privacy and trust requirements of autonomous users. expand
|
|
SSDPOP: improving the privacy of DCOP with secret sharing |
|
Rachel Greenstadt,
Barbara Grosz,
Michael D. Smith
|
|
Article No.: 171 |
|
doi>10.1145/1329125.1329333 |
|
Full text: PDF
|
|
multiagent systems designed to work collaboratively with groups of people typically require private information that people will entrust to them only if they have assurance that this information will be protected. Although Distributed Constraint Optimization ...
multiagent systems designed to work collaboratively with groups of people typically require private information that people will entrust to them only if they have assurance that this information will be protected. Although Distributed Constraint Optimization (DCOP) has emerged as a prominent technique for multiagent coordination, existing algorithms for solving DCOP problems do not adeqately protect agents' privacy. This paper analyzes privacy protection and loss in existing DCOP algorithms. It presents a new algorithm, SSDPOP, which augments a prominent DCOP algorithm (DPOP) with secret sharing techniques. This approach significantly reduces privacy loss, while preserving the structure of the DPOP algorithm and introducing only minimal computational overhead. Results show that SSDPOP reduces privacy loss by 29--88% on average over DPOP. expand
|
|
Towards provably secure trust and reputation systems in e-marketplaces |
|
Reid Kerr,
Robin Cohen
|
|
Article No.: 172 |
|
doi>10.1145/1329125.1329334 |
|
Full text: PDF
|
|
In this paper, we present a framework for evaluating the security of trust and reputation systems for electronic marketplaces populated with buying and selling agents. Our proposed framework offers a method for researchers to understand the security ...
In this paper, we present a framework for evaluating the security of trust and reputation systems for electronic marketplaces populated with buying and selling agents. Our proposed framework offers a method for researchers to understand the security of their systems, and to provide precise guarantees of the degree of provable security that these systems offer. We demonstrate the viability of our proposed framework by presenting a specific monetary-based trust system known as Trunits, along with an analysis that shows that Trunits provides a guaranteed level of security for buyers. expand
|
|
Mechanisms for norm emergence in multiagent societies |
|
Bastin Tony Roy Savarimuthu,
Maryam Purvis
|
|
Article No.: 173 |
|
doi>10.1145/1329125.1329335 |
|
Full text: PDF
|
|
Norms are shared expectations of behaviours that exist in human societies. Norms help societies by increasing the predictability of individual behaviours and by improving co-operation and collaboration among members. Norms have been of interest to Multiagent ...
Norms are shared expectations of behaviours that exist in human societies. Norms help societies by increasing the predictability of individual behaviours and by improving co-operation and collaboration among members. Norms have been of interest to Multiagent Systems (MAS) researchers as software agents may violate norms due to their autonomy. In order to built robust MAS that are norm compliant and systems that evolve and adapt norms dynamically, the study of norms is crucial. Our research focuses on how norms emerge in agent societies. In this paper we propose two mechanisms for norm emergence. expand
|
|
Social comparison in crowds: a short report |
|
Gal A. Kaminka,
Natalie Fridman
|
|
Article No.: 174 |
|
doi>10.1145/1329125.1329336 |
|
Full text: PDF
|
|
Modeling crowd behavior is an important challenge for cognitive modelers. We propose a novel model of crowd behavior, based on Festinger's Social Comparison Theory, a social psychology theory known and expanded since the early 1950's. We propose a concrete ...
Modeling crowd behavior is an important challenge for cognitive modelers. We propose a novel model of crowd behavior, based on Festinger's Social Comparison Theory, a social psychology theory known and expanded since the early 1950's. We propose a concrete framework for SCT, and evaluate its implementations in several crowd behavior scenarios. Results from task measures and human judges evaluation shows that the SCT model produces improved results compared to base models from the literature. expand
|
|
Socially embedded multi agent based simulation of financial market |
|
Kiyoshi Izumi,
Hiroki Matsui,
Yutaka Matsuo
|
|
Article No.: 175 |
|
doi>10.1145/1329125.1329337 |
|
Full text: PDF
|
|
This paper proposed a new approach that integrated an artificial market simulation and text-mining with real information. In this approach, economic trends were extracted from text data circulating in the real world. Then, the trends were inputted into ...
This paper proposed a new approach that integrated an artificial market simulation and text-mining with real information. In this approach, economic trends were extracted from text data circulating in the real world. Then, the trends were inputted into the market simulation. The simulation could support users' action to the actual market. This approach was used for the decision of exchange rate policy and suggested that the operation by intervention was effective for the stabilization of the yen-dollar rate in 1995. Our simulation revealed that the action rule proposed by our system could reduce over 70% of rate fluctuation. This approach can offer a useful social simulation as a tool to users. expand
|
|
Robust methods for tracking intelligent agents playing in an artificial financial market |
|
Nachi Gupta,
Raphael Hauser,
Neil F. Johnson
|
|
Article No.: 176 |
|
doi>10.1145/1329125.1329338 |
|
Full text: PDF
|
|
When analyzing financial time-series for predictability, the norm has been to find trends and patterns directly in the series despite the inherent dynamical system apparent at the individual agent level. This underlying buy and sell model provides more ...
When analyzing financial time-series for predictability, the norm has been to find trends and patterns directly in the series despite the inherent dynamical system apparent at the individual agent level. This underlying buy and sell model provides more information than the time-series alone. We provide a methodology for finding pockets of predictability in a financial time-series using a multiagent market model and an empirical study to illustrate convergence of these methods. expand
|
|
Preservation of obligations in a temporal and deontic framework |
|
Jan Broersen,
Julien Brunel
|
|
Article No.: 177 |
|
doi>10.1145/1329125.1329339 |
|
Full text: PDF
|
|
We study logical properties that concern the preservation of future-directed obligations that have not been fulfilled yet. Our starting point is a product of temporal and deontic logics. We investigate some modifications of the semantics of the product ...
We study logical properties that concern the preservation of future-directed obligations that have not been fulfilled yet. Our starting point is a product of temporal and deontic logics. We investigate some modifications of the semantics of the product in order to satisfy preservation properties, without loosing too much of the basic properties of the product. We arrive at a semantics in which we only consider ideal histories that share the same past as the current one, and that enables a characterization of the states in which the obligations propagate. These are the states where any obligation of a formula that concerns the present moment is not violated. When there are such violations, the deontic realm switches to a lower level of ideality. expand
|
|
Integrating authority, deontics, and communications within a joint intention framework |
|
Marcus J. Huber,
Sanjeev Kumar,
David McGee,
Sean A. Lisse
|
|
Article No.: 178 |
|
doi>10.1145/1329125.1329340 |
|
Full text: PDF
|
|
Many agents are fielded within environments requiring modeling traditional organizational structures such as military hierarchies and corporations, with their associated authority relationships and a strong form of responsibility associated with the ...
Many agents are fielded within environments requiring modeling traditional organizational structures such as military hierarchies and corporations, with their associated authority relationships and a strong form of responsibility associated with the subordinate agents. Furthermore, communications between agents placed in such environments benefit from a strong, consistent semantic model to express not only the source's goal but more importantly their intent as well. Addressing the need above, we have developed an integrated semantic framework for modeling and operationalizing authority relationships, deontic operators, and inter-agent communications based upon joint intention theory. This allows us to not only regularize the representation and reasoning components of agents but to also realize: improved coordination due to enhanced agent teamwork (persistence and robustness toward successful achievement of goals even in the face of problems); improved conformance of the behavior of each agent according to its organizational role and authority related to other organizational positions; and improved bounding of the behavior of agents when faced with the imposition of deontic operators from various sources. expand
|
|
Making social choices from individuals' CP-nets |
|
Keith Purrington,
Edmund H. Durfee
|
|
Article No.: 179 |
|
doi>10.1145/1329125.1329341 |
|
Full text: PDF
|
|
CP-nets are an attractive model for representing individual preferences, in part because they allow us to find the best outcome for an agent in time that is proportional to just the number of features in an outcome. In this paper, we investigate whether ...
CP-nets are an attractive model for representing individual preferences, in part because they allow us to find the best outcome for an agent in time that is proportional to just the number of features in an outcome. In this paper, we investigate whether similar efficiencies can apply to finding the best social outcome for agents whose individual preferences are captured in CP-nets. Because CP-nets provide only qualitative information, we adopt a way to compare outcomes across agents based on each outcome's relative standing in the individuals' spaces of possible outcomes. This in turn guides the search through the outcome preference graphs that are induced by the agents' CP-nets to find the optimal social outcome. Because these induced preference graphs are exponential in the number of features, we examine the conditions under which the agents can search directly using their CP-nets, and show that our approach yields near-optimal social outcomes in exponentially less time. expand
|
|
Reputation in the joint venture game |
|
Philip Hendrix,
Barbara J. Grosz
|
|
Article No.: 180 |
|
doi>10.1145/1329125.1329342 |
|
Full text: PDF
|
|
In many settings, agents need to identify competent partners to assist them in accomplishing tasks. Direct experience may not provide sufficient data to learn the competence of other agents. Reputation---a community-based assessment of agent competence---can ...
In many settings, agents need to identify competent partners to assist them in accomplishing tasks. Direct experience may not provide sufficient data to learn the competence of other agents. Reputation---a community-based assessment of agent competence---can augment direct experience, but is prone to error. This paper addresses the question of when reputation information is useful, examining a variety of multiagent settings. It provides a systematic study of the way the utility of reputation varies by group size, group competency, level of error, and whether reputation information is available. Results demonstrate that the utility received from reputation increases as group size increases. However, the experiments also show that reputation is useful in small groups, during early rounds of a game series. These results also revealed a "pigeonholing phenomenon" in which highly capable agents are miscategorized by the reputation system as having low competence based on early sequences of low performance. This effect can be countered by introducing a systematic positive bias to the system. expand
|
|
SESSION: Applications and computational environments: full papers |
|
|
|
An efficient heuristic approach for security against multiple adversaries |
|
Praveen Paruchuri,
Jonathan P. Pearce,
Milind Tambe,
Fernando Ordonez,
Sarit Kraus
|
|
Article No.: 181 |
|
doi>10.1145/1329125.1329344 |
|
Full text: PDF
|
|
In adversarial multiagent domains, security, commonly defined as the ability to deal with intentional threats from other agents, is a critical issue. This paper focuses on domains where these threats come from unknown adversaries. These domains can be ...
In adversarial multiagent domains, security, commonly defined as the ability to deal with intentional threats from other agents, is a critical issue. This paper focuses on domains where these threats come from unknown adversaries. These domains can be modeled as Bayesian games; much work has been done on finding equilibria for such games. However, it is often the case in multiagent security domains that one agent can commit to a mixed strategy which its adversaries observe before choosing their own strategies. In this case, the agent can maximize reward by finding an optimal strategy, without requiring equilibrium. Previous work has shown this problem of optimal strategy selection to be NP-hard. Therefore, we present a heuristic called ASAP, with three key advantages to address the problem. First, ASAP searches for the highest-reward strategy, rather than a Bayes-Nash equilibrium, allowing it to find feasible strategies that exploit the natural first-mover advantage of the game. Second, it provides strategies which are simple to understand, represent, and implement. Third, it operates directly on the compact, Bayesian game representation, without requiring conversion to normal form. We provide an efficient Mixed Integer Linear Program (MILP) implementation for ASAP, along with experimental results illustrating significant speedups and higher rewards over other approaches. expand
|
|
An agent-based approach for privacy-preserving recommender systems |
|
Richard Cissée,
Sahin Albayrak
|
|
Article No.: 182 |
|
doi>10.1145/1329125.1329345 |
|
Full text: PDF
|
|
Recommender Systems are used in various domains to generate personalized information based on personal user data. The ability to preserve the privacy of all participants is an essential requirement of the underlying Information Filtering architectures, ...
Recommender Systems are used in various domains to generate personalized information based on personal user data. The ability to preserve the privacy of all participants is an essential requirement of the underlying Information Filtering architectures, because the deployed Recommender Systems have to be accepted by privacy-aware users as well as information and service providers. Existing approaches neglect to address privacy in this multilateral way. We have developed an approach for privacy-preserving Recommender Systems based on Multiagent System technology which enables applications to generate recommendations via various filtering techniques while preserving the privacy of all participants. We describe the main modules of our solution as well as an application we have implemented based on this approach. expand
|
|
On the benefits of cheating by self-interested agents in vehicular networks |
|
Raz Lin,
Sarit Kraus,
Yuval Shavitt
|
|
Article No.: 183 |
|
doi>10.1145/1329125.1329346 |
|
Full text: PDF
|
|
As more and more cars are equipped with GPS and Wi Fi transmitters, it becomes easier to design systems that will allow cars to interact autonomously with each other, e.g., regarding traffic on the roads. Indeed, car manufacturers are already equipping ...
As more and more cars are equipped with GPS and Wi Fi transmitters, it becomes easier to design systems that will allow cars to interact autonomously with each other, e.g., regarding traffic on the roads. Indeed, car manufacturers are already equipping their cars with such devices. Though, currently these systems are a proprietary, we envision a natural evolution where agent applications will be developed for vehicular systems, e.g., to improve car routing in dense urban areas. Nonetheless, this new technology and agent applications may lead to the emergence of self-interested car owners, who will care more about their own welfare than the social welfare of their peers. These car owners will try to manipulate their agents such that they transmit false data to their peers. Using a simulation environment, which models a real transportation network in a large city, we demonstrate the benefits achieved by self-interested agents if no counter-measures are implemented. expand
|
|
SESSION: Applications and computational environments: poster papers |
|
|
|
Physics inspired multiagent system for vehicle platooning |
|
Contet Jean-Michel,
Gechter Franck,
Gruer Pablo,
Koukam Abder
|
|
Article No.: 184 |
|
doi>10.1145/1329125.1329348 |
|
Full text: PDF
|
|
Since about two decades, many works have been made in order to provide solutions to the vehicle platoon problem. The main issue related to platoon systems consists in controling the global platoon geometry: inter vehicular distance and trajectory matching. ...
Since about two decades, many works have been made in order to provide solutions to the vehicle platoon problem. The main issue related to platoon systems consists in controling the global platoon geometry: inter vehicular distance and trajectory matching. Another important aspect is related to platoon's evolution, mainly by vehicle merging and splitting. This paper presents a reactive multiagent solution aimed at providing distributed control of vehicle platoons with train configuration. This solution is based on a physics inspired interaction model in which every vehicle interacts only with the preceding one. Platoon stability emerges as a global result of the individual interactions. Furthermore, the adaptation to different kind of vehicles is made by tuning model's physical parameters. Simulation experiments have been made in order to compare our proposal with impedance control model. The experiments have been designed in order to evaluate trajectory matching abilities and merge/split capabilities. Experiments with on-wheels small-robots have also contributed to the validation of our approach. expand
|
|
Towards valuation-aware agent-based traffic control |
|
Heiko Schepperle,
Klemens Böhm,
Simone Forster
|
|
Article No.: 185 |
|
doi>10.1145/1329125.1329349 |
|
Full text: PDF
|
|
Traffic authorities work hard to improve resource utilization in traffic. But these efforts do not consider that the valuations of waiting time can be different for each driver, e.g., a driver of a courier service delivering express mail typically has ...
Traffic authorities work hard to improve resource utilization in traffic. But these efforts do not consider that the valuations of waiting time can be different for each driver, e.g., a driver of a courier service delivering express mail typically has a higher valuation of reduced waiting time than other motorists. We propose that traffic-control mechanisms should be valuation-aware and propose and describe such a mechanism called Time-Slot Exchange. The idea of this mechanism is that vehicles are assigned time slots to cross the intersection, and vehicles, or, more specifically, agent-based driver-assistance systems of the vehicles can then trade these time slots. Simulations show that our mechanism increases overall satisfaction considerably, compared to a state-of-the-art traffic-control mechanism. expand
|
|
SESSION: Perceptual and embedded agents: full papers |
|
|
|
IFSA: incremental feature-set augmentation for reinforcement learning tasks |
|
Mazda Ahmadi,
Matthew E. Taylor,
Peter Stone
|
|
Article No.: 186 |
|
doi>10.1145/1329125.1329351 |
|
Full text: PDF
|
|
Reinforcement learning is a popular and successful framework for many agent-related problems because only limited environmental feedback is necessary for learning. While many algorithms exist to learn effective policies in such problems, learning is ...
Reinforcement learning is a popular and successful framework for many agent-related problems because only limited environmental feedback is necessary for learning. While many algorithms exist to learn effective policies in such problems, learning is often used to solve real world problems, which typically have large state spaces, and therefore suffer from the "curse of dimensionality." One effective method for speeding-up reinforcement learning algorithms is to leverage expert knowledge. In this paper, we propose a method for dynamically augmenting the agent's feature set in order to speed up value-function-based reinforcement learning. The domain expert divides the feature set into a series of subsets such that a novel problem concept can be learned from each successive subset. Domain knowledge is also used to order the feature subsets in order of their importance for learning. Our algorithm uses the ordered feature subsets to learn tasks significantly faster than if the entire feature set is used from the start. Incremental Feature-Set Augmentation (IFSA) is fully implemented and tested in three different domains: Gridworld, Blackjack and RoboCup Soccer Keepaway. All experiments show that IFSA can significantly speed up learning and motivates the applicability of this novel RL method. expand
|
|
On discovery and learning of models with predictive representations of state for agents with continuous actions and observations |
|
David Wingate,
Satinder Singh
|
|
Article No.: 187 |
|
doi>10.1145/1329125.1329352 |
|
Full text: PDF
|
|
Models of agent-environment interaction that use predictive state representations (PSRs) have mainly focused on the case of discrete observations and actions. The theory of discrete PSRs uses an elegant construct called the system dynamics matrix and ...
Models of agent-environment interaction that use predictive state representations (PSRs) have mainly focused on the case of discrete observations and actions. The theory of discrete PSRs uses an elegant construct called the system dynamics matrix and derives the notion of predictive state as a sufficient statistic via the rank of the matrix. With continuous observations and actions, such a matrix and its rank no longer exist. In this paper, we show how to define an analogous construct for the continuous case, called the system dynamics distributions, and use information theoretic notions to define a sufficient statistic and thus state. Given this new construct, we use kernel density estimation to learn approximate system dynamics distributions from data, and use information-theoretic tools to derive algorithms for discovery of state and learning of model parameters. We illustrate our new modeling method on two example problems. expand
|
|
Speeding up moving-target search |
|
Sven Koenig,
Maxim Likhachev,
Xiaoxun Sun
|
|
Article No.: 188 |
|
doi>10.1145/1329125.1329353 |
|
Full text: PDF
|
|
In this paper, we study moving-target search, where an agent (= hunter) has to catch a moving target (= prey). The agent does not necessarily know the terrain initially but can observe it within a certain sensor range around itself. It uses the strategy ...
In this paper, we study moving-target search, where an agent (= hunter) has to catch a moving target (= prey). The agent does not necessarily know the terrain initially but can observe it within a certain sensor range around itself. It uses the strategy to always move on a shortest presumed unblocked path toward the target, which is a reasonable strategy for computer-controlled characters in video games. We study how the agent can find such paths faster by exploiting the fact that it performs A* searches repeatedly. To this end, we extend Adaptive A*, an incremental heuristic search method, to moving-target search and demonstrate experimentally that the resulting MT-Adaptive A* is faster than isolated A* searches and, in many situations, also D* Lite, a state-of-the-art incremental heuristic search method. In particular, it is faster than D* Lite by about one order of magnitude for moving-target search in known and initially unknown mazes if both search methods use the same informed heuristics. expand
|
|
Towards using multiple cues for robust object recognition |
|
Sarah Aboutalib,
Manuela Veloso
|
|
Article No.: 189 |
|
doi>10.1145/1329125.1329354 |
|
Full text: PDF
|
|
A robot's ability to assist humans in a variety of tasks, e.g. in search and rescue or in a household, heavily depends on the robot's reliable recognition of the objects in the environment. Numerous approaches attempt to recognize objects based only ...
A robot's ability to assist humans in a variety of tasks, e.g. in search and rescue or in a household, heavily depends on the robot's reliable recognition of the objects in the environment. Numerous approaches attempt to recognize objects based only on the robot's vision. However, the same type of object can have very different visual appearances, such as shape, size, pose, and color. Although such approaches are widely studied with relative success, the general object recognition task still remains very challenging. We build our work upon the fact that robots can observe humans interacting with the objects in their environment, and thus providing numerous non-visual cues to those objects' identities. We research on a flexible object recognition approach which can use any multiple cues, whether they are visual cues intrinsic to the object or provided by observation of a human. We realize the challenging issue that multiple cues can have different weight in their association with an object definition and need to be taken into account during recognition. In this paper, we contribute a probabilistic relational representation of the cue weights and an object recognition algorithm that can flexibly combine multiple cues of any type to robustly recognize objects. We show illustrative results of our implemented approach using visual, activity, gesture, and speech cues, provided by machine or human, to recognize objects more robustly than when using only a single cue. expand
|
|
SESSION: Mechanism design and game theory: full papers |
|
|
|
A computational characterization of multiagent games with fallacious rewards |
|
Ariel D. Procaccia,
Jeffrey S. Rosenschein
|
|
Article No.: 190 |
|
doi>10.1145/1329125.1329356 |
|
|
|
A near-optimal strategy for a heads-up no-limit Texas Hold'em poker tournament |
|
Peter Bro Miltersen,
Troels Bjerre Sørensen
|
|
Article No.: 191 |
|
doi>10.1145/1329125.1329357 |
|
Full text: PDF
|
|
We analyze a heads-up no-limit Texas Hold'em poker tournament with a fixed small blind of 300 chips, a fixed big blind of 600 chips and a total amount of 8000 chips on the table (until recently, these parameters defined the heads-up endgame of sit-n-go ...
We analyze a heads-up no-limit Texas Hold'em poker tournament with a fixed small blind of 300 chips, a fixed big blind of 600 chips and a total amount of 8000 chips on the table (until recently, these parameters defined the heads-up endgame of sit-n-go tournaments on the popular Party-Poker.com online poker site). Due to the size of this game, a computation of an optimal (i.e. minimax) strategy for the game is completely infeasible. However, combining an algorithm due to Koller, Megiddo and von Stengel with concepts of Everett and suggestions of Sklansky, we compute an optimal jam/fold strategy, i.e. a strategy that would be optimal if any bet made by the player playing by the strategy (but not bets of his opponent) had to be his entire stack. Our computations establish that the computed strategy is near-optimal for the unrestricted tournament (i.e., with post-flop play being allowed) in the rigorous sense that a player playing by the computed strategy will win the tournament with a probability within 1.4 percentage points of the probability that an optimal strategy (allowing post-flop play) would give. expand
|
|
Better automated abstraction techniques for imperfect information games, with application to Texas Hold'em poker |
|
Andrew Gilpin,
Tuomas Sandholm
|
|
Article No.: 192 |
|
doi>10.1145/1329125.1329358 |
|
Full text: PDF
|
|
We present new approximation methods for computing game-theoretic strategies for sequential games of imperfect information. At a high level, we contribute two new ideas. First, we introduce a new state-space abstraction algorithm. In each round of the ...
We present new approximation methods for computing game-theoretic strategies for sequential games of imperfect information. At a high level, we contribute two new ideas. First, we introduce a new state-space abstraction algorithm. In each round of the game, there is a limit to the number of strategically different situations that an equilibrium-finding algorithm can handle. Given this constraint, we use clustering to discover similar positions, and we compute the abstraction via an integer program that minimizes the expected error at each stage of the game. Second, we present a method for computing the leaf payoffs for a truncated version of the game by simulating the actions in the remaining portion of the game. This allows the equilibrium-finding algorithm to take into account the entire game tree while having to explicitly solve only a truncated version. Experiments show that each of our two new techniques improves performance dramatically in Texas Hold'em poker. The techniques lead to a drastic improvement over prior approaches for automatically generating agents, and our agent plays competitively even against the best agents overall. expand
|
|
Empirical game-theoretic analysis of the TAC Supply Chain game |
|
Patrick R. Jordan,
Christopher Kiekintveld,
Michael P. Wellman
|
|
Article No.: 193 |
|
doi>10.1145/1329125.1329359 |
|
Full text: PDF
|
|
The TAC Supply Chain Management (TAC/SCM) game presents a challenging dynamic environment for autonomous decision-making in a salient application domain. Strategic interactions complicate the analysis of games such as TAC/SCM. since the effectiveness ...
The TAC Supply Chain Management (TAC/SCM) game presents a challenging dynamic environment for autonomous decision-making in a salient application domain. Strategic interactions complicate the analysis of games such as TAC/SCM. since the effectiveness of a given strategy depends on the strategies played by other agents on the supply chain. The TAC tournament generates results from one particular path of combinations, and success in the tournament is rightly regarded as evidence for agent quality. Such results along with post-competition controlled experiments provide useful evaluations of novel techniques employed in the game. We argue that a broader game-theoretic analysis framework can provide a firmer foundation for choice of experimental contexts. Exploiting a repository of agents from the 2005 and 2006 TAC/SCM tournaments, we demonstrate an empirical game-theoretic methodology based on extensive simulation and careful measurement. Our analysis of agents from TAC-05 reveals interesting interactions not seen in the tournament. Extending the analysis to TAC-06 enables us to measure progress from year-to-year, and generates a candidate empirical equilibrium among the best known strategies. We use this equilibrium as a stable background population for comparing relative performance of the 2006 agents, yielding insights complementing the tournament results. expand
|
|
SESSION: Mechanism design and game theory: poster papers |
|
|
|
An adaptive strategy for minority games |
|
Ka-man Lam,
Ho-fung Leung
|
|
Article No.: 194 |
|
doi>10.1145/1329125.1329361 |
|
Full text: PDF
|
|
Many real life situations, like the financial market, auctions and resources competitions, can be modeled as Minority Games. In minority games, players choose to join one of the two sides, A or B. The players are rewarded if they have joined ...
Many real life situations, like the financial market, auctions and resources competitions, can be modeled as Minority Games. In minority games, players choose to join one of the two sides, A or B. The players are rewarded if they have joined the minority side, and punished otherwise. A traditional way to play in the minority games is to use predictors to decide which side to join. A predictor predicts the winning side in the next time step given a history of winning sides in previous time steps. In this paper, we introduce Behavioral Predictors and Adaptive Strategies for the minority game, with which players perform much better than those using previous models. expand
|
|
Routing games with an unknown set of active players |
|
Itai Ashlagi,
Dov Monderer,
Moshe Tennenholtz
|
|
Article No.: 195 |
|
doi>10.1145/1329125.1329362 |
|
Full text: PDF
|
|
In many settings there exists a set of potential participants, but the set of participants who are actually active in the system, and in particular their number, is unknown. This topic has been first analyzed by Ashlagi, Monderer, and Tennenholtz [AMT] ...
In many settings there exists a set of potential participants, but the set of participants who are actually active in the system, and in particular their number, is unknown. This topic has been first analyzed by Ashlagi, Monderer, and Tennenholtz [AMT] in the context of simple routing games, where the network consists of a set of parallel links, and the agents can not split their jobs among different paths. AMT used the model of pre-Bayesian games, and the concept of safety-level equilibrium for the analysis of these games. In this paper we extend the work by AMT. We deal with splitable routing games, where each player can split his job among paths in a given network. In this context we generalize the analysis to all two-node networks, in which paths may intersect in unrestricted manner. We characterize the relationships between the number of potential participants and the number of active participants under which ignorance is beneficial to each of the active participants. expand
|
|
Requirements driven agent collaboration |
|
Liwei Zheng,
Zhi Jin
|
|
Article No.: 196 |
|
doi>10.1145/1329125.1329363 |
|
Full text: PDF
|
|
This paper proposes the requirements driven agent collaboration. This proposal assumes that there are plenty different service agents distributed in Internet. When a request for accomplishing a particular task occurs, these autonomous agents can recognize ...
This paper proposes the requirements driven agent collaboration. This proposal assumes that there are plenty different service agents distributed in Internet. When a request for accomplishing a particular task occurs, these autonomous agents can recognize the newly emergent requirements and dynamically aggregate together to compete with others for fulfilling the requirements. This paper presents a preliminary framework for the requirement driven agent collaboration based on the automated mechanism design. expand
|
|
Coalition structure generation with worst case guarantees based on cardinality structure |
|
She-Xiong Su,
Shan-Li Hu,
Chun-Yi Shi
|
|
Article No.: 197 |
|
doi>10.1145/1329125.1329364 |
|
Full text: PDF
|
|
Coalition formation is a key topic in multiagent systems. One may prefer a coalition structure that maximizes the sum of the values of the coalitions, but often the number of coalition structures is too large to allow exhaustive search for the optimal ...
Coalition formation is a key topic in multiagent systems. One may prefer a coalition structure that maximizes the sum of the values of the coalitions, but often the number of coalition structures is too large to allow exhaustive search for the optimal one. But then, can the coalition structure found via a partial search be guaranteed to be within a bound from optimum? Sandholm et al. showed that it suffices to search the lowest two levels of the coalition structure graph in order to establish a worst case bound K(n). Dang et al. presented an algorithm that takes a step further to search those coalition structures whose biggest coalition's cardinality is greater than or equal to ⌈n(k − 1)/(k + 1)⌉, which is the best result known so far. Against this background, this paper reports on a novel anytime algorithm based on cardinality structure that only have to take a step further to search those coalition structures whose cardinality structure is in the CCS(n, b). Consequently, the algorithm reported in this paper is obviously better than that of Sandholm et al. (up to 1035 times faster when n=100, K=2) and Dang et al (up to 1018 times faster when n=100, K=3). expand
|
|
Some results on approximating the minimax solution in approval voting |
|
Rob LeGrand,
Evangelos Markakis,
Aranyak Mehta
|
|
Article No.: 198 |
|
doi>10.1145/1329125.1329365 |
|
Full text: PDF
|
|
Voting has been a very popular method for preference aggregation in multiagent environments. It is often the case that a set of agents with different preferences need to make a choice among a set of alternatives, where the alternatives could be various ...
Voting has been a very popular method for preference aggregation in multiagent environments. It is often the case that a set of agents with different preferences need to make a choice among a set of alternatives, where the alternatives could be various entities such as potential committee members, or joint plans of action. A standard methodology for this scenario is to have each agent express his preferences and then select an alternative according to some voting protocol. Several decision making applications in AI have followed this approach including problems in collaborative filtering [10] and planning [3, 4]. expand
|
|
SESSION: Cooperative distributed problem solving: full papers |
|
|
|
A globally optimal algorithm for TTD-MDPs |
|
Sooraj Bhat,
David L. Roberts,
Mark J. Nelson,
Charles L. Isbell,
Michael Mateas
|
|
Article No.: 199 |
|
doi>10.1145/1329125.1329367 |
|
Full text: PDF
|
|
In this paper, we discuss the use of Targeted Trajectory Distribution Markov Decision Processes (TTD-MDPs)---a variant of MDPs in which the goal is to realize a specified distribution of trajectories through a state space---as a general agent-coordination ...
In this paper, we discuss the use of Targeted Trajectory Distribution Markov Decision Processes (TTD-MDPs)---a variant of MDPs in which the goal is to realize a specified distribution of trajectories through a state space---as a general agent-coordination framework. We present several advances to previous work on TTD-MDPs. We improve on the existing algorithm for solving TTD-MDPs by deriving a greedy algorithm that finds a policy that provably minimizes the global KL-divergence from the target distribution. We test the new algorithm by applying TTD-MDPs to drama management, where a system must coordinate the behavior of many agents to ensure that a game follows a coherent storyline, is in keeping with the author's desires, and offers a high degree of replayability. Although we show that suboptimal greedy strategies will fail in some cases, we validate previous work that suggests that they can work well in practice. We also show that our new algorithm provides guaranteed accuracy even in those cases, with little additional computational cost. Further, we illustrate how this new approach can be applied online, eliminating the memory-intensive offline sampling necessary in the previous approach. expand
|
|
A Q-decomposition and bounded RTDP approach to resource allocation |
|
Pierrick Plamondon,
Brahim Chaib-draa,
Abder Rezak Benaskeur
|
|
Article No.: 200 |
|
doi>10.1145/1329125.1329368 |
|
Full text: PDF
|
|
This paper contributes to solve effectively stochastic resource allocation problems known to be NP-Complete. To address this complex resource management problem, a Q-decomposition approach is proposed when the resources which are already shared among ...
This paper contributes to solve effectively stochastic resource allocation problems known to be NP-Complete. To address this complex resource management problem, a Q-decomposition approach is proposed when the resources which are already shared among the agents, but the actions made by an agent may influence the reward obtained by at least another agent. The Q-decomposition allows to coordinate these reward separated agents and thus permits to reduce the set of states and actions to consider. On the other hand, when the resources are available to all agents, no Q-decomposition is possible and we use heuristic search. In particular, the bounded Real-time Dynamic Programming (bounded RTDP) is used. Bounded RTDP concentrates the planning on significant states only and prunes the action space. The pruning is accomplished by proposing tight upper and lower bounds on the value function. expand
|
|
Combinatorial resource scheduling for multiagent MDPs |
|
Dmitri A. Dolgov,
Michael R. James,
Michael E. Samples
|
|
Article No.: 201 |
|
doi>10.1145/1329125.1329369 |
|
Full text: PDF
|
|
Optimal resource scheduling in multiagent systems is a computationally challenging task, particularly when the values of resources are not additive. We consider the combinatorial problem of scheduling the usage of multiple resources among agents that ...
Optimal resource scheduling in multiagent systems is a computationally challenging task, particularly when the values of resources are not additive. We consider the combinatorial problem of scheduling the usage of multiple resources among agents that operate in stochastic environments, modeled as Markov decision processes (MDPs). In recent years, efficient resource-allocation algorithms have been developed for agents with resource values induced by MDPs. However, this prior work has focused on static resource-allocation problems where resources are distributed once and then utilized in infinite-horizon MDPs. We extend those existing models to the problem of combinatorial resource scheduling, where agents persist only for finite periods between their (predefined) arrival and departure times, requiring resources only for those time periods. We provide a computationally efficient procedure for computing globally optimal resource assignments to agents over time. We illustrate and empirically analyze the method in the context of a stochastic job-scheduling domain. expand
|
|
Organizational self-design in semi-dynamic environments |
|
Sachin Kamboj,
Keith S. Decker
|
|
Article No.: 202 |
|
doi>10.1145/1329125.1329370 |
|
Full text: PDF
|
|
Organizations are an important basis for coordination in multiagent systems. However, there is no best way to organize and all ways of organizing are not equally effective. Attempting to optimize an organizational structure depends strongly on environmental ...
Organizations are an important basis for coordination in multiagent systems. However, there is no best way to organize and all ways of organizing are not equally effective. Attempting to optimize an organizational structure depends strongly on environmental features including problem characteristics, available resources, and agent capabilities. If the environment is dynamic, the environmental conditions or the problem task structure may change over time. This precludes the use of static, design-time generated, organizational structures in such systems. On the other hand, for many real environments, the problems are not totally unique either: certain characteristics and conditions change slowly, if at all, and these can have an important effect in creating stable organizational structures. Organizational-Self Design (OSD) has been proposed as an approach for constructing suitable organizational structures at runtime. We extend the existing OSD approach to include worth-oriented domains, model other resources in addition to only processor resources and build in robustness into the organization. We then evaluate our approach against the contract-net approach and show that our OSD agents perform better, are more efficient, and more flexible to changes in the environment. expand
|
|
SESSION: Cooperative distributed problem solving: poster papers |
|
|
|
DisLRPɑ: ɑ-approximation in generalized mutual assignment |
|
Katsutoshi Hirayama
|
|
Article No.: 203 |
|
doi>10.1145/1329125.1329372 |
|
Full text: PDF
|
|
This paper presents a new distributed solution protocol, called DisLRPα, for the Generalized Mutual Assignment Problem (GMAP). The GMAP is a typical distributed combinatorial optimization problem whose goal is to maximize social ...
This paper presents a new distributed solution protocol, called DisLRPα, for the Generalized Mutual Assignment Problem (GMAP). The GMAP is a typical distributed combinatorial optimization problem whose goal is to maximize social welfare of the agents. Unlike the previous protocol for the GMAP, DisLRPα can provide a theoretical guarantee on global solution quality. In DisLRPα, as with in the previous protocol, the agents repeatedly solve their local problems while coordinating their local solutions using a distributed constraint satisfaction technique. The key difference is that, in DisLRPα, each agent is required to produce a feasible solution whose local objective value is not lower than α (0 < α ≤ 1) times the local optimal value. Our experimental results on benchmark problem instances show that DisLRPα can certainly find a solution whose global objective value is higher than that theoretically guaranteed. Furthermore, they also show that, while spending extra communication and computation costs, DisLRPα can produce a significantly better solution than the previous protocol if we set α appropriately. expand
|
|
A swarm based approximated algorithm to the extended generalized assignment problem (E-GAP) |
|
Paulo R. Ferreira, Jr.,
Felipe S. Boffo,
Ana L. C. Bazzan
|
|
Article No.: 204 |
|
doi>10.1145/1329125.1329373 |
|
Full text: PDF
|
|
This paper addresses distributed task allocation in complex scenarios modeled using the distributed constraint optimization problem (DCOP) formalism. We propose and evaluate a novel algorithm for distributed task allocation based on theoretical models ...
This paper addresses distributed task allocation in complex scenarios modeled using the distributed constraint optimization problem (DCOP) formalism. We propose and evaluate a novel algorithm for distributed task allocation based on theoretical models of division of labor in social insect colonies, called Swarm-GAP. Swarm-GAP was experimented in an abstract centralized simulation environment and in the RoboCup Rescue Simulator. We show that Swarm-GAP achieves similar results to other recent proposed algorithm with a dramatic reduction in communication and computation. Thus, our approach is highly scalable regarding both the number of agents and tasks. expand
|
|
A statistical decision-making model for choosing among multiple alternatives |
|
Shulamit Reches,
Shavit Talman,
Sarit Kraus
|
|
Article No.: 205 |
|
doi>10.1145/1329125.1329374 |
|
Full text: PDF
|
|
Automated agents often have several alternatives to choose from in order to solve a problem. Usually the agent does not know in advance which alternative is the best one, so some exploration is required. However, in most cases there is a cost associated ...
Automated agents often have several alternatives to choose from in order to solve a problem. Usually the agent does not know in advance which alternative is the best one, so some exploration is required. However, in most cases there is a cost associated with exploring the domain, which must be minimized in order to be worthwhile. We concentrate on cases where the agent has some prior knowledge about each alternative, which is expressed in terms of units of information. A unit of information about an alternative is the result of choosing the alternative - for example, in the e-commerce domain one unit of information can be a customer's impression or feedback about a supplier; in the heuristic domain one unit of information can be the observed result of running one simulation with a given heuristic function. In our environments the agent has a-priori only a small number of units of information about each alternative, and it would like to use this knowledge in deciding between its alternatives. Nevertheless, since the agent has only a limited number of units of information, deciding between the alternatives solely based on these units may be risky. In extreme cases, they can even mislead the agent to choose the worst alternative rather than the best one. expand
|
|
Dynamic task allocation within an open service-oriented MAS architecture |
|
Ivan J. Jureta,
Stephane Faulkner,
Youssef Achbany,
Marco Saerens
|
|
Article No.: 206 |
|
doi>10.1145/1329125.1329375 |
|
Full text: PDF
|
|
A MAS architecture consisting of service centers is proposed. Within each service center, a mediator coordinates service delivery by allocating individual tasks to corresponding task specialist agents depending on their prior performance while anticipating ...
A MAS architecture consisting of service centers is proposed. Within each service center, a mediator coordinates service delivery by allocating individual tasks to corresponding task specialist agents depending on their prior performance while anticipating performance of newly entering agents. By basing mediator behavior on a novel multicriteria-driven (including quality of service, deadline, reputation, cost, and user preferences) reinforcement learning algorithm, integrating the exploitation of acquired knowledge with optimal, undirected, continual exploration, adaptability to changes in agent availability and performance is ensured. The reported experiments indicate the algorithm behaves as expected and outperforms two standard approaches. expand
|
|
Managing the pedigree and quality of information in dynamic information sharing environments |
|
Bin Yu,
Srikanth Kallurkar,
Ganesh Vaidyanathan,
Donald Steiner
|
|
Article No.: 207 |
|
doi>10.1145/1329125.1329376 |
|
Full text: PDF
|
|
The quality of information is crucial for decision making in many mission-critical applications such as battlefield operations and intelligence analysis. However, as the system becomes larger and more diverse, it is becoming increasingly difficult to ...
The quality of information is crucial for decision making in many mission-critical applications such as battlefield operations and intelligence analysis. However, as the system becomes larger and more diverse, it is becoming increasingly difficult to assess the quality of information from various operators or data sources. In this paper we propose an agent-based approach to managing the quality of information, e.g., its trustworthiness, in network centric information sharing environments, where software agents collaborate with each other to automatically represent and assess the trustworthiness of information from its pedigree within the framework of Dempster-Shafer theory. expand
|
|
Understanding decentralised control of resource allocation in a minimal multi-agent system |
|
Mariusz Jacyno,
Seth Bullock,
Terry Payne,
Michael Luck
|
|
Article No.: 208 |
|
doi>10.1145/1329125.1329377 |
|
Full text: PDF
|
|
In response to the advent of new computational infrastructures, a number of initiatives, such as autonomic computing [5] and utility computing [8], have been announced by major IT vendors sharing the same underlying principles of provisioning ...
In response to the advent of new computational infrastructures, a number of initiatives, such as autonomic computing [5] and utility computing [8], have been announced by major IT vendors sharing the same underlying principles of provisioning distributed computational resources to a large number of users "on demand". Since, by their nature, such systems are large, open and dynamic, allocation of resources to users presents unique challenges that threaten to overwhelm existing centralised management approaches [2]. expand
|
|
Developing multi-agent systems with automatic agent generation and dynamic task allocation mechanisms |
|
Xiaoqin Zhang,
Haiping Xu,
Bhavesh Shrestha
|
|
Article No.: 209 |
|
doi>10.1145/1329125.1329378 |
|
Full text: PDF
|
|
Multi-Agent System (MAS) is a suitable programming paradigm for distributed information systems and applications. We have been working on a set of technologies and mechanisms to ease and formalize the development of MAS, and to increase its reliability ...
Multi-Agent System (MAS) is a suitable programming paradigm for distributed information systems and applications. We have been working on a set of technologies and mechanisms to ease and formalize the development of MAS, and to increase its reliability and reuse-ability too. We aim to cover the analysis and modeling, design and implementation phases. The first goal is to separate concerns. We have proposed a three-layered development process to separate the multiple issues in a multi-agent system, while some of them are application-dependent, others are not; some of them are platform-dependent and others are not. We have also aimed to separate the domain knowledge and the intelligent problem-solving capabilities. We adapt a role-based modeling approach, conceptual roles are defined with the domain related knowledge, such as goals, permissions, organizational relationship, and interaction protocols, etc; where agent is a concrete entity equipped with motivations, resources and problem-solving capabilities. expand
|
|
Building coalitions involving agents and humans: reports from agent-based participatory simulations |
|
Paul Guyot,
Shinichi Honiden
|
|
Article No.: 210 |
|
doi>10.1145/1329125.1329379 |
|
Full text: PDF
|
|
Agent-based participatory simulations are laboratory experiments designed like agent-based simulations and where humans access the simulation as software agents. This paper describes the outcomes of six experiments lasting up to two hours each, where ...
Agent-based participatory simulations are laboratory experiments designed like agent-based simulations and where humans access the simulation as software agents. This paper describes the outcomes of six experiments lasting up to two hours each, where human players took part in an iterated game derived from the El Farol bar problem. Agents decide synchronously to go to the bar or to stay home and the benefit depends on the bar attendance, with a threshold effect: it is better to stay home if more than 60% of the agents go. Contrasting with the original version of this problem, we allowed agents, and therefore humans, to communicate before they took their decision. The first two experiments allowed us to train participants and to introduce the notion of teams. Teams represented coalitions within the game and positively affected scoring, but they were not part of an obvious solution to the problem and they did not enforce cooperative behavior in the game. Drawing from these experiments, we designed autonomous agents reproducing strategies of the participants. These agents took part in the last four participatory experiments and we observed the formation of coalitions between agents, between humans and between agents and humans. expand
|
|
Multi-task overlapping coalition parallel formation algorithm |
|
Chao-Feng Lin,
Shan-Li Hu
|
|
Article No.: 211 |
|
doi>10.1145/1329125.1329380 |
|
Full text: PDF
|
|
The issue of coalition formation has been investigated from many aspects, but until recently little attention has been paid to overlapping coalition formation. What's more, an algorithm of multi-task coalition parallel formation hasn't been proposed ...
The issue of coalition formation has been investigated from many aspects, but until recently little attention has been paid to overlapping coalition formation. What's more, an algorithm of multi-task coalition parallel formation hasn't been proposed yet. In this contribution, we adopt the binary Particle Swarm Optimization (PSO) to address the coalition formation problem, taking overlapping coalition and parallelizability into account simultaneously. We develop an algorithm to parallelize the process of overlapping coalition formation, and guarantee the precedence order of tasks at the same time. As for our algorithm, a strategy is introduced to improve its capability of global searching and convergent rate of its solutions. expand
|
|
Agents that collude to evade taxes |
|
Luis Antunes,
João Balsa,
Helder Coelho
|
|
Article No.: 212 |
|
doi>10.1145/1329125.1329381 |
|
Full text: PDF
|
|
We explore the link between micro-level motivations leading to and being influenced by macro-level outcomes to study the complex issue of tax evasion. If it is obvious why there is a benefit for people who evade taxes, it is less obvious why people would ...
We explore the link between micro-level motivations leading to and being influenced by macro-level outcomes to study the complex issue of tax evasion. If it is obvious why there is a benefit for people who evade taxes, it is less obvious why people would pay any taxes at all, given the the small probability of being caught, and the small penalties involved. We use exploratory simulation and progressively deepening models of agents and of simulations to study the reasons behind tax evasion. We have unveiled some relatively simple social mechanisms that can explain the compliance numbers observed in real economies. We claim that simulation with multiple agents provides a strong methodological tool with which to support the design of public policies. expand
|
|
Achieving cooperation among selfish agents in the air traffic management domain using signed money |
|
Geert Jonker,
Frank Dignum,
John-Jules Meyer
|
|
Article No.: 213 |
|
doi>10.1145/1329125.1329382 |
|
Full text: PDF
|
|
We present a monetary system by which selfish agents can cooperate reciprocally. We show that a straight-forward market mechanism can lead to unfair situations when agents misuse key positions. We show that it is not easy to retaliate wrongdoers, as ...
We present a monetary system by which selfish agents can cooperate reciprocally. We show that a straight-forward market mechanism can lead to unfair situations when agents misuse key positions. We show that it is not easy to retaliate wrongdoers, as there is a dominant strategy that deviates from the retaliating strategy. We present a monetary system in which every user can issue money and every user is required to sign each credit it issues or circulates. By using a trust-based credit-valuation function, wrongdoers are retaliated and it is no longer dominant to deviate from the retaliating strategy. expand
|
|
An event-driven approach for agent-based business process enactment |
|
Payal Chakravarty,
Munindar P. Singh
|
|
Article No.: 214 |
|
doi>10.1145/1329125.1329383 |
|
Full text: PDF
|
|
Agents enacting business processes in large open environments need to adaptively accommodate exceptions. Work on multiagent approaches can flexibly model business processes. This paper proposes an event-driven architecture that enriches such models with ...
Agents enacting business processes in large open environments need to adaptively accommodate exceptions. Work on multiagent approaches can flexibly model business processes. This paper proposes an event-driven architecture that enriches such models with events resulting in a more robust and proactive system. Specifically, we place this architecture in a business process framework based on protocols and policies, where agents' behaviors are specified via rules. The contributions of this paper include (1) an event-driven architecture, (2) a specification language that combines event logic with rules and (3) a methodology to incorporate events into a process (such as for fine-grained monitoring), (4) a way to manage subscriptions to simple events efficiently. This approach is applied on a well-known business scenario. expand
|
|
Executing multi-robot cases through a single coordinator |
|
Raquel Ros,
Manuela Veloso
|
|
Article No.: 215 |
|
doi>10.1145/1329125.1329384 |
|
Full text: PDF
|
|
It is challenging to design general robot soccer coordination behaviors that address individual states. We have successfully followed a case-based approach to define behaviors for a single soccer robot. In our multi-robot system we now distinguish retriever ...
It is challenging to design general robot soccer coordination behaviors that address individual states. We have successfully followed a case-based approach to define behaviors for a single soccer robot. In our multi-robot system we now distinguish retriever robots that access the case library, reason about the situation, and select the most appropriate cases. They communicate with the other robots and they all execute the retrieved case in a coordinated way. We evaluate our approach with two robots demonstrating that the robots successfully coordinate and the number of passes during a game highly increases compared to an approach with an implicit coordination mechanism. expand
|
|
Distributed coordination in uncertain multiagent systems |
|
Rajiv T. Maheswaran,
Craig M. Rogers,
Romeo Sanchez
|
|
Article No.: 216 |
|
doi>10.1145/1329125.1329385 |
|
Full text: PDF
|
|
We consider real-time multi-agent coordination in a dynamic and uncertain domain addressing both distributed state information and partial knowledge of the common reward function. The challenge is to find functional strategies when bounded rationality ...
We consider real-time multi-agent coordination in a dynamic and uncertain domain addressing both distributed state information and partial knowledge of the common reward function. The challenge is to find functional strategies when bounded rationality hinders the ability to encompass the values of possible sample paths of the system. This paper discusses a new approach based on assigning agents to monitor portions of the reward structure for which they aggregate and propagate appropriate profiles which compactly represent relevant information used for policy modification. This approach shows promise as an alternate and potentially superior technique with respect to current decision-theoretic and scheduling approaches. expand
|
|
SESSION: Multiagent planning: full papers |
|
|
|
Graphical models for online solutions to interactive POMDPs |
|
Prashant Doshi,
Yifeng Zeng,
Qiongyu Chen
|
|
Article No.: 217 |
|
doi>10.1145/1329125.1329387 |
|
Full text: PDF
|
|
We develop a new graphical representation for interactive partially observable Markov decision processes (I-POMDPs) that is significantly more transparent and semantically clear than the previous representation. These graphical models called interactive ...
We develop a new graphical representation for interactive partially observable Markov decision processes (I-POMDPs) that is significantly more transparent and semantically clear than the previous representation. These graphical models called interactive dynamic influence diagrams (I-DIDs) seek to explicitly model the structure that is often present in real-world problems by decomposing the situation into chance and decision variables, and the dependencies between the variables. I-DIDs generalize DIDs, which may be viewed as graphical representations of POMDPs, to multiagent settings in the same way that I-POMDPs generalize POMDPs. I-DIDs may be used to compute the policy of an agent online as the agent acts and observes in a setting that is populated by other interacting agents. Using several examples, we show how I-DIDs may be applied and demonstrate their usefulness. expand
|
|
Letting loose a SPIDER on a network of POMDPs: generating quality guaranteed policies |
|
Pradeep Varakantham,
Janusz Marecki,
Yuichi Yabu,
Milind Tambe,
Makoto Yokoo
|
|
Article No.: 218 |
|
doi>10.1145/1329125.1329388 |
|
Full text: PDF
|
|
Distributed Partially Observable Markov Decision Problems (Distributed POMDPs) are a popular approach for modeling multi-agent systems acting in uncertain domains. Given the significant complexity of solving distributed POMDPs, particularly as we scale ...
Distributed Partially Observable Markov Decision Problems (Distributed POMDPs) are a popular approach for modeling multi-agent systems acting in uncertain domains. Given the significant complexity of solving distributed POMDPs, particularly as we scale up the numbers of agents, one popular approach has focused on approximate solutions. Though this approach is efficient, the algorithms within this approach do not provide any guarantees on solution quality. A second less popular approach focuses on global optimality, but typical results are available only for two agents, and also at considerable computational cost. This paper overcomes the limitations of both these approaches by providing SPIDER, a novel combination of three key features for policy generation in distributed POMDPs: (i) it exploits agent interaction structure given a network of agents (i.e. allowing easier scale-up to larger number of agents); (ii) it uses a combination of heuristics to speedup policy search; and (iii) it allows quality guaranteed approximations, allowing a systematic tradeoff of solution quality for time. Experimental results show orders of magnitude improvement in performance when compared with previous global optimal algorithms. expand
|
|
On opportunistic techniques for solving decentralized Markov decision processes with temporal constraints |
|
Janusz Marecki,
Milind Tambe
|
|
Article No.: 219 |
|
doi>10.1145/1329125.1329389 |
|
Full text: PDF
|
|
Decentralized Markov Decision Processes (DEC-MDPs) are a popular model of agent-coordination problems in domains with uncertainty and time constraints but very difficult to solve. In this paper, we improve a state-of-the-art heuristic solution method ...
Decentralized Markov Decision Processes (DEC-MDPs) are a popular model of agent-coordination problems in domains with uncertainty and time constraints but very difficult to solve. In this paper, we improve a state-of-the-art heuristic solution method for DEC-MDPs, called OC-DEC-MDP, that has recently been shown to scale up to larger DEC-MDPs. Our heuristic solution method, called Value Function Propagation (VFP), combines two orthogonal improvements of OC-DEC-MDP. First, it speeds up OC-DEC-MDP by an order of magnitude by maintaining and manipulating a value function for each state (as a function of time) rather than a separate value for each pair of sate and time interval. Furthermore, it achieves better solution qualities than OC-DEC-MDP because, as our analytical results show, it does not overestimate the expected total reward like OC-DEC- MDP. We test both improvements independently in a crisis-management domain as well as for other types of domains. Our experimental results demonstrate a significant speedup of VFP over OC-DEC-MDP as well as higher solution qualities in a variety of situations. expand
|
|
Q-value functions for decentralized POMDPs |
|
Frans A. Oliehoek,
Nikos Vlassis
|
|
Article No.: 220 |
|
doi>10.1145/1329125.1329390 |
|
Full text: PDF
|
|
Planning in single-agent models like MDPs and POMDPs can be carried out by resorting to Q-value functions: a (near-) optimal Q-value function is computed in a recursive manner by dynamic programming, and then a policy is extracted from this value function. ...
Planning in single-agent models like MDPs and POMDPs can be carried out by resorting to Q-value functions: a (near-) optimal Q-value function is computed in a recursive manner by dynamic programming, and then a policy is extracted from this value function. In this paper we study whether similar Q-value functions can be defined in decentralized POMDP models (Dec-POMDPs), what the cost of computing such value functions is, and how policies can be extracted from such value functions. Using the framework of Bayesian games, we argue that searching for the optimal Q-value function may be as costly as exhaustive policy search. Then we analyze various approximate Q-value functions that allow efficient computation. Finally, we describe a family of algorithms for extracting policies from such Q-value functions. expand
|
|
SESSION: Multiagent planning: poster papers |
|
|
|
Utility-based plan recognition: an extended abstract |
|
Dorit Avrahami-Zilberbrand,
Gal A. Kaminka
|
|
Article No.: 221 |
|
doi>10.1145/1329125.1329392 |
|
Full text: PDF
|
|
Plan recognition is the process of inferring other agents' plans and goals based on their observable actions. Essentially all previous work in plan recognition has focused on the recognition process itself, with no regard to the use of the information ...
Plan recognition is the process of inferring other agents' plans and goals based on their observable actions. Essentially all previous work in plan recognition has focused on the recognition process itself, with no regard to the use of the information in the recognizing agent. As a result, low-likelihood recognition hypotheses that may imply significant meaning to the observer, are ignored in existing work. In this paper, we present novel efficient algorithms that allows the observer to incorporate her own biases and preferences---in the form of a utility function---into the plan recognition process. This allows choosing recognition hypotheses based on their expected utility to the observer. We call this Utility-based Plan Recognition (UPR). We briefly discuss a hybrid symbolic decision-theoretic plan recognizer, and demonstrate the efficacy of this approach in an example. expand
|
|
Planning and defeasible reasoning |
|
Diego R. Garcia,
Alejandro J. Garcia,
Guillermo R. Simari
|
|
Article No.: 222 |
|
doi>10.1145/1329125.1329393 |
|
Full text: PDF
|
|
We present an argumentation-based formalism that an agent could use for constructing plans. We will analyze the interaction of arguments and actions when they are combined to construct plans using Partial Order Planning techniques.
We present an argumentation-based formalism that an agent could use for constructing plans. We will analyze the interaction of arguments and actions when they are combined to construct plans using Partial Order Planning techniques. expand
|
|
Distributed intrusion detection in partially observable Markov decision processes |
|
Doran Chakraborty,
Sandip Sen
|
|
Article No.: 223 |
|
doi>10.1145/1329125.1329394 |
|
Full text: PDF
|
|
The problem of decentralized control occurs frequently in realistic domains where agents have to cooperate to achieve a universal goal. Planning for domain-level joint strategy takes into account the uncertainty of the underlying environment in computing ...
The problem of decentralized control occurs frequently in realistic domains where agents have to cooperate to achieve a universal goal. Planning for domain-level joint strategy takes into account the uncertainty of the underlying environment in computing near-optimal joint-strategies that can handle the intrinsic domain uncertainty. However, uncertainty related to agents deviating from the recommended joint-policy is not taken into consideration. We focus on hostile domains, where the goal is to quickly identify deviations from planned behavior by any compromised agents. There is a growing need to develop techniques that enable the system to recognize and recover from such deviations. We discuss the problem from the intruder's perspective and then present a distributed intrusion detection scheme that can detect a particular type of attack. expand
|
|
Subjective approximate solutions for decentralized POMDPs |
|
Anton Chechetka,
Katia Sycara
|
|
Article No.: 224 |
|
doi>10.1145/1329125.1329395 |
|
Full text: PDF
|
|
A problem of planning for cooperative teams under uncertainty is a crucial one in multiagent systems. Decentralized partially observable Markov decision processes (DEC-POMDPs) provide a convenient, but intractable model for specifying planning problems ...
A problem of planning for cooperative teams under uncertainty is a crucial one in multiagent systems. Decentralized partially observable Markov decision processes (DEC-POMDPs) provide a convenient, but intractable model for specifying planning problems in cooperative teams. Compared to the single-agent case, an additional challenge is posed by the lack of free communication between the teammates. We argue, that acting close to optimally in a team involves a tradeoff between opportunistically taking advantage of agent's local observations and being predictable for the teammates. We present a more opportunistic version of an existing approximate algorithm for DEC-POMDPs and investigate the tradeoff. Preliminary evaluation shows that in certain settings oportunistic modification provides significantly better performance. expand
|
|
Modeling plan coordination in multiagent decision processes |
|
Ping Xuan
|
|
Article No.: 225 |
|
doi>10.1145/1329125.1329396 |
|
Full text: PDF
|
|
In multiagent planning, it is often convenient to view a problem as two subproblems: agent local planning and coordination. Thus, we can classify agent activities into two categories: agent local problem solving activities and coordination activities, ...
In multiagent planning, it is often convenient to view a problem as two subproblems: agent local planning and coordination. Thus, we can classify agent activities into two categories: agent local problem solving activities and coordination activities, with each category of activities addressing the corresponding subproblem. However, recent mathematical models, such as decentralized Markov decision processes (DEC-MDP) and partially observable Markov decision processes (DEC-POMDP), view the problem as a single decision process and do not make the distinctions between agent local planning and coordination. In this paper, we present a synergistic representation that brings these two views together, and show that these two views are equivalent. Under this representation, traditional plan coordination mechanisms can be conveniently modeled and interpreted as approximation methods for solving the decision processes. expand
|
|
SESSION: Ontologies: full papers |
|
|
|
A formal model for situated semantic alignment |
|
Manuel Atencia,
Marco Schorlemmer
|
|
Article No.: 226 |
|
doi>10.1145/1329125.1329398 |
|
Full text: PDF
|
|
Ontology matching is currently a key technology to achieve the semantic alignment of ontological entities used by knowledge-based applications, and therefore to enable their interoperability in distributed environments such as multi-agent systems. Most ...
Ontology matching is currently a key technology to achieve the semantic alignment of ontological entities used by knowledge-based applications, and therefore to enable their interoperability in distributed environments such as multi-agent systems. Most ontology matching mechanisms, however, assume matching prior integration and rely on semantics that has been coded a priori in concept hierarchies or external sources. In this paper, we present a formal model for a semantic alignment procedure that incrementally aligns differing conceptualisations of two or more agents relative to their respective perception of the environment or domain they are acting in. It hence makes the situation in which the alignment occurs explicit in the model. We resort to Channel Theory to carry out the formalisation. expand
|
|
A multi-agent system for building dynamic ontologies |
|
Kévin Ottens,
Marie-Pierre Gleizes,
Pierre Glize
|
|
Article No.: 227 |
|
doi>10.1145/1329125.1329399 |
|
Full text: PDF
|
|
Ontologies building from text is still a time-consuming task which justifies the growth of Ontology Learning. Our system named Dynamo is designed along this domain but following an original approach based on an adaptive multi-agent architecture. ...
Ontologies building from text is still a time-consuming task which justifies the growth of Ontology Learning. Our system named Dynamo is designed along this domain but following an original approach based on an adaptive multi-agent architecture. In this paper we present a distributed hierarchical clustering algorithm, core of our approach. It is evaluated and compared to a more conventional centralized algorithm. We also present how it has been improved using a multi-criteria approach. With those results in mind, we discuss the limits of our system and add as perspectives the modifications required to reach a complete ontology building solution. expand
|
|
Argumentation over ontology correspondences in MAS |
|
Loredana Laera,
Ian Blacoe,
Valentina Tamma,
Terry Payne,
Jerôme Euzenat,
Trevor Bench-Capon
|
|
Article No.: 228 |
|
doi>10.1145/1329125.1329400 |
|
Full text: PDF
|
|
In order to support semantic interoperation in open environments, where agents can dynamically join or leave and no prior assumption can be made on the ontologies to align, the different agents involved need to agree on the semantics of the terms used ...
In order to support semantic interoperation in open environments, where agents can dynamically join or leave and no prior assumption can be made on the ontologies to align, the different agents involved need to agree on the semantics of the terms used during the interoperation. Reaching this agreement can only come through some sort of negotiation process. Indeed, agents will differ in the domain ontologies they commit to; and their perception of the world, and hence the choice of vocabulary used to represent concepts. We propose an approach for supporting the creation and exchange of different arguments, that support or reject possible correspondences. Each agent can decide, according to its preferences, whether to accept or refuse a candidate correspondence. The proposed framework considers arguments and propositions that are specific to the matching task and are based on the ontology semantics. This argumentation framework relies on a formal argument manipulation schema and on an encoding of the agents' preferences between particular kinds of arguments. expand
|
|
Learning consumer preferences using semantic similarity |
|
Reyhan Aydoğan,
Pinar Yolum
|
|
Article No.: 229 |
|
doi>10.1145/1329125.1329401 |
|
Full text: PDF
|
|
In online, dynamic environments, the services requested by consumers may not be readily served by the providers. This requires the service consumers and providers to negotiate their service needs and offers. Multiagent negotiation approaches typically ...
In online, dynamic environments, the services requested by consumers may not be readily served by the providers. This requires the service consumers and providers to negotiate their service needs and offers. Multiagent negotiation approaches typically assume that the parties agree on service content and focus on finding a consensus on service price. In contrast, this work develops an approach through which the parties can negotiate the content of a service. This calls for a negotiation approach in which the parties can understand the semantics of their requests and offers and learn each other's preferences incrementally over time. Accordingly, we propose an architecture in which both consumers and producers use a shared ontology to negotiate a service. Through repetitive interactions, the provider learns consumers' needs accurately and can make better targeted offers. To enable fast and accurate learning of preferences, we develop an extension to Version Space and compare it with existing learning techniques. We further develop a metric for measuring semantic similarity between services and compare the performance of our approach using different similarity metrics. expand
|
|
SESSION: Ontologies: full papers |
|
|
|
Cooperative evolution of service ontologies |
|
Murat Şensoy,
Pinar Yolum
|
|
Article No.: 230 |
|
doi>10.1145/1329125.1329403 |
|
Full text: PDF
|
|
Most of the proposed approaches in automatic service selection assume the existence of a common ontology among communicating agents. However, this assumption becomes difficult to support in environments, where the agents' ontologies can evolve independently ...
Most of the proposed approaches in automatic service selection assume the existence of a common ontology among communicating agents. However, this assumption becomes difficult to support in environments, where the agents' ontologies can evolve independently based on their individual experiences. In this paper, we propose an approach through which agents can cooperatively update their ontologies and teach one another concepts from their ontologies. This leads to a society of agents with different but overlapping ontologies. Our simulation results show that mutually accepted concepts emerge based on the interactions of the agents. Further, agents learn and use concepts that are created by other to express their own service needs. expand
|
|
An ontology-based approach to interoperability for Bayesian agents |
|
Elder Rizzon Santos,
Moser Silva Fagundes,
Rosa Maria Vicari
|
|
Article No.: 231 |
|
doi>10.1145/1329125.1329404 |
|
Full text: PDF
|
|
This paper presents an ontology-based approach to promote the interoperability among agents that represent their knowledge through Bayesian networks. This research relies on semantic web foundations to achieve knowledge interoperability in the context ...
This paper presents an ontology-based approach to promote the interoperability among agents that represent their knowledge through Bayesian networks. This research relies on semantic web foundations to achieve knowledge interoperability in the context of multiagent systems. Our first step was the specification of an ontology that formalizes the structures of the Bayesian network representation. Once handled the issue of the knowledge representation, we specify how a Bayesian agent operates such representation. Thus, we define a model of internal architecture to support Bayesian agents in the knowledge sharing and maintenance tasks. expand
|
|
Exchanging reputation values among heterogeneous agent reputation models: an experience on ART testbed |
|
Anarosa A. F. Brandão,
Laurent Vercouter,
Sara Casare,
Jaime Sichman
|
|
Article No.: 232 |
|
doi>10.1145/1329125.1329405 |
|
Full text: PDF
|
|
In open MAS it is often a problem to achieve agents' interoperability. The heterogeneity of its components turns the establishment of interaction or cooperation among them into a non trivial task, since agents may use different internal models and the ...
In open MAS it is often a problem to achieve agents' interoperability. The heterogeneity of its components turns the establishment of interaction or cooperation among them into a non trivial task, since agents may use different internal models and the decision about trust other agents is a crucial condition to the formation of agents' cooperation. In this paper we propose the use of an ontology to deal with this issue. We experiment this idea by enhancing the ART reputation model with semantic data obtained from this ontology. This data is used during interaction among heterogeneous agents when exchanging reputation values and may be used for agents that use different reputation models. expand
|
|
SESSION: Agent learning, evolution, and adaptation: full papers |
|
|
|
Confidence-based policy learning from demonstration using Gaussian mixture models |
|
Sonia Chernova,
Manuela Veloso
|
|
Article No.: 233 |
|
doi>10.1145/1329125.1329407 |
|
Full text: PDF
|
|
We contribute an approach for interactive policy learning through expert demonstration that allows an agent to actively request and effectively represent demonstration examples. In order to address the inherent uncertainty of human demonstration, we ...
We contribute an approach for interactive policy learning through expert demonstration that allows an agent to actively request and effectively represent demonstration examples. In order to address the inherent uncertainty of human demonstration, we represent the policy as a set of Gaussian mixture models (GMMs), where each model, with multiple Gaussian components, corresponds to a single action. Incrementally received demonstration examples are used as training data for the GMM set. We then introduce our confident execution approach, which focuses learning on relevant parts of the domain by enabling the agent to identify the need for and request demonstrations for specific parts of the state space. The agent selects between demonstration and autonomous execution based on statistical analysis of the uncertainty of the learned Gaussian mixture set. As it achieves proficiency at its task and gains confidence in its actions, the agent operates with increasing autonomy, eliminating the need for unnecessary demonstrations of already acquired behavior, and reducing both the training time and the demonstration workload of the expert. We validate our approach with experiments in simulated and real robot domains. expand
|
|
Forecasting market prices in a supply chain game |
|
Christopher Kiekintveld,
Jason Miller,
Patrick R. Jordan,
Michael P. Wellman
|
|
Article No.: 234 |
|
doi>10.1145/1329125.1329408 |
|
Full text: PDF
|
|
Future market conditions can be a pivotal factor in making business decisions. We present and evaluate methods used by our agent, Deep Maize, to forecast market prices in the Trading Agent Competition Supply Chain Management Game. As a guiding principle ...
Future market conditions can be a pivotal factor in making business decisions. We present and evaluate methods used by our agent, Deep Maize, to forecast market prices in the Trading Agent Competition Supply Chain Management Game. As a guiding principle we seek to exploit as many sources of available information as possible to inform predictions. Since information comes in several different forms, we integrate well-known methods in a novel way to make predictions. The core of our predictor is a nearest-neighbors machine learning algorithm that identifies historical instances with similar economic indicators. We augment this with an online learning procedure that transforms the predictions by optimizing a scoring rule. This allows us to select more relevant historical contexts using additional information available during individual games. We also explore the advantages of two different representations for predicting price distributions. One uses absolute prices, and the other uses statistics of price time series to exploit local stability. We evaluate these methods using both data from the 2005 tournament final round and additional simulations. We compare several variations of our predictor to one another and a baseline predictor similar to those used by many other tournament agents. We show substantial improvements over the baseline predictor, and demonstrate that each element of our predictor contributes to improved performance. expand
|
|
Conditional random fields for activity recognition |
|
Douglas L. Vail,
Manuela M. Veloso,
John D. Lafferty
|
|
Article No.: 235 |
|
doi>10.1145/1329125.1329409 |
|
Full text: PDF
|
|
Activity recognition is a key component for creating intelligent, multi-agent systems. Intrinsically, activity recognition is a temporal classification problem. In this paper, we compare two models for temporal classification: hidden Markov models (HMMs), ...
Activity recognition is a key component for creating intelligent, multi-agent systems. Intrinsically, activity recognition is a temporal classification problem. In this paper, we compare two models for temporal classification: hidden Markov models (HMMs), which have long been applied to the activity recognition problem, and conditional random fields (CRFs). CRFs are discriminative models for labeling sequences. They condition on the entire observation sequence, which avoids the need for independence assumptions between observations. Conditioning on the observations vastly expands the set of features that can be incorporated into the model without violating its assumptions. Using data from a simulated robot tag domain, chosen because it is multi-agent and produces complex interactions between observations, we explore the differences in performance between the discriminatively trained CRF and the generative HMM. Additionally, we examine the effect of incorporating features which violate independence assumptions between observations; such features are typically necessary for high classification accuracy. We find that the discriminatively trained CRF performs as well as or better than an HMM even when the model features do not violate the independence assumptions of the HMM. In cases where features depend on observations from many time steps, we confirm that CRFs are robust against any degradation in performance. expand
|
|
SESSION: Agent learning, evolution, and adaptation: poster papers |
|
|
|
Using priorities to simplify behavior coordination |
|
Brent E. Eskridge,
Dean F. Hougen
|
|
Article No.: 236 |
|
doi>10.1145/1329125.1329411 |
|
Full text: PDF
|
|
Previous research has used behavior hierarchies to address the problem of coordinating large numbers of behaviors. However, behavior hierarchies scale poorly since they require the state information of low-level behaviors. Abstracting this state information ...
Previous research has used behavior hierarchies to address the problem of coordinating large numbers of behaviors. However, behavior hierarchies scale poorly since they require the state information of low-level behaviors. Abstracting this state information into priorities has recently been introduced to resolve this problem. In this work, we evaluate both the quality of priority-based behavior hierarchies and their ease of development. This is done by using grammatical evolution to learn how to coordinate low-level behaviors to accomplish a task. We show that not only do priority-based behavior hierarchies perform just as well as standard hierarchies but that they promote faster learning of solutions that are better suited as components in larger hierarchies. expand
|
|
Advice taking in multiagent reinforcement learning |
|
Michael Rovatsos,
Alexandros Belesiotis
|
|
Article No.: 237 |
|
doi>10.1145/1329125.1329412 |
|
Full text: PDF
|
|
This paper proposes the β-WoLF algorithm for multiagent reinforcement learning (MARL) that uses an additional "advice" signal to inform agents about mutually beneficial forms of behaviour. β-WoLF is an extension of the WoLF-PHC algorithm that ...
This paper proposes the β-WoLF algorithm for multiagent reinforcement learning (MARL) that uses an additional "advice" signal to inform agents about mutually beneficial forms of behaviour. β-WoLF is an extension of the WoLF-PHC algorithm that allows agents to assess whether the advice obtained through this additional reward signal is (i) useful for the learning agent itself and (ii) currently being followed by other agents in the system. We report on experimental results obtained with this novel algorithm which indicate that it enables cooperation in scenarios in which the need to defend oneself against exploitation results in poor coordination using existing MARL algorithms. expand
|
|
SESSION: Argumentation and negotiation: full papers |
|
|
|
Children in the forest: towards a canonical problem of spatio-temporal collaboration |
|
Yi Luo,
Ladislau Bölöni
|
|
Article No.: 238 |
|
doi>10.1145/1329125.1329414 |
|
Full text: PDF
|
|
Canonical problems are simplified representations of a class of real world problems. They allow researchers to compare algorithms in a standard setting which captures the most important challenges of the real world problems being modeled. Such examples ...
Canonical problems are simplified representations of a class of real world problems. They allow researchers to compare algorithms in a standard setting which captures the most important challenges of the real world problems being modeled. Such examples are the block world for planning, two-player games for algorithms which learn the behavior of the opponent agent, or the "split the pie" game for a large class of negotiation problems. In this paper we focus on negotiating collaboration in space and time, a problem with many important real world applications. Although technically a multi-issue negotiation, we show that the problem can not be represented in a satisfactory manner by the split the pie model. We propose the "children in the rectangular forest" (CRF) model as a possible canonical problem for negotiating spatio-temporal collaboration. By exploring a centralized and a peer-to-peer negotiation based solution, we demonstrate that the problem captures the main challenges of the real world problems while allows us to simplify away some of the computationally demanding but semantically marginal features of real world problems. expand
|
|
Hypotheses refinement under topological communication constraints |
|
Gauvain Bourgne,
Gael Hette,
Nicolas Maudet,
Suzanne Pinson
|
|
Article No.: 239 |
|
doi>10.1145/1329125.1329415 |
|
Full text: PDF
|
|
We investigate the properties of a multiagent system where each (distributed) agent locally perceives its environment. Upon perception of an unexpected event, each agent locally computes its favoured hypothesis and tries to propagate it to other agents, ...
We investigate the properties of a multiagent system where each (distributed) agent locally perceives its environment. Upon perception of an unexpected event, each agent locally computes its favoured hypothesis and tries to propagate it to other agents, by exchanging hypotheses and supporting arguments (observations). However, we further assume that communication opportunities are severely constrained and change dynamically. In this paper, we mostly investigate the convergence of such systems towards global consistency. We first show that (for a wide class of protocols that we shall define), the communication constraints induced by the topology will not prevent the convergence of the system, at the condition that the system dynamics guarantees that no agent will ever be isolated forever, and that agents have unlimited time for computation and arguments exchange. As this assumption cannot be made in most situations though, we then set up an experimental framework aiming at comparing the relative efficiency and effectiveness of different interaction protocols for hypotheses exchange. We study a critical situation involving a number of agents aiming at escaping from a burning building. The results reported here provide some insights regarding the design of optimal protocol for hypotheses refinement in this context. expand
|
|
On the relevance of utterances in formal inter-agent dialogues |
|
Simon Parsons,
Peter McBurney,
Elizabeth Sklar,
Michael Wooldridge
|
|
Article No.: 240 |
|
doi>10.1145/1329125.1329416 |
|
Full text: PDF
|
|
Work on argumentation-based dialogue has defined frameworks within which dialogues can be carried out, established protocols that govern dialogues, and studied different properties of dialogues. This work has established the space in which agents are ...
Work on argumentation-based dialogue has defined frameworks within which dialogues can be carried out, established protocols that govern dialogues, and studied different properties of dialogues. This work has established the space in which agents are permitted to interact through dialogues. Recently, there has been increasing interest in the mechanisms agents might use to choose how to act --- the rhetorical manoeuvring that they use to navigate through the space defined by the rules of the dialogue. Key in such considerations is the idea of relevance, since a usual requirement is that agents stay focussed on the subject of the dialogue and only make relevant remarks. Here we study several notions of relevance, showing how they can be related to both the rules for carrying out dialogues and to rhetorical manoeuvring. expand
|
|
A generative inquiry dialogue system |
|
Elizabeth Black,
Anthony Hunter
|
|
Article No.: 241 |
|
doi>10.1145/1329125.1329417 |
|
Full text: PDF
|
|
The majority of existing work on agent dialogues considers negotiation, persuasion or deliberation dialogues. We focus on inquiry dialogues that allow two agents to share knowledge in order to construct an argument for a specific claim. Inquiry dialogues ...
The majority of existing work on agent dialogues considers negotiation, persuasion or deliberation dialogues. We focus on inquiry dialogues that allow two agents to share knowledge in order to construct an argument for a specific claim. Inquiry dialogues are particularly useful in cooperative domains such as healthcare, and can be embedded within other dialogue types. Existing inquiry dialogue systems only model dialogues, meaning they provide a protocol which dictates what the possible legal next moves are but not which of these moves to make. Our system not only includes a general dialogue-game style inquiry protocol but also a strategy, for an agent to use with this protocol, that selects exactly one of the legal moves to make. We propose a benchmark against which we compare our dialogues, being the arguments that can be constructed from the union of the agents' beliefs, and use this to define soundness and completeness properties for inquiry dialogues. We show that these properties hold for all well-formed inquiry dialogues in our system. expand
|
|
Negotiation by abduction and relaxation |
|
Chiaki Sakama,
Katsumi Inoue
|
|
Article No.: 242 |
|
doi>10.1145/1329125.1329418 |
|
Full text: PDF
|
|
This paper studies a logical framework for automated negotiation between two agents. We suppose an agent who has a knowledge base represented by a logic program. Then, we introduce methods of constructing counter-proposals in response to proposals made ...
This paper studies a logical framework for automated negotiation between two agents. We suppose an agent who has a knowledge base represented by a logic program. Then, we introduce methods of constructing counter-proposals in response to proposals made by an agent. To this end, we combine the techniques of extended abduction in artificial intelligence and relaxation in cooperative query answering for databases. These techniques are respectively used for producing conditional proposals and neighborhood proposals in the process of negotiation. We provide a negotiation protocol based on the exchange of these proposals and develop procedures for computing new proposals. expand
|
|
The LOGIC negotiation model |
|
Carles Sierra,
John Debenham
|
|
Article No.: 243 |
|
doi>10.1145/1329125.1329419 |
|
Full text: PDF
|
|
Successful negotiators prepare by determining their position along five dimensions: Legitimacy, Options, Goals, Independence, and Commitment, (LOGIC). We introduce a negotiation model based on these dimensions and on two primitive concepts: intimacy ...
Successful negotiators prepare by determining their position along five dimensions: Legitimacy, Options, Goals, Independence, and Commitment, (LOGIC). We introduce a negotiation model based on these dimensions and on two primitive concepts: intimacy (degree of closeness) and balance (degree of fairness). The intimacy is a pair of matrices that evaluate both an agent's contribution to the relationship and its opponent's contribution each from an information view and from a utilitarian view across the five LOGIC dimensions. The balance is the difference between these matrices. A relationship strategy maintains a target intimacy for each relationship that an agent would like the relationship to move towards in future. The negotiation strategy maintains a set of Options that are in-line with the current intimacy level, and then tactics wrap the Options in argumentation with the aim of attaining a successful deal and manipulating the successive negotiation balances towards the target intimacy. expand
|
|
SESSION: Argumentation and negotiation: poster papers |
|
|
|
Multi-agent information system using mobile agent negotiation based on a flexible transport ontology |
|
Hayfa Zgaya,
Slim Hammadi
|
|
Article No.: 244 |
|
doi>10.1145/1329125.1329421 |
|
Full text: PDF
|
|
According to the exponential growth of services available on large distributed networks, transport customers require relevant, interactive and instantaneous information during their travels. Thus, we designed and implemented a multi-agent information ...
According to the exponential growth of services available on large distributed networks, transport customers require relevant, interactive and instantaneous information during their travels. Thus, we designed and implemented a multi-agent information system using a special kind of software agents: the Mobile Agents. However, some network errors can occur during the mobile agents moving through the distant network nodes (bottleneck, failure, crash…). For this problem, we define in this paper a mobile agent negotiation process to reassign required services to available network nodes according to their current states in their correspondent final routes called Workplans. Furthermore, the complex interactions features of our system exceed the limits of the traditional negotiation systems which impose several restrictions on the type and format of the negotiation messages. Therefore, we designed and implemented a flexible transport ontology which allows an easy handling of the terms and messages for negotiating. expand
|
|
Negotiation partners selection mechanism based on context-dependent similarity relations |
|
Jakub Brzostowski,
Ryszard Kowalczyk
|
|
Article No.: 245 |
|
doi>10.1145/1329125.1329422 |
|
Full text: PDF
|
|
This paper proposes a context-dependent case-based mechanism for selecting negotiation partners with the focus on the adaptation of similarity relations to a specific context. The similarity relations are important part of the reasoning mechanism and ...
This paper proposes a context-dependent case-based mechanism for selecting negotiation partners with the focus on the adaptation of similarity relations to a specific context. The similarity relations are important part of the reasoning mechanism and have to be defined in a way consistent with the specific type of interaction, the negotiation. We validate the proposed approach with the use of both the probability and possibility distributions by performing experimental evaluation. expand
|
|
A pareto optimal model for automated multi-attribute negotiations |
|
Guoming Lai,
Katia Sycara,
Cuihong Li
|
|
Article No.: 246 |
|
doi>10.1145/1329125.1329423 |
|
Full text: PDF
|
|
This paper presents an applicable model for complex multi-attribute negotiations between autonomous agents. The model adopts a novel protocol which decomposes the original n-dimensional negotiation space into a series of negotiation base lines ...
This paper presents an applicable model for complex multi-attribute negotiations between autonomous agents. The model adopts a novel protocol which decomposes the original n-dimensional negotiation space into a series of negotiation base lines and in each period agents negotiate locally based on a given base line. A belief based negotiation strategy and an offer enhancement process are proposed for agents to make base offer on the negotiation base line and search for Pareto optimal enhancements of the base offer. The model achieves asymptotic Pareto optimality. expand
|
|
Using iterative narrowing to enable multi-party negotiations with multiple interdependent issues |
|
Hiromitsu Hattori,
Mark Klein,
Takayuki Ito
|
|
Article No.: 247 |
|
doi>10.1145/1329125.1329424 |
|
Full text: PDF
|
|
Multi-issue negotiations are a central part of many coordination challenges, and thus represent an important research topic. Almost all previous work in this area has assumed that negotiation issues are independent, but this is rarely the case in real-world ...
Multi-issue negotiations are a central part of many coordination challenges, and thus represent an important research topic. Almost all previous work in this area has assumed that negotiation issues are independent, but this is rarely the case in real-world contexts. Our work focuses on negotiation with interdependent issues and, therefore, nonlinear (multi-optimum) agent utility functions. Since the utility functions are typically very complex, the challenge becomes finding high-quality negotiation outcomes without making unrealistic demands concerning how much agents reveal about their utilities. Since negotiations often involve more than two parties, the approach should also be scalable. In this paper, we propose a novel protocol for addressing these challenges, wherein agents approach agreements by iteratively narrowing the space of possible agreements. In the early stages, agents submit rough bids representing promising regions from their utility functions. In later stages, they submit increasingly narrow bids for the subset of those regions that the negotiating parties all liked. We show that our method outperforms existing methods in large nonlinear utility spaces, and is computationally feasible for negotiations with as many as ten agents. expand
|
|
AutoMed: an automated mediator for bilateral negotiations under time constraints |
|
Michal Chalamish,
Sarit Kraus
|
|
Article No.: 248 |
|
doi>10.1145/1329125.1329425 |
|
Full text: PDF
|
|
Engaging in negotiations is a daily activity. Some negotiations require the involvement of a mediator in order to be concluded in a satisfying manner. In such cases, the objective is to help the negotiators reach a mutually beneficial agreement [6, 4]. ...
Engaging in negotiations is a daily activity. Some negotiations require the involvement of a mediator in order to be concluded in a satisfying manner. In such cases, the objective is to help the negotiators reach a mutually beneficial agreement [6, 4]. Our research focuses on mediation tools for dealing with bilateral negotiations under time constraints. expand
|
|
Workflow coordination for service-oriented multiagent systems |
|
Jiangbo Dang,
Jingshan Huang,
Michael N. Huhns
|
|
Article No.: 249 |
|
doi>10.1145/1329125.1329426 |
|
Full text: PDF
|
|
From a multiagent viewpoint, a workflow is a dynamic set of tasks performed by a set of agents to reach a shared goal. We show herein that commitments among agents can be used to model a workflow and coordinate their execution of it. From a service-oriented ...
From a multiagent viewpoint, a workflow is a dynamic set of tasks performed by a set of agents to reach a shared goal. We show herein that commitments among agents can be used to model a workflow and coordinate their execution of it. From a service-oriented computing viewpoint, a workflow can be represented as a set of services and a specification for the control and data flows among these services to address some business needs. As a formal declarative knowledge representation model, ontology is used as a basis for agent-based workflow execution and coordination. This paper presents methodologies to map an Ontology Web Language for Services (OWL-S) representation for a workflow to a CPN graph, a graphical and mathematical modeling tool for describing and analyzing information processing systems, and then infer commitments and causal relationships from the CPN graph. We provide an example scenario to describe our algorithms. expand
|
|
An equal excess negotiation algorithm for coalition formation |
|
Hrishikesh J. Goradia,
Jose M. Vidal
|
|
Article No.: 250 |
|
doi>10.1145/1329125.1329427 |
|
Full text: PDF
|
|
Coalition formation is an important form of interaction in multiagent systems. It enables the agents to satisfy tasks that they would otherwise be unable to perform, or would perform with a lower efficiency. The focus of our work is on real-world application ...
Coalition formation is an important form of interaction in multiagent systems. It enables the agents to satisfy tasks that they would otherwise be unable to perform, or would perform with a lower efficiency. The focus of our work is on real-world application domains where we have systems inhabited by rational, self-interested agents. We also assume an environment without any trusted central manager to resolve issues concerning multiple agents. For such environments, we have to determine both an optimal (utility-maximizing) coalition configuration and a stable payoff configuration, concurrently and in a distributed fashion. Solving each of these problems is known to be computationally expensive, and having to consider them together exacerbates the problem further. In this paper, we present our Progressive, Anytime, Convergent, and Time-efficient (PACT) algorithm for coalition formation to address the above concerns. We assess the stability of the resulting coalition by using a new stability concept, the relaxed core, which is a slight variation on the core. We show experimentally that our algorithm performs admirably in comparison to an optimal solution, it typically produces solutions that are relaxed-core-stable, and it scales well. expand
|
|
SESSION: Emergent behavior: full papers |
|
|
|
Effective tag mechanisms for evolving coordination |
|
Matthew Matlock,
Sandip Sen
|
|
Article No.: 251 |
|
doi>10.1145/1329125.1329429 |
|
Full text: PDF
|
|
Tags or observable features shared by a group of similar agents are effectively used in real and artificial societies to signal intentions and can be used to infer unobservable properties and choose appropriate behaviors. Use of tags to select partners ...
Tags or observable features shared by a group of similar agents are effectively used in real and artificial societies to signal intentions and can be used to infer unobservable properties and choose appropriate behaviors. Use of tags to select partners has been shown to produce stable cooperation in agent populations playing the Prisoner's Dilemma game. Existing tag mechanisms, however, can promote cooperation only if that requires identical actions from all group members. We propose a more general tag-based interaction scheme that facilitates and supports significantly richer coordination between agents. Our work is motivated by previous research that showed the ineffectiveness of current tag schemes for solving games requiring divergent actions. The mechanisms proposed here not only solves those problems but are effective for other general-sum games. We argue that these general-purpose tag mechanisms allow new application possibilities of multiagent learning algorithms as they allow an agent to reuse its learned knowledge about one agent when interacting with other agents sharing the same observable features. expand
|
|
Self-organizing social and spatial networks under what-if scenarios |
|
Il-Chul Moon,
Kathleen M. Carley
|
|
Article No.: 252 |
|
doi>10.1145/1329125.1329430 |
|
Full text: PDF
|
|
Multi-agent models have been used to simulate complex systems in many domains. In some models, the agents move in a physical/grid space and are constrained by their locations on the spatial space, e.g. Sugarscape. In others, the agents interact in a ...
Multi-agent models have been used to simulate complex systems in many domains. In some models, the agents move in a physical/grid space and are constrained by their locations on the spatial space, e.g. Sugarscape. In others, the agents interact in a social multi-dimensional space and are bound to their knowledge and social positions, e.g. Construct. However, many real world problems require a mixed model containing both spatial and social features. This paper introduces such a multi agent system, Construct-Spatial, which simulates agent communication and movement simultaneously. It is an extended version of Construct, which is a multi-agent social model, and its extension is based on a multi-agent grid model, Sugarscape. To understand the impact of this integration of the two spaces, we run virtual experiments and compare the output from the combined space to those from each of the two spaces. The initial analysis reveals that the integration facilitates unbalanced knowledge distribution across the agents compared to the grid-only model and limits agent network connections compared to the social network model without spatial constraints. After the comparisons, we setup what-if scenarios where we varied the type of the threats faced by network and observe their emergent behaviors. From the what-if analyses, we locate the best destabilization scenario and find the propagation of the effects from the spatial space to the social network space. We believe that this model can be a conceptual model for assessing the efficiency and the robustness of team deployments, network node distributions, sensor distributions, etc. expand
|
|
Invited talk by winner of IFAAMAS Victor Lesser Distinguished Dissertation Award |
|
Vincent Conitzer
|
|
Article No.: 253 |
|
doi>10.1145/1329125.1329431 |
|
|
|
SESSION: Applications and computational environments: full papers |
|
|
|
Computing the Banzhaf power index in network flow games |
|
Yoram Bachrach,
Jeffrey S. Rosenschein
|
|
Article No.: 254 |
|
doi>10.1145/1329125.1329433 |
|
Full text: PDF
|
|
Preference aggregation is used in a variety of multiagent applications, and as a result, voting theory has become an important topic in multiagent system research. However, power indices (which reflect how much "real power" a voter has in a weighted ...
Preference aggregation is used in a variety of multiagent applications, and as a result, voting theory has become an important topic in multiagent system research. However, power indices (which reflect how much "real power" a voter has in a weighted voting system) have received relatively little attention, although they have long been studied in political science and economics. The Banzhaf power index is one of the most popular; it is also well-defined for any simple coalitional game. In this paper, we examine the computational complexity of calculating the Banzhaf power index within a particular multiagent domain, a network flow game. Agents control the edges of a graph; a coalition wins if it can send a flow of a given size from a source vertex to a target vertex. The relative power of each edge/agent reflects its significance in enabling such a flow, and in real-world networks could be used, for example, to allocate resources for maintaining parts of the network. We show that calculating the Banzhaf power index of each agent in this network flow domain is #P-complete. We also show that for some restricted network flow domains there exists a polynomial algorithm to calculate agents' Banzhaf power indices. expand
|
|
Distributed agent-based air traffic flow management |
|
Kagan Tumer,
Adrian Agogino
|
|
Article No.: 255 |
|
doi>10.1145/1329125.1329434 |
|
Full text: PDF
|
|
Air traffic flow management is one of the fundamental challenges facing the Federal Aviation Administration (FAA) today. The FAA estimates that in 2005 alone, there were over 322,000 hours of delays at a cost to the industry in excess of three billion ...
Air traffic flow management is one of the fundamental challenges facing the Federal Aviation Administration (FAA) today. The FAA estimates that in 2005 alone, there were over 322,000 hours of delays at a cost to the industry in excess of three billion dollars. Finding reliable and adaptive solutions to the flow management problem is of paramount importance if the Next Generation Air Transportation Systems are to achieve the stated goal of accommodating three times the current traffic volume. This problem is particularly complex as it requires the integration and/or coordination of many factors including: new data (e.g., changing weather info), potentially conflicting priorities (e.g., different airlines), limited resources (e.g., air traffic controllers) and very heavy traffic volume (e.g., over 40,000 flights over the US airspace). In this paper we use FACET -- an air traffic flow simulator developed at NASA and used extensively by the FAA and industry -- to test a multi-agent algorithm for traffic flow management. An agent is associated with a fix (a specific location in 2D space) and its action consists of setting the separation required among the airplanes going though that fix. Agents use reinforcement learning to set this separation and their actions speed up or slow down traffic to manage congestion. Our FACET based results show that agents receiving personalized rewards reduce congestion by up to 45% over agents receiving a global reward and by up to 67% over a current industry approach (Monte Carlo estimation). expand
|
|
Scaling-up shopbots: a dynamic allocation-based approach |
|
David Sarne,
Sarit Kraus,
Takayuki Ito
|
|
Article No.: 256 |
|
doi>10.1145/1329125.1329435 |
|
Full text: PDF
|
|
In this paper we consider the problem of eCommerce comparison shopping agents (shopbots) that are limited by capacity constraints. In light of the phenomenal increase both in demand for price comparison services over the internet and in the number of ...
In this paper we consider the problem of eCommerce comparison shopping agents (shopbots) that are limited by capacity constraints. In light of the phenomenal increase both in demand for price comparison services over the internet and in the number of opportunities available in electronic markets, shopbots are nowadays required to improve the utilization of their finite set of querying resources. In this paper we introduce PlanBot, an innovative shopbot which uniquely integrates concepts from production management and economic search theory. PlanBot aims to maximize its efficiency by dynamically re-planning the allocation of its querying resources according to the results of formerly executed queries and new arriving requests. We detail the design principles that drive the PlanBot's operation and illustrate its improved performance (in comparison to the traditional shopbots' First-Come-First-Served (FCFS) query execution mechanisms) using a simulated environment which is based on price datasets collected over the internet. Our encouraging results suggest that the design principles we apply have the potential of being used as an infrastructure for various implementations of future comparison shopping agents. expand
|
|
SESSION: Applications and computational environments: poster papers |
|
|
|
Locating RF emitters with large UAV teams |
|
Paul Scerri,
Robin Glinton,
Sean Owens,
Steven Okamoto,
Katia Sycara
|
|
Article No.: 257 |
|
doi>10.1145/1329125.1329437 |
|
Full text: PDF
|
|
The rapidly improving availability of small, unmanned aerial vehicles (UAVs) and their ever reducing cost is leading to considerable interest in multi-UAV applications. However, while UAVs have become smaller and cheaper, there is a lack of sensors that ...
The rapidly improving availability of small, unmanned aerial vehicles (UAVs) and their ever reducing cost is leading to considerable interest in multi-UAV applications. However, while UAVs have become smaller and cheaper, there is a lack of sensors that are light, small and power efficient enough to be used on a small UAV yet are capable of taking useful measurements of objects often several hundred metres below them. Static or video cameras are one option, however image processing normally requires human input or at least computationally intensive offboard processing, restricting their applicability to very small UAV teams. In this paper, we look at how teams of UAVs can use very small Relative Signal Strength Indicator (RSSI) sensors whose only capability is to detect the approximate strength of a Radio Frequency (RF) signal, to search for and accurately locate such sources. RSSI sensors give at most an approximate range to an RF emitter and will be misleading when signals overlap. Applications of such UAV teams range from finding lost hikers or skiers carrying small RF beacons to military reconnaissance operations. Moreover, the core techniques have a wider applicability to a range of robotic teams that rely on highly uncertain sensors, e.g., search and rescue in disaster environments. expand
|
|
Moving target search in grid worlds |
|
Cagatay Undeger,
Faruk Polat
|
|
Article No.: 258 |
|
doi>10.1145/1329125.1329438 |
|
Full text: PDF
|
|
In this paper, we propose a real-time moving target search algorithm for dynamic and partially observable environments, modeled as grid world. The proposed algorithm, Real-time Moving Target Evaluation Search (MTES), is able to detect the closed directions ...
In this paper, we propose a real-time moving target search algorithm for dynamic and partially observable environments, modeled as grid world. The proposed algorithm, Real-time Moving Target Evaluation Search (MTES), is able to detect the closed directions around the agent, and determine the best direction that avoids the nearby obstacles, leading to a moving target which is assumed to be escaping almost optimally. We compared our proposal with Moving Target Search (MTS) and observed a significant improvement in the solution paths. Furthermore, we also tested our algorithm against A* in order to report quality of our solutions. expand
|
|
An agent-based model that relates investment in education to economic prosperity |
|
Yuqing Tang,
Simon Parsons,
Elizabeth Sklar
|
|
Article No.: 259 |
|
doi>10.1145/1329125.1329439 |
|
Full text: PDF
|
|
We describe work on an agent-based model that captures the relationship between the investment that a society makes in education and the outcome in terms of the health of the society's economy. In this work we created an agent-based version of an equation-based ...
We describe work on an agent-based model that captures the relationship between the investment that a society makes in education and the outcome in terms of the health of the society's economy. In this work we created an agent-based version of an equation-based model from the economics literature, and explored various settings for parameters that control the behaviors of the agents and their environment. expand
|
|
An agent-based methodology for analyzing and visualizing educational assessment data |
|
Elizabeth Sklar,
Jordan Salvit,
Christopher Camacho,
William Liu
|
|
Article No.: 260 |
|
doi>10.1145/1329125.1329440 |
|
Full text: PDF
|
|
We examine data collected from on-line assessments of the numeracy and literacy skills of young students in order to construct probabilistic agent-based controllers. We demonstrate the value of this methodology as an effective means for both analyzing ...
We examine data collected from on-line assessments of the numeracy and literacy skills of young students in order to construct probabilistic agent-based controllers. We demonstrate the value of this methodology as an effective means for both analyzing and visualizing aspects of large data sets that are difficult to capture with traditional equation-based statistics and static graphics. expand
|
|
SESSION: Demonstration program |
|
|
|
Agent trust evaluation and team formation in heterogeneous organizations |
|
K. S. Barber,
J. Ahn,
S. Budalakoti,
D. DeAngelis,
K. K. Fullam,
C. L. D. Jones,
X. Sui
|
|
Article No.: 261 |
|
doi>10.1145/1329125.1329442 |
|
Full text: PDF
|
|
This demonstration highlights different aspects of the bottom-up assembly of multi-agent teams; illustrating trust evaluation of potential partners via experience- and reputation-based trust models, multi-dimensional trust evaluation of potential partners, ...
This demonstration highlights different aspects of the bottom-up assembly of multi-agent teams; illustrating trust evaluation of potential partners via experience- and reputation-based trust models, multi-dimensional trust evaluation of potential partners, task selection through personality-based modeling and team selection strategies that maximize a team's ability to function in dynamic environments. The demonstration format will be a software live demo with supporting slide shows. expand
|
|
F-trade: an agent-mining symbiont for financial services |
|
Longbing Cao,
Chengqi Zhang
|
|
Article No.: 262 |
|
doi>10.1145/1329125.1329443 |
|
Full text: PDF
|
|
The interaction and integration of agent technology and data mining presents prominent benefits to solve some of challenging issues in individual areas. For instance, data mining can enhance agent learning, while agent can benefit data mining with distributed ...
The interaction and integration of agent technology and data mining presents prominent benefits to solve some of challenging issues in individual areas. For instance, data mining can enhance agent learning, while agent can benefit data mining with distributed pattern discovery. In this paper, we summarize the main functionalities and features of an agent service and data mining symbiont -- F-Trade. The F-Trade is constructed in Java agent service following the theory of open complex agent systems. We demonstrate the roles of agents in building up the F-Trade, as well as how agents can support data mining. On the other hand, data mining is used to strengthen agents. F-Trade provides flexible and efficient services of trading evidence back-testing, optimization and discovery, as well as plug and play of algorithms, data and system modules for financial trading and surveillance with online connectivity to huge quantities of global market data. expand
|
|
Demonstration: disaster evacuation support |
|
Christopher J. Carpenter,
Robert N. Lass,
Evan Sultanik,
Christopher J. Dugan,
Gaurav Naik,
Pragnesh Jay Modi,
Joseph B. Kopena,
Duc N. Nguyen,
William C. Regli
|
|
Article No.: 263 |
|
doi>10.1145/1329125.1329444 |
|
Full text: PDF
|
|
Evacuation or sheltering of neighborhoods, cities, or regions is a major component of responding to any natural or other disaster. Poorly chosen and uncoordinated destinations can quickly overwhelm shelter capacities. Insufficient knowledge and decision ...
Evacuation or sheltering of neighborhoods, cities, or regions is a major component of responding to any natural or other disaster. Poorly chosen and uncoordinated destinations can quickly overwhelm shelter capacities. Insufficient knowledge and decision processes may also lead to mismatches between evacuee needs and shelter capabilites, such as advanced medical units. Unfortunately, the intuitive and easy response of moving evacuees to the closest refuges can easily lead to this situation. This work attempts to address this problem by developing tools and techniques to help emergency personnel create a shared and accurate understanding of the situation, make the best decisions for the group, and effectively conduct disaster evacuations. expand
|
|
Demonstration of teamwork in uncertain domains using hybrid BDI-POMDP systems |
|
Tapana Gupta,
Pradeep Varakantham,
Timothy W. Rauenbusch,
Milind Tambe
|
|
Article No.: 264 |
|
doi>10.1145/1329125.1329445 |
|
Full text: PDF
|
|
Personal Assistant agents are becoming increasingly important in a variety of application domains in offices, at home, for medical care and many others [5, 1]. These agents are required to constantly monitor their environment (including the state of ...
Personal Assistant agents are becoming increasingly important in a variety of application domains in offices, at home, for medical care and many others [5, 1]. These agents are required to constantly monitor their environment (including the state of their users), and make periodic decisions based on their monitoring. For example, in an office environment, agents may need to monitor the location of their user in order to ascertain whether the user would be able to make it on time to a meeting [5]. Or, they may be required to monitor the progress of a user on a particular assignment and decide whether or not the user would be able to meet the deadline for completing the assignment. Teamwork between such agents is important in Personal Assistant applications to enable agents working together to achieve a common goal (such as finishing a project on time). This working demonstration shows a hybrid(BDI-POMDP) approach to accomplish such teamwork. expand
|
|
Creating densely populated virtual environments |
|
Ryan McAlinden,
Don Dini,
Chirag Merchant,
Michael van Lent
|
|
Article No.: 265 |
|
doi>10.1145/1329125.1329446 |
|
Full text: PDF
|
|
Few virtual environments are capable of supporting large numbers of autonomous agents (> 5000) with complex decision-making on a single machine. This demonstration depicts such an agent infrastructure set within a game-based virtual environment. The ...
Few virtual environments are capable of supporting large numbers of autonomous agents (> 5000) with complex decision-making on a single machine. This demonstration depicts such an agent infrastructure set within a game-based virtual environment. The embodied agent framework consists of two primary components: a lower-level navigation layer consisting of commercially-available AI middleware, and a higher-level cellular automata system driven by agent goals, resources and thresholds. The overarching game-based infrastructure consists of these two AI components, along with an ICT-developed perception system sitting atop the Gamebryo rendering engine. The typical number of agents supported on a dual-core CPU with a modern graphics card is ~10,000 rendering at 30 frames-per-second. To support this quantity and level of intelligence several design considerations were implemented, including the use of multiple threads, a clone/sprite-based avatar view, and a dynamic level-of-detail update system. Future work includes distributing the AI mechanism across multiple machines to support numbers of agents a level of magnitude higher than is currently possible. expand
|
|
Agent-based reduction of information density (ARID) demonstration |
|
Stephen O'Hara,
Nathan Dwyer
|
|
Article No.: 266 |
|
doi>10.1145/1329125.1329447 |
|
Full text: PDF
|
|
21st Century Systems, Inc. (21CSI) has developed a prototype software system that uses intelligent agents to assist in the decluttering of net-centric command and control (C2) displays. The software is called ARID, or Agent-based Reduction ...
21st Century Systems, Inc. (21CSI) has developed a prototype software system that uses intelligent agents to assist in the decluttering of net-centric command and control (C2) displays. The software is called ARID, or Agent-based Reduction of Information Density. The prototype applies our ARID technology to a C2 system designed to provide a single operator with supervisory control over the operations of multiple (typically 4 to 8) unmanned aerial vehicles (UAVs). Our demonstration will show a simulation of such a C2 system running scripted scenarios. We will have an interactive demonstration. Those who volunteer to participate in the demonstration will have the opportunity to assume the role of the UAV operator and respond to the mission tasking and emergent conditions that occur during the approximately 10 minute long scenarios. The participants will be able to run two similar scenarios, one with and one without ARID agents enabled. The participants will see for themselves whether or not the agent-assisted decluttering and alerting features helped improve their scores, which are revealed at the end of the scenario. This is a software-only simulation. Actual UAVs will, of course, not be flown. Participants will find that participating in the demonstration is somewhat like playing a real-time strategy game but with simplistic rules. It will be a fun demonstration that will show the value of agent-based assistive technology. expand
|
|
WS-agreement based resource negotiation in AgentScape |
|
M. A. Oey,
R. J. Timmer,
D. G. A. Mobach,
B. J. Overeinder,
F. M. T. Brazier
|
|
Article No.: 267 |
|
doi>10.1145/1329125.1329448 |
|
Full text: PDF
|
|
Mobile agents require access to computing resources on heterogeneous systems across the Internet. This demo illustrates how agents can negotiate terms and conditions of resource access with one or more mediators representing virtual organizations of ...
Mobile agents require access to computing resources on heterogeneous systems across the Internet. This demo illustrates how agents can negotiate terms and conditions of resource access with one or more mediators representing virtual organizations of autonomous hosts, before migrating to a new location. Time-limited resource contracts are the result: contracts between agents and mediators, and contracts between mediators and hosts. The negotiation protocol and language are based on the WS-Agreement Specification, and have been implemented and tested within the Agent-Scape framework. The demonstration shows in detail how this negotiation framework has been implemented for resource access on remote, distributed systems. expand
|
|
Airspace management of autonomous UAVs |
|
Osher Yadgar,
Regis Vincent
|
|
Article No.: 268 |
|
doi>10.1145/1329125.1329449 |
|
Full text: PDF
|
|
One major issue currently preventing the adoption of autonomous unmanned air vehicles (UAVs) is the lack of airspace management to prevent the UAVs from colliding with each other, with human-piloted planes or helicopters, with static objects such as ...
One major issue currently preventing the adoption of autonomous unmanned air vehicles (UAVs) is the lack of airspace management to prevent the UAVs from colliding with each other, with human-piloted planes or helicopters, with static objects such as buildings, and with dynamic flying objects such as flocks of birds. In this work, we present a novel airspace management approach to autonomous UAVs. Our airspace management system allows UAVs to dynamically and autonomously choose between three modes of operation: (i) centralized, (ii) cooperative decentralized, (iii) noncooperative decentralized. expand
|
|
Emergent ad hoc sensor network connectivity in large-scale disaster zones |
|
Osher Yadgar
|
|
Article No.: 269 |
|
doi>10.1145/1329125.1329450 |
|
Full text: PDF
|
|
We introduce a disaster-zone-monitoring web-based application. While using this application, the user may simulate different large-scale disaster zones. He may use tools to define a disaster zone and its communication requirements. The web application ...
We introduce a disaster-zone-monitoring web-based application. While using this application, the user may simulate different large-scale disaster zones. He may use tools to define a disaster zone and its communication requirements. The web application uses the Google Earth infrastructure and is publicly available to use online during the conference from every computer connected to the Internet. The evolving deployment of nodes will be updated each second to reflect the current state of the entities residing within the monitored zone. The user will be able to navigate through the disaster zone to inspect the dynamically changed environment, and to learn about node deployment and the current network connectivity and service availability. He could ask the system to find the number of agents required to be deployed and present this deployment. The user also will be able to define the budget limitations. Thus, the system will derive the number of agents, their deployment, and the resulting system utility. Given a system utility, the user can decide whether to adopt the deployment even though it does not guarantee full coverage, or to increase the budget to improve system utility. expand
|
|
AUML protocols and code generation in the Prometheus design tool |
|
Lin Padgham,
John Thangarajah,
Michael Winikoff
|
|
Article No.: 270 |
|
doi>10.1145/1329125.1329451 |
|
Full text: PDF
|
|
Prometheus is an agent-oriented software engineering methodology. The Prometheus Design Tool (PDT) is a software tool that supports a designer who is using the Prometheus methodology. PDT has recently been extended with two significant ...
Prometheus is an agent-oriented software engineering methodology. The Prometheus Design Tool (PDT) is a software tool that supports a designer who is using the Prometheus methodology. PDT has recently been extended with two significant new features: support for Agent UML interaction protocols, and code generation. expand
|
|
FearNot! demo: a virtual environment with synthetic characters to help bullying |
|
Marco Vala,
Pedro Sequeira,
Ana Paiva,
Ruth Aylett
|
|
Article No.: 271 |
|
doi>10.1145/1329125.1329452 |
|
Full text: PDF
|
|
This demo features FearNot!, a school-based Virtual Learning Environment (VLE) populated by synthetic characters representing the various actors in a bullying scenario. FearNot! uses emergent narrative to create improvised dramas with those characters. ...
This demo features FearNot!, a school-based Virtual Learning Environment (VLE) populated by synthetic characters representing the various actors in a bullying scenario. FearNot! uses emergent narrative to create improvised dramas with those characters. The goal is to enable children to explore bullying issues, and coping strategies, interacting with characters to which they become affectively engaged. Through their appearance, behaviours and affect, these characters are able to trigger empathic relations with the user. FearNot! is used for Personal and Health Social Education (PHSE) for children aged 8--12, in the UK, Portugal and Germany. expand
|
|
SESSION: Industry track program |
|
|
|
Demand side management in district heating systems |
|
Fredrik Wernstedt,
Paul Davidsson,
Christian Johansson
|
|
Article No.: 272 |
|
doi>10.1145/1329125.1329454 |
|
Full text: PDF
|
|
This paper describes a multiagent system that has made the voyage from research project to commercialised product. The purpose for the multiagent system is to dynamically control a system so that the load of the system is below certain threshold values ...
This paper describes a multiagent system that has made the voyage from research project to commercialised product. The purpose for the multiagent system is to dynamically control a system so that the load of the system is below certain threshold values without reduction of quality of service and by that, to avoid the usage of top load production sources and to reduce energy consumption. The fundamental idea behind the system is that a large number of small local decisions taken all in all have great impact on the overall system performance. A field-test as well as a return of investment analysis are presented. expand
|
|
MAGENTA technology case studies of magenta i-scheduler for road transportation |
|
Petr Skobelev,
Andrey Glaschenko,
Ilya Grachev,
Sergey Inozemtsev
|
|
Article No.: 273 |
|
doi>10.1145/1329125.1329455 |
|
Full text: PDF
|
|
The paper describes functionality of Magenta Multi-Agent Logistics i-Scheduler Engine presented on AAMAS 2006 conferences and gives examples of its application in business domain. The i-Scheduler Engine was designed to be scalable without risk of combinatorial ...
The paper describes functionality of Magenta Multi-Agent Logistics i-Scheduler Engine presented on AAMAS 2006 conferences and gives examples of its application in business domain. The i-Scheduler Engine was designed to be scalable without risk of combinatorial explosion, in order to handle large transportation networks as a whole. The multi-agent architecture combined with semantic network allows very granular approach for every business entity of transportation network (client, order, cargo, truck, driver, etc) and balancing of their conflicting interests. The i-Scheduler considers individual constraints and, interestingly, specific preferences of customers, drivers, trucks, cargoes, etc. This results in a unique ability to combine inbound and outbound deliveries, different fleets or private networks, driving more value from finding effective backhauls and consolidations. The paper covers the history of development, architecture and current functionality of the engine and provides a set of case studies in different transportation networks, which outline the most serious challenges Magenta overcame in each case. expand
|
|
Intelligent agent framework for order entry and management |
|
Thuc Duong Nguyen,
Simon Thompson
|
|
Article No.: 274 |
|
doi>10.1145/1329125.1329456 |
|
Full text: PDF
|
|
This paper describes an agent system we have built to handle order entry and management issue in business computing and specialized in the telecommunication domain. Our system is implemented using only industry approved standards and it can both manage ...
This paper describes an agent system we have built to handle order entry and management issue in business computing and specialized in the telecommunication domain. Our system is implemented using only industry approved standards and it can both manage complex workflows as well as handle user context dependant service selection. Furthermore, it has the abstract goal representation mechanism, which in turn can detect and reflect to changes in its working environment, leading to more accurate capturing of user's requirements. We also detail a number of prototype applications that has been built using our technology and highlight how our approach addresses the business issues around each of these applications. expand
|
|
Web services negotiation in an insurance grid |
|
Shamimabi Paurobally,
Chris van Aart,
Valentina Tamma,
Michael Wooldridge,
Peter van Hapert
|
|
Article No.: 275 |
|
doi>10.1145/1329125.1329457 |
|
Full text: PDF
|
|
There are an increasing number of initiatives for the migration of agents research towards new Internet technologies such as the semantic web, Grid, and web services. On the one hand, service oriented Grid architectures need to support dynamic cooperation, ...
There are an increasing number of initiatives for the migration of agents research towards new Internet technologies such as the semantic web, Grid, and web services. On the one hand, service oriented Grid architectures need to support dynamic cooperation, negotiation and coordination between web services controlling Grid resources, for efficient resource and task allocation and execution. On the other hand, the Grid can facilitate agent communication, life-cycle management, and access to resources for agents. The insurance sector is one such area that would benefit from the automation brought by multi-agent systems techniques in handling claims and detecting fraudulent cases. In this paper, we discuss our work on facilitating dynamic and adaptive negotiation between web and grid services. We describe our deployed approach in an InsuranceGrid, which manages businesses involved in dealing with car damage claims for a number of insurance companies. expand
|
|
The IEEE FIPA approach to integrating software agents and web services |
|
Dominic Greenwood,
Margaret Lyell,
Ashok Mallya,
Hiroki Suguri
|
|
Article No.: 276 |
|
doi>10.1145/1329125.1329458 |
|
Full text: PDF
|
|
In many settings Web services are now perceived as the first choice technology to provide neatly encapsulated functionality for Web-based computation. To date, many standards have been produced and adoption is accelerating across numerous application ...
In many settings Web services are now perceived as the first choice technology to provide neatly encapsulated functionality for Web-based computation. To date, many standards have been produced and adoption is accelerating across numerous application domains. This uptake has long been recognized by members of software agent community with several approaches reported that explore various means of extending the utility of Web services with the autonomous control offered by agents. This paper reports on the recent work of several members of this community to consolidate their approaches into a common specification describing how to seamlessly interconnect FIPA compliant agent systems with W3C compliant Web services. This work has been conducted within the context of the IEEE FIPA Agent and Web Service Integration working group and will be shortly published as a new FIPA specification. expand
|
|
Multi-agent system of Samara region social services based on social passports and smart cards of citizens |
|
Vladimir Vittikh,
Elena Gritsenko,
Oleg Surnin,
Petr Skobelev,
Denis Volhoncev,
Maxim Karavaev,
Mihaii Shamashov,
Alexander Tsarev
|
|
Article No.: 277 |
|
doi>10.1145/1329125.1329459 |
|
Full text: PDF
|
|
Paper presents multi-agent system for social services based on social passport and smart cards of citizens. It describes developed approach based on agents and ontologies, architecture of the system and its specific features. It is shown that application ...
Paper presents multi-agent system for social services based on social passport and smart cards of citizens. It describes developed approach based on agents and ontologies, architecture of the system and its specific features. It is shown that application of multi-agent technology can bring high value and clear benefits for clients in full scale regional e-government systems. expand
|
|
Real-time agent characterization and prediction |
|
H. Van Dyke Parunak,
Sven Brueckner,
Robert Matthews,
John Sauter,
Steve Brophy
|
|
Article No.: 278 |
|
doi>10.1145/1329125.1329460 |
|
Full text: PDF
|
|
Reasoning about agents that we observe in the world is challenging. Our available information is often limited to observations of the agent's external behavior in the past and present. To understand these actions, we need to deduce the agent's internal ...
Reasoning about agents that we observe in the world is challenging. Our available information is often limited to observations of the agent's external behavior in the past and present. To understand these actions, we need to deduce the agent's internal state, which includes not only rational elements (such as intentions and plans), but also emotive ones (such as fear). In addition, we often want to predict the agent's future actions, which are constrained not only by these inward characteristics, but also by the dynamics of the agent's interaction with its environment. BEE (Behavior Evolution and Extrapolation) uses a faster-than-real-time agent-based model of the environment to characterize agents' internal state by evolution against observed behavior, and then predict their future behavior, taking into account the dynamics of their interaction with the environment. expand
|
|
A multi-agent system of evidential reasoning for intelligence analyses |
|
Eric Lindahl,
Stephen O'Hara,
Qiuming Zhu
|
|
Article No.: 279 |
|
doi>10.1145/1329125.1329461 |
|
Full text: PDF
|
|
This paper describes a Multi-Agent System intended for assisting military commanders and intelligence analysts in the discovery and analysis of publicly available information that may have intelligence value (Open Source Intelligence, or OSINT). Our ...
This paper describes a Multi-Agent System intended for assisting military commanders and intelligence analysts in the discovery and analysis of publicly available information that may have intelligence value (Open Source Intelligence, or OSINT). Our system is called Webster, which is a pun on the well-known dictionary and the World Wide Web. An innovative feature of Webster is the trust network that allows for the hierarchical integration of judgements provided by both human and computer agents, and the ability to extend the system by adding new agents that encapsulate a given characterization capability - such as the ability to provide a level of facial recognition on images that may be embedded in web pages. A key challenge is in creating a normalized concept structure or belief frame that all participating agents, at a certain level, can use to focus their analysis and render opinions that can be meaningfully combined with the opinions of other entities in the system. Webster can scale from a single machine to a large interconnection of subject matter experts and special-purpose computer systems by providing proxy agents that act as intermediaries in the system. expand
|
|
Auction-based multi-robot task allocation in COMSTAR |
|
Matthew Hoeing,
Prithviraj Dasgupta,
Plamen Petrov,
Stephen O'Hara
|
|
Article No.: 280 |
|
doi>10.1145/1329125.1329462 |
|
Full text: PDF
|
|
Over the past few years, swarm based systems have emerged as an attractive paradigm for building large scale distributed systems composed of numerous independent but coordinating units. In our previous work, we have developed a protoype system called ...
Over the past few years, swarm based systems have emerged as an attractive paradigm for building large scale distributed systems composed of numerous independent but coordinating units. In our previous work, we have developed a protoype system called COMSTAR (Cooperative Multi-agent Systems for automatic TArget Recognition) using a swarm of unmanned aerial vehicles(UAVs) that is capable of identifying targets in software simulations of reconnaissance operations. Experimental results from the simulations of the COMSTAR system show that task selection among the UAVs is a crucial operation that determines the overall efficiency of the system. Previously described techniques for task selection among swarm units use a centralized server such as a ground control station to coordinate the activities of the swarm units. However, such systems are not truly distributed since the behavior of the swarm units is predominantly directed by the centralized server's task allocation algorithm. In this paper we focus on the problem of distributed task selection in a swarmed system where each swarm unit decides on the tasks it will execute by sharing information and coordinating its actions with other swarm units without the intervention of a centralized ground control station supervising its activities. Specifically, we build our task selection algorithm on an auction-based algorithm for task selection in robotic swarms described by Kalra et al. We report experimental results in a simulated environment with 18 robots and 20 tasks and compare the performance of our auction-based algorithm with other heuristic-based task selection strategies in multi-agent swarms. Our simulation results show that the auction-based algorithm improves the task completion times by 30-60% and reduces the communication overhead by as much as 90% with respect to other heuristic-based strategies maintaining similar performance in load balancing. expand
|
|
The human agent virtual environment |
|
Michael Papasimeon,
Adrian R. Pearce,
Simon Goss
|
|
Article No.: 281 |
|
doi>10.1145/1329125.1329463 |
|
Full text: PDF
|
|
In this paper we describe a multi-agent simulation called the Human Agent Virtual Environment (or HAVE). HAVE is a test bed to explore agent-environment interaction in multi-agent simulation for defence applications. The primary research driver in the ...
In this paper we describe a multi-agent simulation called the Human Agent Virtual Environment (or HAVE). HAVE is a test bed to explore agent-environment interaction in multi-agent simulation for defence applications. The primary research driver in the development of HAVE is to explore representations of virtual environments in which both humans and agents are situated, perceive these environments and undertake meaningful and appropriate actions. HAVE models and simulates a Close Air Support (CAS) mission which involves fighter or strike aircraft providing support to ground troops through the use of air-to-ground weapons. This provides a realistic and currently extremely relevant domain in which to explore agent-environment interactions. Three important research challenges have been addressed by the work. The first, is the implementation of a multi-modal representation of the virtual environment, having multiple, parallel yet consistent representations of the virtual world that were accessible to, and tailored for the different simulation components. The second is the used of labeled annotations in the virtual world which the agents could easily access and interpret. The third, the use of an appropriate architecture for capturing and representing interaction between agents and the environment they are situated in. expand
|