5 A Mobile Agent Framework to Support Distributed Information Management

"I had to quit my job to have time to read my email" - Adam Curry

5.1 Requirements

The last chapter has demonstrated that current mobile agent systems, whilst providing the necessary tools to support the migration of agents between network nodes and the communication between agents, do not provide a direct solution for addressing the problem of information overload.

Therefore, a framework is needed that will address a number of issues, simultaneously, to ensure that the requirements of a distributed information management system are met. This chapter presents the potential requirements of such a system in the context of a mobile agent architecture and also details the initial framework and components required to support mobile agents that perform distributed information management tasks.

KQML (Finin et al., 1995) is a high-level protocol and language for agent to service and agent to agent interaction and communication, commonly termed Agent Communication Languages (ACL). It is based around a notion derived from speech act theory in linguistics (Austin, 1962) where messages that are communicated consist of performatives indicating what the receiver should do with the message, for example, tell. Currently, KQML provides performatives which deal with belief revision, querying, knowledge-base maintenance, actions and services. Also, KQML has the ability to nest performatives, such that more specialised performatives can be built from more general-purpose performatives.

KIF (Genesereth et al., 1992) is a rich language that provides a standard representation of information. It can express beliefs, rules, facts and partial descriptions of functions amongst other things. However, KIF is used for more than just a information representation; it also provides a central language for communication between other ACLs. Translating between n ACLs and a central information language such as KIF, reduces the number of translators required from O(n²) to O(n+1).

The advantages of using ACLs and standard information representations are clear; if all agents within the system conform to an interchange standard, then the amount of data translation is reduced and the introduction of translation errors into the data are lessened.

5.1.5 Communication

Another aspect that needs to be considered is how agents are to name and reference other agents? It is obviously not suitable to reference agents through network or machine address, due to the fact that they may move frequently. Additionally, some form of registration needs to take place when the agent moves, so that its current address can be resolved and located. Finin (Finin et al., 1995b) advocates the use of a hierarchical naming convention, not dissimilar to the Domain Naming Service (DNS). Each agent domain is responsible for maintaining a portion of the agent naming tree and addressing is resolved by agent name servers. More importantly, they illustrate the use of agent proxies to provide protocol gateways and firewalls.

The Heterogeneous Communication Model (Goose, 1995; Goose, 1996) provides a similar mechanism for agents to communicate with other agents through an addressing mechanism; as agents transport to a domain, they register their presence with a central router. The form of this registration determines which types of messages the agents is willing to receive: this is analogous to offering server-style functions to other clients. In an example scenario, an agent could register once to indicate that it had information on a particular subject and again to say that it required information on another subject. Due to the configurable and modular nature of the HCM, these concepts can be extended over domains by adopting a DNS-like architecture.

5.1.6 Heterogeneity

To ensure that a mobile agent system is generic enough to suit a wide range of applications, it must possess heterogeneity in the following areas:

Platform. Architecture and operating system heterogeneity can be achieved by transmitting agents as source code and interpreting them at their destinations, or by pre-compiling them into byte code and interpreting the byte code at their destinations. The latter is more privacy-oriented, since it might not be prudent to allow the source code of an agent to be inspected. Also, byte code is less bulky than source code, can be interpreted faster and can be authenticated easier than either machine code or source code.
Network. To ensure that the agent has access to as wide a range of resources as possible, it should support interchangeable network protocols. This can be achieved at a network level through the use of gateways, which equip agents with a suitable protocol module that allows them to exist on the other side of the gateway.
Language. Agents should be able to be written in any language that is suitable to their task or with which the programmer is most familiar. However, to ensure that the agent offers the maximum flexibility, it must be transmitted in a form that can be interpreted rather than compiled. This has the implication that an interpreter has to be written or modified for each target language, to support the extra functionality of the mobile agent system. If this is not an option for some languages, then it may be that some will not be able to support all the features that are available, for example, an agent that has been written and compiled in C would not be heterogeneous enough to migrate to different platforms.

The advantages of heterogeneity for distributed information management agents are clear (White, 1994b); it broadens the scope of information resources that agents can access and control on behalf of their user and allows the agents to integrate with more distributed resources and applications.

5.1.7 Security

To ensure that agents can do no harm to their environment or to another agent, a number of security considerations need to be taken into account:

Permission. The actions that an agent is able to perform should be regulated against a permission set for the intended recipient to ensure that they are allowed within the current context. This means that the permission set used to protect objects must be rich and flexible enough to ensure that its integrity is maintained from unauthorised accesses. Additionally, different nodes may assign different permission sets to similar objects, for example, on a node where security is very tight, all query accesses must be checked.
Authentication. Before a permission system can be implemented, the identity of the agent must be confirmed and its point or origin established. If neither of these points can be determined, then the agent may be granted the lowest access permission or removed from the node, as detailed previously. However, authentication may be required once per access (for high security systems) or once per session (for lower security systems).
Transmission. While the agent is in transit, it is vulnerable either to being modified in order to change its original purpose or to being duplicated to obtain its authentication signature. Therefore, when the agent is transmitted, it needs to be protected in some way. Encryption mechanisms such as PGP (Garfinkel, 1994) can be used to protect the agent at a data level, and mechanisms such as the Secure Sockets Layer (SSL) (Freier et al., 1995) at a network level.
Verification. When the agent arrives at a node, certain verification tests can be made upon its byte code to ensure that it will neither perform malicious acts nor prohibited acts. This type of verification is performed rigorously by Telescript engines and less complete security is provided by the validation component of a Java-aware browser.
Charging. As an agent interrogates information, it may be charged for access on the amount of data it requests, or once per access. Either way, some system needs to be employed to ensure that the agent has funds to pay for services, that those funds belong to the agent and the agent belongs to the right user. Existing technologies, such as the Electronic Cash Unit (ECU) (Chaum, 1992) can ensure that mobile agents have to pay for the services that they use.

The discussion surrounding the security of mobile agents is currently a topic which generates much debate (Chess et al., 1995). There is genuine concern that mobile agents could present a fundamental problem to networks and systems unless their movement and execution are closely monitored and regulated. It is obvious that some forms of control will have to be implemented to ensure that mobile agents cannot execute on prohibited nodes, cannot monopolise resources or cannot perform malicious acts across network nodes.

One policy involves giving an agent exactly the right amount of permission required to complete its pre-authorised task. If an agent requires more permission, it must apply to the regulatory body of the node. Access or denial will depend upon a number of factors; the ability of the agent to pay for the new permission and the trustworthiness of the agent in the past, as examples.

5.1.8 Summary

The requirements that have been previously outlined are considered the minimum necessary to build an initial framework for mobile agents to support distributed information management tasks.

It is hoped that requirements such as arbitrary migration and ACLs will give the programmer flexibility when writing agents, heterogeneity and communication will allow an agent to reach as many distributed information resources as possible, and security and fault tolerance will make the mobile agent system both trustworthy and robust.

5.2 The Framework

The framework that is to be presented is based around an agent-oriented model; everything in the system is abstracted through agents (figure 5.1). The system is comprised of both static and mobile agents; static agents provide resources and facilities to mobile agents and mobile agents move between network domains taking advantage of these resources to fulfil their goals.