The University of Southampton
University of Southampton Institutional Repository

Incremental search algorithms for on-line planning

Incremental search algorithms for on-line planning
Incremental search algorithms for on-line planning

An on-line planning problem is one where an agent must optimise some objective criterion by making a sequence of action selection decisions, where the time and resources used in making decisions count in assessing overall solution quality.  Typically in these problems it is  not possible to find an optimal complete solution before an initial action must be executed, instead to maximise its performance the agent must interleave decision making and execution.  This thesis investigates using decision theoretic techniques to solve these problems by equipping the agent with the ability to reason about the “complexity induced” uncertainties in its information and the costs of computation.

The major sub-problems such an agent must solve are:  i) decision making – how to make decisions whilst in a state of “complexity induced uncertainty”, ii) search control – which node to expand next, iii) stopping – when to stop searching.  Decision making is treated as a value estimation problem.  By representing the agent’s uncertainty in probabilistic terms this can be solved using decision theoretic techniques. Existing decision making systems are analysed and new low computational cost approximate decision theoretic algorithms developed.  These are shown to give significant improvements in decision quality.  Search control is also treated as an estimation problem, in this case estimating the expected value of computation (EVC), which is the expected benefit of a computation in reducing the agent’s uncertainty (hence improving action selection).  New, low computational cost, approximations for the EVC are also developed and shown to give significant improvements in decision quality.  Sophisticated stopping can be achieved by trading-off the EVC of further search against its computational cost. Experimental results show that integration of the sub-problem solutions is critical to agent performance.  The results also show that (assuming no adverse interactions) improving decision making gives the greatest improvements, with search control and stopping offering more modest benefits.

University of Southampton
Farquhar, Jason D.R
d2500514-537c-4c24-a3bb-93ad05be822e
Farquhar, Jason D.R
d2500514-537c-4c24-a3bb-93ad05be822e

Farquhar, Jason D.R (2004) Incremental search algorithms for on-line planning. University of Southampton, Doctoral Thesis.

Record type: Thesis (Doctoral)

Abstract

An on-line planning problem is one where an agent must optimise some objective criterion by making a sequence of action selection decisions, where the time and resources used in making decisions count in assessing overall solution quality.  Typically in these problems it is  not possible to find an optimal complete solution before an initial action must be executed, instead to maximise its performance the agent must interleave decision making and execution.  This thesis investigates using decision theoretic techniques to solve these problems by equipping the agent with the ability to reason about the “complexity induced” uncertainties in its information and the costs of computation.

The major sub-problems such an agent must solve are:  i) decision making – how to make decisions whilst in a state of “complexity induced uncertainty”, ii) search control – which node to expand next, iii) stopping – when to stop searching.  Decision making is treated as a value estimation problem.  By representing the agent’s uncertainty in probabilistic terms this can be solved using decision theoretic techniques. Existing decision making systems are analysed and new low computational cost approximate decision theoretic algorithms developed.  These are shown to give significant improvements in decision quality.  Search control is also treated as an estimation problem, in this case estimating the expected value of computation (EVC), which is the expected benefit of a computation in reducing the agent’s uncertainty (hence improving action selection).  New, low computational cost, approximations for the EVC are also developed and shown to give significant improvements in decision quality.  Sophisticated stopping can be achieved by trading-off the EVC of further search against its computational cost. Experimental results show that integration of the sub-problem solutions is critical to agent performance.  The results also show that (assuming no adverse interactions) improving decision making gives the greatest improvements, with search control and stopping offering more modest benefits.

Text
982941.pdf - Version of Record
Available under License University of Southampton Thesis Licence.
Download (6MB)

More information

Published date: 2004

Identifiers

Local EPrints ID: 465689
URI: http://eprints.soton.ac.uk/id/eprint/465689
PURE UUID: bf648453-4a2d-49ea-b629-3a91aee475aa

Catalogue record

Date deposited: 05 Jul 2022 02:35
Last modified: 16 Mar 2024 20:19

Export record

Contributors

Author: Jason D.R Farquhar

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×