Incremental search algorithms for on-line planning
Incremental search algorithms for on-line planning
An on-line planning problem is one where an agent must optimise some objective criterion by making a sequence of action selection decisions, where the time and resources used in making decisions count in assessing overall solution quality. Typically in these problems it is not possible to find an optimal complete solution before an initial action must be executed, instead to maximise its performance the agent must interleave decision making and execution. This thesis investigates using decision theoretic techniques to solve these problems by equipping the agent with the ability to reason about the “complexity induced” uncertainties in its information and the costs of computation.
The major sub-problems such an agent must solve are: i) decision making – how to make decisions whilst in a state of “complexity induced uncertainty”, ii) search control – which node to expand next, iii) stopping – when to stop searching. Decision making is treated as a value estimation problem. By representing the agent’s uncertainty in probabilistic terms this can be solved using decision theoretic techniques. Existing decision making systems are analysed and new low computational cost approximate decision theoretic algorithms developed. These are shown to give significant improvements in decision quality. Search control is also treated as an estimation problem, in this case estimating the expected value of computation (EVC), which is the expected benefit of a computation in reducing the agent’s uncertainty (hence improving action selection). New, low computational cost, approximations for the EVC are also developed and shown to give significant improvements in decision quality. Sophisticated stopping can be achieved by trading-off the EVC of further search against its computational cost. Experimental results show that integration of the sub-problem solutions is critical to agent performance. The results also show that (assuming no adverse interactions) improving decision making gives the greatest improvements, with search control and stopping offering more modest benefits.
University of Southampton
Farquhar, Jason D.R
d2500514-537c-4c24-a3bb-93ad05be822e
2004
Farquhar, Jason D.R
d2500514-537c-4c24-a3bb-93ad05be822e
Farquhar, Jason D.R
(2004)
Incremental search algorithms for on-line planning.
University of Southampton, Doctoral Thesis.
Record type:
Thesis
(Doctoral)
Abstract
An on-line planning problem is one where an agent must optimise some objective criterion by making a sequence of action selection decisions, where the time and resources used in making decisions count in assessing overall solution quality. Typically in these problems it is not possible to find an optimal complete solution before an initial action must be executed, instead to maximise its performance the agent must interleave decision making and execution. This thesis investigates using decision theoretic techniques to solve these problems by equipping the agent with the ability to reason about the “complexity induced” uncertainties in its information and the costs of computation.
The major sub-problems such an agent must solve are: i) decision making – how to make decisions whilst in a state of “complexity induced uncertainty”, ii) search control – which node to expand next, iii) stopping – when to stop searching. Decision making is treated as a value estimation problem. By representing the agent’s uncertainty in probabilistic terms this can be solved using decision theoretic techniques. Existing decision making systems are analysed and new low computational cost approximate decision theoretic algorithms developed. These are shown to give significant improvements in decision quality. Search control is also treated as an estimation problem, in this case estimating the expected value of computation (EVC), which is the expected benefit of a computation in reducing the agent’s uncertainty (hence improving action selection). New, low computational cost, approximations for the EVC are also developed and shown to give significant improvements in decision quality. Sophisticated stopping can be achieved by trading-off the EVC of further search against its computational cost. Experimental results show that integration of the sub-problem solutions is critical to agent performance. The results also show that (assuming no adverse interactions) improving decision making gives the greatest improvements, with search control and stopping offering more modest benefits.
Text
982941.pdf
- Version of Record
More information
Published date: 2004
Identifiers
Local EPrints ID: 465689
URI: http://eprints.soton.ac.uk/id/eprint/465689
PURE UUID: bf648453-4a2d-49ea-b629-3a91aee475aa
Catalogue record
Date deposited: 05 Jul 2022 02:35
Last modified: 16 Mar 2024 20:19
Export record
Contributors
Author:
Jason D.R Farquhar
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics