The University of Southampton

×

Solving average cost Markov decision processes by means of a two-phase time aggregation algorithm

Arruda, E. F. and Fragoso, M. D. (2015) Solving average cost Markov decision processes by means of a two-phase time aggregation algorithm. European Journal of Operational Research, 240 (3), 697-705. (doi:10.1016/j.ejor.2014.08.023).

Record type: Article

Abstract

This paper introduces a two-phase approach to solve average cost Markov decision processes, which is based on state space embedding or time aggregation. In the first phase, time aggregation is applied for policy optimization in a prescribed subset of the state space, and a novel result is applied to expand the evaluation to the whole state space. This evaluation is then used in the second phase in a policy improvement step, and the two phases are then alternated until convergence is attained. Some numerical experiments illustrate the results.

This record has no associated files available for download.

More information

Published date: 1 February 2015

Keywords: Dynamic programming, Embedding, Markov decision processes, Stochastic optimal control, Time aggregation

Identifiers

Local EPrints ID: 446040

URI: http://eprints.soton.ac.uk/id/eprint/446040

DOI: doi:10.1016/j.ejor.2014.08.023

ISSN: 0377-2217

PURE UUID: 1cf1528f-040e-4ac4-bc43-b831062ae521

ORCID for E. F. Arruda:

orcid.org/0000-0002-9835-352X

Catalogue record

Date deposited: 19 Jan 2021 17:33

Last modified: 06 Jun 2024 02:09

Export record

Altmetrics

Share this record

Share this on Facebook Share this on Twitter Share this on Weibo

Contributors

Author: E. F. Arruda

Author: M. D. Fragoso

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Library staff additional information

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×