Evaluating reinforcement learning for game theory application
learning to price airline seats under competition
Evaluating reinforcement learning for game theory application
learning to price airline seats under competition
Applied Game Theory has been criticised for not being able to model real decision making situations. A game's sensitive nature and the difficultly in determining the utility payoff functions make it hard for a decision maker to rely upon any game theoretic results. Therefore the models tend to be simple due to the complexity of solving them (i.e. finding the equilibrium).
In recent years, due to the increases of computing power, different computer modelling techniques have been applied in Game Theory. A major example is Artificial Intelligence methods e.g. Genetic Algorithms, Neural Networks and Reinforcement Learning (RL). These techniques allow the modeller to incorporate Game Theory within their models (or simulation) without necessarily knowing the optimal solution. After a warm up period of repeated episodes is run, the model learns to
play the game well (though not necessarily optimally). This is a form of simulation-optimization.
The objective of the research is to investigate the practical usage of RL within a simple sequential stochastic airline seat pricing game. Different forms of RL are considered and compared to the optimal policy, which is found using standard dynamic programming techniques. The airline game and RL methods displays various interesting phenomena, which are also discussed. For completeness, convergence proofs for
the RL algorithms were constructed.
Collins, Andrew
c36a17aa-3b87-48dc-bc7d-ba8bbf769cbf
January 2009
Collins, Andrew
c36a17aa-3b87-48dc-bc7d-ba8bbf769cbf
Thomas, Lyn
a3ce3068-328b-4bce-889f-965b0b9d2362
Collins, Andrew
(2009)
Evaluating reinforcement learning for game theory application
learning to price airline seats under competition.
University of Southampton, School of Management, Doctoral Thesis, 266pp.
Record type:
Thesis
(Doctoral)
Abstract
Applied Game Theory has been criticised for not being able to model real decision making situations. A game's sensitive nature and the difficultly in determining the utility payoff functions make it hard for a decision maker to rely upon any game theoretic results. Therefore the models tend to be simple due to the complexity of solving them (i.e. finding the equilibrium).
In recent years, due to the increases of computing power, different computer modelling techniques have been applied in Game Theory. A major example is Artificial Intelligence methods e.g. Genetic Algorithms, Neural Networks and Reinforcement Learning (RL). These techniques allow the modeller to incorporate Game Theory within their models (or simulation) without necessarily knowing the optimal solution. After a warm up period of repeated episodes is run, the model learns to
play the game well (though not necessarily optimally). This is a form of simulation-optimization.
The objective of the research is to investigate the practical usage of RL within a simple sequential stochastic airline seat pricing game. Different forms of RL are considered and compared to the optimal policy, which is found using standard dynamic programming techniques. The airline game and RL methods displays various interesting phenomena, which are also discussed. For completeness, convergence proofs for
the RL algorithms were constructed.
Text
Final_Phd_thesis_-_Andrew_Collins_January_2009.pdf
- Other
More information
Published date: January 2009
Organisations:
University of Southampton
Identifiers
Local EPrints ID: 69751
URI: http://eprints.soton.ac.uk/id/eprint/69751
PURE UUID: 376427ff-4b80-4144-8513-afb4fa688ce2
Catalogue record
Date deposited: 03 Dec 2009
Last modified: 13 Mar 2024 19:44
Export record
Contributors
Author:
Andrew Collins
Thesis advisor:
Lyn Thomas
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics