Partner selection in self-organised wireless sensor networks for opportunistic energy negotiation: A multi-armed bandit based approach
Partner selection in self-organised wireless sensor networks for opportunistic energy negotiation: A multi-armed bandit based approach
The proliferation of “Things” over a network creates the Internet of Things (IoT), where sensors integrate to collect data from the environment over long periods of time. The growth of IoT applications will inevitably involve co-locating multiple wireless sensor networks, each serving different applications with, possibly, different needs and constraints. Since energy is scarce in sensor nodes equipped with non-rechargeable batteries, energy harvesting technologies have been the focus of research in recent years. However, new problems arise as a result of their wide spatio-temporal variation. Such a shortcoming can be avoided if co-located networks cooperate with each other and share their available energy. Due to their unique characteristics and different owners, recently, we proposed a negotiation approach to deal with conflict of preferences. Unfortunately, negotiation can be impractical with a large number of participants, especially in an open environment. Given this, we introduce a new partner selection technique based on multi-armed bandits (MAB), that enables each node to learn the strategy that optimises its energy resources in the long term. Our results show that the proposed solution allows networks to repeatedly learn the current best energy partner in a dynamic environment. The performance of such a technique is evaluated through simulation and shows that a network can achieve an efficiency of 72% against the optimal strategy in the most challenging scenario studied in this work.
Agent-based sensor network, Automated negotiation, Energy management, Multi-armed bandit based learning, Reinforcement Learning, Wireless sensor networks
Ortega, Andre P.
f7290834-ef51-4e94-b8b0-d164485482c0
Ramchurn, Sarvapali
1d62ae2a-a498-444e-912d-a6082d3aaea3
Tran-Thanh, Long
e0666669-d34b-460e-950d-e8b139fab16c
Merrett, Geoff
89b3a696-41de-44c3-89aa-b0aa29f54020
1 March 2021
Ortega, Andre P.
f7290834-ef51-4e94-b8b0-d164485482c0
Ramchurn, Sarvapali
1d62ae2a-a498-444e-912d-a6082d3aaea3
Tran-Thanh, Long
e0666669-d34b-460e-950d-e8b139fab16c
Merrett, Geoff
89b3a696-41de-44c3-89aa-b0aa29f54020
Ortega, Andre P., Ramchurn, Sarvapali, Tran-Thanh, Long and Merrett, Geoff
(2021)
Partner selection in self-organised wireless sensor networks for opportunistic energy negotiation: A multi-armed bandit based approach.
Ad Hoc Networks, 112, [102354].
(doi:10.1016/j.adhoc.2020.102354).
Abstract
The proliferation of “Things” over a network creates the Internet of Things (IoT), where sensors integrate to collect data from the environment over long periods of time. The growth of IoT applications will inevitably involve co-locating multiple wireless sensor networks, each serving different applications with, possibly, different needs and constraints. Since energy is scarce in sensor nodes equipped with non-rechargeable batteries, energy harvesting technologies have been the focus of research in recent years. However, new problems arise as a result of their wide spatio-temporal variation. Such a shortcoming can be avoided if co-located networks cooperate with each other and share their available energy. Due to their unique characteristics and different owners, recently, we proposed a negotiation approach to deal with conflict of preferences. Unfortunately, negotiation can be impractical with a large number of participants, especially in an open environment. Given this, we introduce a new partner selection technique based on multi-armed bandits (MAB), that enables each node to learn the strategy that optimises its energy resources in the long term. Our results show that the proposed solution allows networks to repeatedly learn the current best energy partner in a dynamic environment. The performance of such a technique is evaluated through simulation and shows that a network can achieve an efficiency of 72% against the optimal strategy in the most challenging scenario studied in this work.
Text
AdHocNetworks-AOSRLTGM
- Accepted Manuscript
More information
Accepted/In Press date: 3 November 2020
e-pub ahead of print date: 7 November 2020
Published date: 1 March 2021
Keywords:
Agent-based sensor network, Automated negotiation, Energy management, Multi-armed bandit based learning, Reinforcement Learning, Wireless sensor networks
Identifiers
Local EPrints ID: 445733
URI: http://eprints.soton.ac.uk/id/eprint/445733
ISSN: 1570-8705
PURE UUID: c8133fa1-2344-491b-971b-e916318b063f
Catalogue record
Date deposited: 06 Jan 2021 17:44
Last modified: 17 Mar 2024 06:08
Export record
Altmetrics
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics