A Centralized Routing for Lifetime and Energy Optimization in WSNs Using Genetic Algorithm and Least-Square Policy Iteration

Q-learning has been primarily used as one of the reinforcement learning (RL) techniques to find the optimal routing path in wireless sensor networks (WSNs). However, for centralized RL-based routing protocols with large state and action spaces, the baseline Q-learning used to implement th...

Bibliographic Details
Main Authors: Elvis Obi, Zoubir Mammeri, Okechukwu E. Ochia
Format: Article
Language: English
Published: MDPI AG 2023-01-01
Series: Computers
Online Access: https://www.mdpi.com/2073-431X/12/2/22