Reinforcement Learning-Based Multihop Relaying: A Decentralized Q-Learning Approach

Conventional optimization-based relay selection for multihop networks cannot resolve the conflict between performance and cost. The optimal selection policy is centralized and requires local channel state information (CSI) of all hops, leading to high computational complexity and signaling overhead....

Full description

Bibliographic Details
Main Authors: Xiaowei Wang, Xin Wang
Format: Article
Language:English
Published: MDPI AG 2021-10-01
Series:Entropy
Subjects:
Online Access:https://www.mdpi.com/1099-4300/23/10/1310