Deep Q-Learning Based Optimization of VLC Systems With Dynamic Time-Division Multiplexing

The traditional method to solve nondeterministic-polynomial-time (NP)-hard optimization problems is to apply meta-heuristic algorithms. In contrast, Deep Q Learning (DQL) uses memory of experience and deep neural network (DNN) to choose steps and progress towards solving the problem. The dynamic tim...

Full description

Bibliographic Details
Main Authors: Umair F. Siddiqi, Sadiq M. Sait, Murat Uysal
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9130159/