A New Efficient Method for Refining the Reinforcement Learning Algorithm to Train Deep Q Agents for Reaching a Consensus in P2P Networks

The usage of distributed Peer-to-Peer (P2P) networks has been growing steadily for a reasonable period. Various applications rely on the infrastructure of P2P networks, where nodes communicate to accomplish a task without the need for a central authority. One of the significant challenges in P2P net...

Full description

Bibliographic Details
Main Authors: Arafat Abu Mallouh, Zakariya Qawaqneh, Omar Abuzaghleh, Ahmad Al-Rababa'A
Format: Article
Language:English
Published: IEEE 2023-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10105270/
_version_ 1797840085672001536
author Arafat Abu Mallouh
Zakariya Qawaqneh
Omar Abuzaghleh
Ahmad Al-Rababa'A
author_facet Arafat Abu Mallouh
Zakariya Qawaqneh
Omar Abuzaghleh
Ahmad Al-Rababa'A
author_sort Arafat Abu Mallouh
collection DOAJ
description The usage of distributed Peer-to-Peer (P2P) networks has been growing steadily for a reasonable period. Various applications rely on the infrastructure of P2P networks, where nodes communicate to accomplish a task without the need for a central authority. One of the significant challenges in P2P networks is the ability of the network nodes to reach a consensus on a shared item; the challenge increases as time passes. Thus, this work proposes a new effective method for tweaking the Deep Reinforcement Learning (DRL) algorithm to train Deep Q Network (DQN) learning agents to reach a consensus among the P2P nodes. We propose various hierarchies of deep agents to address this crucial challenge in P2P networks. DRL is utilized to build and train agents; more precisely, DQN learning agents are constructed and trained. Two scenarios are proposed and evaluated. In the first scenario, one DQN agent is trained to find the consensus between the network nodes. In the second scenario, three hierarchies with different numbers of layers of agents are proposed and evaluated. In both scenarios, the P2P network used is a blockchain network. The best result was obtained using the third hierarchy of the second scenario; the overall accuracy of the model is 87.8%.
first_indexed 2024-04-09T16:08:46Z
format Article
id doaj.art-876490b019314715a090da5ec5bec6d0
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-04-09T16:08:46Z
publishDate 2023-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-876490b019314715a090da5ec5bec6d02023-04-24T23:00:43ZengIEEEIEEE Access2169-35362023-01-0111386653867910.1109/ACCESS.2023.326828310105270A New Efficient Method for Refining the Reinforcement Learning Algorithm to Train Deep Q Agents for Reaching a Consensus in P2P NetworksArafat Abu Mallouh0https://orcid.org/0000-0002-5953-1506Zakariya Qawaqneh1Omar Abuzaghleh2Ahmad Al-Rababa'A3https://orcid.org/0000-0003-3995-8956Computer Science Department, Manhattan College, Riverdale, NY, USADepartment of Computing Sciences, State University of New York Brockport, Brockport, NY, USADepartment of Computer Information Science, Higher Colleges of Technology, Dubai, United Arab EmiratesComputer Science Department, The World Islamic Science and Education University, Amman, JordanThe usage of distributed Peer-to-Peer (P2P) networks has been growing steadily for a reasonable period. Various applications rely on the infrastructure of P2P networks, where nodes communicate to accomplish a task without the need for a central authority. One of the significant challenges in P2P networks is the ability of the network nodes to reach a consensus on a shared item; the challenge increases as time passes. Thus, this work proposes a new effective method for tweaking the Deep Reinforcement Learning (DRL) algorithm to train Deep Q Network (DQN) learning agents to reach a consensus among the P2P nodes. We propose various hierarchies of deep agents to address this crucial challenge in P2P networks. DRL is utilized to build and train agents; more precisely, DQN learning agents are constructed and trained. Two scenarios are proposed and evaluated. In the first scenario, one DQN agent is trained to find the consensus between the network nodes. In the second scenario, three hierarchies with different numbers of layers of agents are proposed and evaluated. In both scenarios, the P2P network used is a blockchain network. The best result was obtained using the third hierarchy of the second scenario; the overall accuracy of the model is 87.8%.https://ieeexplore.ieee.org/document/10105270/Blockchainconsensusdeep reinforcement learningDQNP2P
spellingShingle Arafat Abu Mallouh
Zakariya Qawaqneh
Omar Abuzaghleh
Ahmad Al-Rababa'A
A New Efficient Method for Refining the Reinforcement Learning Algorithm to Train Deep Q Agents for Reaching a Consensus in P2P Networks
IEEE Access
Blockchain
consensus
deep reinforcement learning
DQN
P2P
title A New Efficient Method for Refining the Reinforcement Learning Algorithm to Train Deep Q Agents for Reaching a Consensus in P2P Networks
title_full A New Efficient Method for Refining the Reinforcement Learning Algorithm to Train Deep Q Agents for Reaching a Consensus in P2P Networks
title_fullStr A New Efficient Method for Refining the Reinforcement Learning Algorithm to Train Deep Q Agents for Reaching a Consensus in P2P Networks
title_full_unstemmed A New Efficient Method for Refining the Reinforcement Learning Algorithm to Train Deep Q Agents for Reaching a Consensus in P2P Networks
title_short A New Efficient Method for Refining the Reinforcement Learning Algorithm to Train Deep Q Agents for Reaching a Consensus in P2P Networks
title_sort new efficient method for refining the reinforcement learning algorithm to train deep q agents for reaching a consensus in p2p networks
topic Blockchain
consensus
deep reinforcement learning
DQN
P2P
url https://ieeexplore.ieee.org/document/10105270/
work_keys_str_mv AT arafatabumallouh anewefficientmethodforrefiningthereinforcementlearningalgorithmtotraindeepqagentsforreachingaconsensusinp2pnetworks
AT zakariyaqawaqneh anewefficientmethodforrefiningthereinforcementlearningalgorithmtotraindeepqagentsforreachingaconsensusinp2pnetworks
AT omarabuzaghleh anewefficientmethodforrefiningthereinforcementlearningalgorithmtotraindeepqagentsforreachingaconsensusinp2pnetworks
AT ahmadalrababaa anewefficientmethodforrefiningthereinforcementlearningalgorithmtotraindeepqagentsforreachingaconsensusinp2pnetworks
AT arafatabumallouh newefficientmethodforrefiningthereinforcementlearningalgorithmtotraindeepqagentsforreachingaconsensusinp2pnetworks
AT zakariyaqawaqneh newefficientmethodforrefiningthereinforcementlearningalgorithmtotraindeepqagentsforreachingaconsensusinp2pnetworks
AT omarabuzaghleh newefficientmethodforrefiningthereinforcementlearningalgorithmtotraindeepqagentsforreachingaconsensusinp2pnetworks
AT ahmadalrababaa newefficientmethodforrefiningthereinforcementlearningalgorithmtotraindeepqagentsforreachingaconsensusinp2pnetworks