Cascaded Reinforcement Learning Agents for Large Action Spaces in Autonomous Penetration Testing

Organised attacks on a computer system to test existing defences, i.e., penetration testing, have been used extensively to evaluate network security. However, penetration testing is a time-consuming process. Additionally, establishing a strategy that resembles a real cyber-attack typically requires...

Full description

Bibliographic Details
Main Authors:	Khuong Tran, Maxwell Standen, Junae Kim, David Bowman, Toby Richer, Ashlesha Akella, Chin-Teng Lin
Format:	Article
Language:	English
Published:	MDPI AG 2022-11-01
Series:	Applied Sciences
Subjects:	cybersecurity penetration testing deep reinforcement learning large action space
Online Access:	https://www.mdpi.com/2076-3417/12/21/11265

_version_	1797469168358916096
author	Khuong Tran Maxwell Standen Junae Kim David Bowman Toby Richer Ashlesha Akella Chin-Teng Lin
author_facet	Khuong Tran Maxwell Standen Junae Kim David Bowman Toby Richer Ashlesha Akella Chin-Teng Lin
author_sort	Khuong Tran
collection	DOAJ
description	Organised attacks on a computer system to test existing defences, i.e., penetration testing, have been used extensively to evaluate network security. However, penetration testing is a time-consuming process. Additionally, establishing a strategy that resembles a real cyber-attack typically requires in-depth knowledge of the cybersecurity domain. This paper presents a novel architecture, named deep cascaded reinforcement learning agents, or CRLA, that addresses large discrete action spaces in an autonomous penetration testing simulator, where the number of actions exponentially increases with the complexity of the designed cybersecurity network. Employing an algebraic action decomposition strategy, CRLA is shown to find the optimal attack policy in scenarios with large action spaces faster and more stably than a conventional deep Q-learning agent, which is commonly used as a method for applying artificial intelligence to autonomous penetration testing.
first_indexed	2024-03-09T19:17:37Z
format	Article
id	doaj.art-eff9027eecda4bf7a1eb045a0d7bfb80
institution	Directory Open Access Journal
issn	2076-3417
language	English
last_indexed	2024-03-09T19:17:37Z
publishDate	2022-11-01
publisher	MDPI AG
record_format	Article
series	Applied Sciences
spelling	doaj.art-eff9027eecda4bf7a1eb045a0d7bfb802023-11-24T03:40:48ZengMDPI AGApplied Sciences2076-34172022-11-0112211126510.3390/app122111265Cascaded Reinforcement Learning Agents for Large Action Spaces in Autonomous Penetration TestingKhuong Tran0Maxwell Standen1Junae Kim2David Bowman3Toby Richer4Ashlesha Akella5Chin-Teng Lin6School of Computer Science, University of Technology Sydney, Ultimo, NSW 2007, AustraliaDefence Science and Technology Group, Edinburgh, SA 5111, AustraliaDefence Science and Technology Group, Edinburgh, SA 5111, AustraliaDefence Science and Technology Group, Edinburgh, SA 5111, AustraliaDefence Science and Technology Group, Edinburgh, SA 5111, AustraliaSchool of Computer Science, University of Technology Sydney, Ultimo, NSW 2007, AustraliaSchool of Computer Science, University of Technology Sydney, Ultimo, NSW 2007, AustraliaOrganised attacks on a computer system to test existing defences, i.e., penetration testing, have been used extensively to evaluate network security. However, penetration testing is a time-consuming process. Additionally, establishing a strategy that resembles a real cyber-attack typically requires in-depth knowledge of the cybersecurity domain. This paper presents a novel architecture, named deep cascaded reinforcement learning agents, or CRLA, that addresses large discrete action spaces in an autonomous penetration testing simulator, where the number of actions exponentially increases with the complexity of the designed cybersecurity network. Employing an algebraic action decomposition strategy, CRLA is shown to find the optimal attack policy in scenarios with large action spaces faster and more stably than a conventional deep Q-learning agent, which is commonly used as a method for applying artificial intelligence to autonomous penetration testing.https://www.mdpi.com/2076-3417/12/21/11265cybersecuritypenetration testingdeep reinforcement learninglarge action space
spellingShingle	Khuong Tran Maxwell Standen Junae Kim David Bowman Toby Richer Ashlesha Akella Chin-Teng Lin Cascaded Reinforcement Learning Agents for Large Action Spaces in Autonomous Penetration Testing Applied Sciences cybersecurity penetration testing deep reinforcement learning large action space
title	Cascaded Reinforcement Learning Agents for Large Action Spaces in Autonomous Penetration Testing
title_full	Cascaded Reinforcement Learning Agents for Large Action Spaces in Autonomous Penetration Testing
title_fullStr	Cascaded Reinforcement Learning Agents for Large Action Spaces in Autonomous Penetration Testing
title_full_unstemmed	Cascaded Reinforcement Learning Agents for Large Action Spaces in Autonomous Penetration Testing
title_short	Cascaded Reinforcement Learning Agents for Large Action Spaces in Autonomous Penetration Testing
title_sort	cascaded reinforcement learning agents for large action spaces in autonomous penetration testing
topic	cybersecurity penetration testing deep reinforcement learning large action space
url	https://www.mdpi.com/2076-3417/12/21/11265
work_keys_str_mv	AT khuongtran cascadedreinforcementlearningagentsforlargeactionspacesinautonomouspenetrationtesting AT maxwellstanden cascadedreinforcementlearningagentsforlargeactionspacesinautonomouspenetrationtesting AT junaekim cascadedreinforcementlearningagentsforlargeactionspacesinautonomouspenetrationtesting AT davidbowman cascadedreinforcementlearningagentsforlargeactionspacesinautonomouspenetrationtesting AT tobyricher cascadedreinforcementlearningagentsforlargeactionspacesinautonomouspenetrationtesting AT ashleshaakella cascadedreinforcementlearningagentsforlargeactionspacesinautonomouspenetrationtesting AT chintenglin cascadedreinforcementlearningagentsforlargeactionspacesinautonomouspenetrationtesting

Cascaded Reinforcement Learning Agents for Large Action Spaces in Autonomous Penetration Testing

Similar Items