Research on the Application of Deep Reinforcement Learning in Automated Penetration Testing Path Optimization and Decision Making

doi:70517/ijhsa47120

Research article
DOI: https://doi.org/10.70517/ijhsa47120

Volume 47, Issue 1
Pages: 236
-247
Open Access
Download

Research on the Application of Deep Reinforcement Learning in Automated Penetration Testing Path Optimization and Decision Making

By: ^¹, ^¹, ^¹, ^¹, ^¹, ^¹

¹Digital Intelligence Technology Company, PetroChina Xinjiang Oilfield Company, Karamay, Xinjiang, 834000, China

Published: 09/09/2025

Abstract

This paper introduces deep reinforcement learning into automated penetration testing to plan and optimize penetration testing supply and defense paths. After modeling the automated penetration problem, the paper simplifies and evaluates the benefits of the DQN algorithm in deep reinforcement learning, finds the optimal penetration path through sample augmentation, and proposes the MASK-SALT-DQN algorithm. Through simulation experiments, the paper verifies the operation and effectiveness of the algorithm. In both simple and complex scenarios, the MASK-SALT-DQN algorithm achieves the fastest runtime speed, significantly enhancing the agent’s learning efficiency. The algorithm provides accurate evaluation criteria for penetration testing path planning results. Compared to penetration testing learning algorithms based on Nature DQN, the MASK-SALT-DQN algorithm demonstrates a higher convergence value in its learning curve, indicating superior convergence performance.

Keywords: deep reinforcement learning, penetration testing, path optimization, MASK-SALT-DQN

On this page

Research on the Application of Deep Reinforcement Learning in Automated Penetration Testing Path Optimization and Decision Making

Abstract