Explainable firewall penetration testing method employing machine learning

Algimantas Venčkauskas; Jevgenijus Toldinas; Nerijus Morkevičius

doi:10.3390/electronics15051030

Title	Explainable firewall penetration testing method employing machine learning
Authors	Venčkauskas, Algimantas ; Toldinas, Jevgenijus ; Morkevičius, Nerijus
DOI	10.3390/electronics15051030
Full Text
Is Part of	Electronics.. Basel : MDPI. 2026, vol. 15, iss. 5, art. no. 1030, p. 1-24.. ISSN 2079-9292
Keywords [eng]	penetration testing ; firewall ; machine learning ; explainability ; ontology application
Abstract [eng]	Cyber adversaries are becoming more sophisticated, creating complex security challenges as digital services expand. The reliability of the firewall is of the utmost importance in the context of network security since it serves as the first line of protection. Penetration testing is an approach used to evaluate the reliability of a firewall and improve security by uncovering exploitable flaws. Frequently, penetration testing solutions are developed using machine learning, and it is of the utmost importance to explain the obtained results during the penetration testing. The emergence of explainable AI (XAI) addresses transparency in ML models, which is essential for informed cybersecurity decisions. Additionally, effective penetration testing reports are crucial for organizations, helping them comprehend and address vulnerabilities with tailored mitigation strategies. This study contributes to firewall security by developing an explainable penetration testing method, which includes two machine learning classification models: a binary model for detecting attacks and a multiclass model for identifying attack types with an explainability feature. This research introduces a novel explainability method that emphasizes significant features related to attack types based on multiclass predictions and proposes an approach using the extended System Security Assurance Ontology (SSAO) to clarify vulnerabilities and suggest alternative mitigation strategies. After evaluating numerous ML algorithms for the CIC-IDS2017 dataset, the Fine Tree model was considered to have the greatest performance. For the binary model, it achieved a validation accuracy of 99.7%, while for the multiclass model, it achieved a validation accuracy of 99.6%. Both models were used to test the firewall for vulnerabilities. Firewall penetration testing using the binary model achieves an accuracy of 82.1%, while the multiclass model achieves an accuracy of 78.7%.
Published	Basel : MDPI
Type	Journal article
Language	English
Publication date	2026
CC license

„Explainable firewall penetration testing method employing machine learning“