Ensemble‐based classification using neural networks and machine learning models for windows pe malware detection

Robertas Damaševičius; Algimantas Venčkauskas; Jevgenijus Toldinas; Šarūnas Grigaliūnas

doi:10.3390/electronics10040485

Title	Ensemble‐based classification using neural networks and machine learning models for windows pe malware detection
Authors	Damaševičius, Robertas ; Venčkauskas, Algimantas ; Toldinas, Jevgenijus ; Grigaliūnas, Šarūnas
DOI	10.3390/electronics10040485
Full Text
Is Part of	Electronics.. Basel : MDPI. 2021, vol. 10, iss. 4, art. no. 485, p. 1-23.. ISSN 2079-9292
Keywords [eng]	Applied machine learning ; Ensemble classification ; Malware analysis and detection ; Mobile security ; Neural network
Abstract [eng]	The security of information is among the greatest challenges facing organizations and institutions. Cybercrime has risen in frequency and magnitude in recent years, with new ways to steal, change and destroy information or disable information systems appearing every day. Among the types of penetration into the information systems where confidential information is processed is malware. An attacker injects malware into a computer system, after which he has full or partial access to critical information in the information system. This paper proposes an ensemble classification‐based methodology for malware detection. The first‐stage classification is performed by a stacked ensemble of dense (fully connected) and convolutional neural networks (CNN), while the final stage classification is performed by a meta‐learner. For a meta‐learner, we explore and compare 14 classifiers. For a baseline comparison, 13 machine learning methods are used: K‐Nearest Neighbors, Linear Support Vector Machine (SVM), Radial basis function (RBF) SVM, Random Forest, Ada‐ Boost, Decision Tree, ExtraTrees, Linear Discriminant Analysis, Logistic, Neural Net, Passive Classifier, Ridge Classifier and Stochastic Gradient Descent classifier. We present the results of experiments performed on the Classification of Malware with PE headers (ClaMP) dataset. The best performance is achieved by an ensemble of five dense and CNN neural networks, and the ExtraTrees classifier as a meta‐learner.
Published	Basel : MDPI
Type	Journal article
Language	English
Publication date	2021
CC license

„Ensemble‐based classification using neural networks and machine learning models for windows pe malware detection“