A Method Based on Multiple Population Genetic Algorithm to Select Hyper-Parameters of Industrial Intrusion Detection Classifier

Xuejun LIU1*, Hao WANG1, Xiaoni ZHANG1, Haiying LUAN2, Yun SHA1, Yong YAN1
1 College of Information Engineering, Beijing Institute of Petrochemical Technology,
19 Qingyuan North Road, Daxing District, Beijing, 102617, China
lxj@bipt.edu.cn (*Corresponding author), whbeats@163.com, 2019540019@bipt.edu.cn,
shayun@bipt.edu.cn, yanyong@bipt.edu.cn
2 Fluid Power and Automotive Equipment Center, Beijing Research Institute of Automation
for Machinery Industry CO., LTD, Beijing, 100120, China
lhying1129@aliyun.com

Abstract: Security is an increasingly prominent concern for industrial control systems, and the performance of intrusion detection classifiers depends heavily on their hyper-parameters. This paper proposes an improved multiple population genetic algorithm (IMPGA) that searches for classifier hyper-parameters automatically, with the simulated annealing algorithm (SAA) controlling the evolution rate among the populations. In addition, a hash fitness value is used to reduce resource consumption, and a directional evolution operator is introduced to optimize the population. The method effectively keeps the algorithm from falling into a local optimum and preserves the best solution found during evolution. It thereby yields optimal or near-optimal combinations of classifier hyper-parameters and improves classifier accuracy. Experiments are conducted on three datasets: the 2014 natural gas pipeline experimental dataset from Mississippi State University (the gas dataset), the 2017 intrusion detection systems dataset from the Canadian Institute for Cybersecurity (the CICIDS2017 dataset), and an oil depot dataset. On all three datasets, the area under the curve (AUC) exceeds 98% for the back propagation neural network (BPNN), 99% for extreme gradient boosting (XGBoost), and 98% for the support vector machine (SVM). Classifiers tuned with this selection method can detect intrusion attacks effectively.
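The ideas named in the abstract can be illustrated with a minimal sketch. It is not the authors' IMPGA: the toy objective `toy_score`, the grid size `GRID`, the class `CachedFitness`, the Metropolis-style acceptance rule, the cooling factor and the migration step are all assumptions chosen for illustration. The sketch only shows how several populations, a simulated-annealing temperature, a hash-keyed fitness cache and elite preservation might fit together.

```python
import math
import random

GRID = 20  # hypothetical: each hyper-parameter takes an integer index in [0, GRID)

def toy_score(params):
    """Stand-in for a classifier's validation score (assumed peak at (13, 7))."""
    x, y = params
    return 1.0 - ((x - 13) ** 2 + (y - 7) ** 2) / (2 * GRID ** 2)

class CachedFitness:
    """Memoize fitness by hyper-parameter tuple so repeated candidates cost nothing."""
    def __init__(self, fn):
        self.fn = fn
        self.cache = {}       # dict lookup hashes the tuple key
        self.evaluations = 0  # number of real (uncached) evaluations

    def __call__(self, params):
        if params not in self.cache:
            self.cache[params] = self.fn(params)
            self.evaluations += 1
        return self.cache[params]

def mutate(params, rng):
    """Shift one randomly chosen coordinate by +1 or -1, clipped to the grid."""
    i = rng.randrange(len(params))
    child = list(params)
    child[i] = min(GRID - 1, max(0, child[i] + rng.choice((-1, 1))))
    return tuple(child)

def crossover(a, b, rng):
    """Uniform crossover: take each coordinate from either parent."""
    return tuple(rng.choice(pair) for pair in zip(a, b))

def search(n_pops=3, pop_size=8, generations=40, seed=0):
    rng = random.Random(seed)
    fitness = CachedFitness(toy_score)
    pops = [[(rng.randrange(GRID), rng.randrange(GRID)) for _ in range(pop_size)]
            for _ in range(n_pops)]
    best = max((p for pop in pops for p in pop), key=fitness)
    temperature = 1.0
    for _ in range(generations):
        for pop in pops:
            for i, parent in enumerate(pop):
                child = mutate(crossover(parent, rng.choice(pop), rng), rng)
                delta = fitness(child) - fitness(parent)
                # Metropolis rule: keep improvements, sometimes keep a worse child.
                if delta >= 0 or rng.random() < math.exp(delta / temperature):
                    pop[i] = child
            champion = max(pop, key=fitness)
            if fitness(champion) > fitness(best):
                best = champion
        # Migration with elitism: the best solution so far replaces each
        # population's worst member, so it is never lost.
        for pop in pops:
            worst = min(range(pop_size), key=lambda j: fitness(pop[j]))
            pop[worst] = best
        temperature *= 0.9  # assumed geometric cooling schedule
    return best, fitness(best), fitness

if __name__ == "__main__":
    best, score, fitness = search()
    print(best, round(score, 4), "real evaluations:", fitness.evaluations)
```

In a real setting `toy_score` would be replaced by training and validating a classifier, which is where the cache matters most: the number of real evaluations can never exceed the number of distinct hyper-parameter combinations.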

Keywords: Industrial control network, Intrusion detection, Genetic algorithm, Hyper-parameter optimization.

CITE THIS PAPER AS:
Xuejun LIU, Hao WANG, Xiaoni ZHANG, Haiying LUAN, Yun SHA, Yong YAN, A Method Based on Multiple Population Genetic Algorithm to Select Hyper-Parameters of Industrial Intrusion Detection Classifier, Studies in Informatics and Control, ISSN 1220-1766, vol. 30(3), pp. 39-49, 2021. https://doi.org/10.24846/v30i3y202104