基于PPO算法的恶意代码对抗攻击方法
首发时间:2023-03-13
摘要:随着互联网的发展,恶意代码已经成为用户信息安全的威胁之一。传统的恶意代码检测方法难以处理每天新增的恶意代码样本。因此,机器学习算法在恶意代码检测领域中的应用越来越普遍。然而,使用机器学习技术进行恶意代码检测的算法,通常只注重优化性能指标(如正确率和召回率),而没有充分考虑到攻击者可能会故意制造具有欺骗性的输入数据,以此来欺骗机器学习模型。当前面向恶意代码检测的对抗攻击方法存在模型训练困难,逃逸率效果不理想等问题。本文基于上述问题提出了基于PPO算法的恶意代码对抗攻击方法,利用强化学习进行恶意代码对抗攻击,设计了合理的动作空间用于扰动PE文件,利用PPO算法有效训练智能体模型,并在多种恶意代码检测器上进行实验,最终结果表明,本文对主流的恶意代码检测器表现出较高的逃逸率,相对于同类方法,在逃逸率和平均逃逸步长上具有明显优势
For information in English, please click here
PPO-Based Adversarial Attack Method for Malware Detection
Abstract:With the development of the Internet, malicious code has become one of the threats to user information security. Traditional methods of detecting malicious code are unable to handle the increasing number of new malicious code samples generated every day. Therefore, machine learning algorithms are becoming increasingly common in the field of malicious code detection. However, algorithms that use machine learning technology for malicious code detection typically focus only on optimizing performance metrics such as accuracy and recall, without fully considering the fact that attackers may intentionally create deceptive input data to deceive the machine learning model and make it unable to function properly in real environments. Currently, there are difficulties in training models for adversarial attacks against malicious code detection, and the escape rate is not ideal. Based on the above issues, the main research work of this paper is as follows: A malwre adversarial attack method based on the PPO algorithm is proposed, and an action space is designed for perturbing PE files and traininng the model. Experiments are conducted on multiple malicious code detectors, and the results show that this method performs well in terms of evasion rate compared to otherattack methods.
Keywords: Cyber Security Malware Detection Adversarial Attack
基金:
引用
No.****
同行评议
勘误表
基于PPO算法的恶意代码对抗攻击方法
评论
全部评论