基于VAE-GMM的高级持续威胁攻击异常流量检测

ADVANCED PERSISTENT THREAT ATTACK ANOMALY TRAFFIC DETECTION BASED ON VAE-GMM

  • 摘要: 基于多维或高维流量数据的异常检测在APT攻击研究中具有重要的应用场景。高维流量数据传统处理方法未能充分关注数据内部结构,检测效率低,因此提出变分自编码-高斯混合模型的半监督异常检测方法(VAE-GMM)。该方法将流量数据输入生成网络中获得相应的低维表示和重构误差并作为后续估计网络的输入;估计网络则利用高斯混合模型预测样本似然能量值并根据似然能量值判断异常流量。模型重新设计损失函数和重构误差,以端到端的双网络联合训练方式摆脱局部最优,优化检测效果。实验结果表明,该模型优于其他异常检测模型,与传统DAEGMM方法相比,召回率提升了13%。

     

    Abstract: Anomaly detection based on multidimensional or high-dimensional traffic data has important application scenarios in APT attack research. Traditional processing methods for high-dimensional traffic data fail to pay sufficient attention to the internal structure of the data and have low detection efficiency, thus a semi-supervised anomaly detection method (VAE-GMM) with a variational self-coding-Gaussian mixture model is proposed. The method inputted the traffic data into the generative network to obtain the corresponding low-dimensional representation and reconstruction errors and used them as the input of the subsequent estimation network. The estimation network used the Gaussian mixture model to predict the sample likelihood energy values and determined the abnormal traffic based on the likelihood energy values. The model redesigned the loss function and reconstruction error to get rid of the local optimum and optimize the detection results by joint end-to-end dual network training. The experimental results show that the model outperforms other anomaly detection models and improves the recall rate by 13% compared with the traditional DAEGMM method.

     

/

返回文章
返回