应用科学学报 ›› 2019, Vol. 37 ›› Issue (5): 711-720.doi: 10.3969/j.issn.0255-8297.2019.05.012

• 多媒体信息安全 • 上一篇    下一篇

一种基于DPI和负载随机性的加密流量识别方法

孙中军1, 翟江涛1, 戴跃伟2   

  1. 1. 江苏科技大学 电子信息学院, 江苏 镇江 212003;
    2. 南京信息工程大学 计算机与软件学院, 南京 210044
  • 收稿日期:2019-07-27 修回日期:2019-07-29 出版日期:2019-09-30 发布日期:2019-10-18
  • 通信作者: 翟江涛,副教授,研究方向:多媒体与信息安全,E-mail:jiangtaozhai@gmail.com E-mail:jiangtaozhai@gmail.com
  • 基金资助:
    国家自然科学基金(No.61702235,No.61472188,No.61602247,No.U1636117);江苏省自然科学基金(No.BK20150472,No.BK20160840)资助

An Encrypted Traffic Identification Method Based on DPI and Load Randomness

SUN Zhongjun1, ZHAI Jiangtao1, DAI Yuewei2   

  1. 1. School of Electronics and Information, Jiangsu University of Science and Technology, Zhenjiang 212003, Jiangsu Province, China;
    2. School of Computer and Software, Nanjing University of Information Science & Technology, Nanjing 210044, China
  • Received:2019-07-27 Revised:2019-07-29 Online:2019-09-30 Published:2019-10-18

摘要: 随着加密技术的发展和私有协议的不断出现,加密流量的识别已经成为信息安全领域的重要研究方向.该文在现有加密流量识别技术研究基础上提出一种基于深度包检测技术(deep packet inspection,DPI)和负载随机性的加密流量识别方法,该方法主要分为三部分:首先用DPI技术对网络流量快速筛选识别;其次对DPI无法识别流量的有效负载计算信息熵值和蒙特卡罗仿真估计π值的误差;最后输入C4.5决策树分类器进行分类评估.所提方法不仅可克服了DPI无法完全识别协议交互阶段的加密数据和私有协议的缺陷,同时解决了用信息熵识别加密流量和非加密压缩流量误判的问题.实验表明,所提方法较现有的识别模型对加密流量的识别效果有较大提高,同时验证了所提方法的鲁棒性.

关键词: 加密流量, 深度包检测技术, 信息熵, 蒙特卡罗仿真, C4.5决策树

Abstract: With the development of encryption technologies and the emergence of private protocols, the identification of encrypted traffic has become an important research area in the field of information security. Based on the research of existing encrypted traffic identification technologies, an encrypted traffic identification algorithm based on DPI (deep packet inspection) and load randomness is proposed in this paper. The proposed algorithm mainly contains three steps. First, the DPI is used to filter and identify network traffic rapidly. Second, for those payload which could not be recognized by the DPI, their information entropies are calculated and the error of π-value is computed by Monte Carlo simulation. Finally, the C4.5 decision tree classifier is input for classification evaluation. The method can not only overcome the limitation that DPI can't fully identify the encrypted traffic and private protocol in the protocol interaction phase, but also solve the mis-distinguish of encrypted traffic and compressed file traffic as employing information entropy independently. Experimental results show that the proposed method is much more effective on encrypted traffic than the existing methods. At the same time, the method is proved to have good robustness.

Key words: encrypted traffic, deep packet inspection(DPI), information entropy, Monte Carlo simulation, C4.5 decision tree

中图分类号: