Journal of Applied Sciences ›› 2019, Vol. 37 ›› Issue (5): 711-720.doi: 10.3969/j.issn.0255-8297.2019.05.012

• Special Issue: Information Security of Multimedia • Previous Articles     Next Articles

An Encrypted Traffic Identification Method Based on DPI and Load Randomness

SUN Zhongjun1, ZHAI Jiangtao1, DAI Yuewei2   

  1. 1. School of Electronics and Information, Jiangsu University of Science and Technology, Zhenjiang 212003, Jiangsu Province, China;
    2. School of Computer and Software, Nanjing University of Information Science & Technology, Nanjing 210044, China
  • Received:2019-07-27 Revised:2019-07-29 Online:2019-09-30 Published:2019-10-18

Abstract: With the development of encryption technologies and the emergence of private protocols, the identification of encrypted traffic has become an important research area in the field of information security. Based on the research of existing encrypted traffic identification technologies, an encrypted traffic identification algorithm based on DPI (deep packet inspection) and load randomness is proposed in this paper. The proposed algorithm mainly contains three steps. First, the DPI is used to filter and identify network traffic rapidly. Second, for those payload which could not be recognized by the DPI, their information entropies are calculated and the error of π-value is computed by Monte Carlo simulation. Finally, the C4.5 decision tree classifier is input for classification evaluation. The method can not only overcome the limitation that DPI can't fully identify the encrypted traffic and private protocol in the protocol interaction phase, but also solve the mis-distinguish of encrypted traffic and compressed file traffic as employing information entropy independently. Experimental results show that the proposed method is much more effective on encrypted traffic than the existing methods. At the same time, the method is proved to have good robustness.

Key words: encrypted traffic, deep packet inspection(DPI), information entropy, Monte Carlo simulation, C4.5 decision tree

CLC Number: