Journal of Systems Engineering and Electronics ›› 2009, Vol. 20 ›› Issue (2): 405-411.

• SOFTWARE ALGORITHM AND SIMULATION •

Detecting network intrusions by data mining and variable-length sequence pattern matching

Tian Xinguang1,2, Duan Miyi1,2, Sun Chunlai & Liu Xin   

  1. Inst. of Computing Technology, Chinese Academy of Sciences, Beijing 100190, P. R. China;
    2. Inst. of Computing Technology, Beijing Jiaotong Univ., Beijing 100029, P. R. China
  • Online: 2009-04-17 Published: 2010-01-03

Abstract:

Anomaly detection has been an active research topic in network intrusion detection for many years. This paper presents a novel anomaly detection method based on system calls into the kernels of Unix or Linux systems. The method uses a data mining technique to model the normal behavior of a privileged program and a variable-length pattern matching algorithm to compare current behavior with historical normal behavior; variable-length matching is better suited to this problem than the fixed-length pattern matching algorithm proposed by Forrest et al. At the detection stage, the particular characteristics of the audit data are taken into account, and either of two alternative schemes can be used to distinguish normal behavior from intrusions. The method balances computational efficiency and detection accuracy and is especially suitable for online detection. Its performance is evaluated on a typical test data set, and the results show that it significantly outperforms the anomaly detection method based on hidden Markov models proposed by Yan et al. and the method based on fixed-length patterns proposed by Forrest and Hofmeyr. The method has been applied in practical host-based intrusion detection systems and has achieved high detection performance.
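
The abstract does not give the mining or matching procedure in detail, so the following is only a minimal sketch of the general idea of variable-length pattern matching over system call traces, not the authors' algorithm. It assumes that normal patterns are contiguous subsequences of several lengths that recur in normal traces, and that a monitored trace is matched greedily, longest pattern first. The function names (`mine_patterns`, `match_flags`, `anomaly_score`), the parameters (`min_len`, `max_len`, `min_support`), and the scoring rule are all hypothetical choices made for illustration.

```python
from collections import Counter


def mine_patterns(traces, min_len=2, max_len=4, min_support=2):
    """Collect contiguous system call subsequences of lengths min_len..max_len
    that occur at least min_support times across the normal traces.
    (Assumed mining step; the paper's actual technique may differ.)"""
    counts = Counter()
    for trace in traces:
        for n in range(min_len, max_len + 1):
            for i in range(len(trace) - n + 1):
                counts[tuple(trace[i:i + n])] += 1
    return {p for p, c in counts.items() if c >= min_support}


def match_flags(trace, patterns, min_len=2, max_len=4):
    """Greedy longest-first matching. Returns one flag per call:
    1 if the call is covered by a known normal pattern, otherwise 0."""
    flags = []
    i = 0
    while i < len(trace):
        # Try the longest candidate pattern starting at position i first.
        for n in range(min(max_len, len(trace) - i), min_len - 1, -1):
            if tuple(trace[i:i + n]) in patterns:
                flags.extend([1] * n)
                i += n
                break
        else:
            # No pattern of any allowed length starts here.
            flags.append(0)
            i += 1
    return flags


def anomaly_score(trace, patterns, **kwargs):
    """Fraction of calls not covered by any normal pattern (0.0 = fully normal)."""
    flags = match_flags(trace, patterns, **kwargs)
    return 1.0 - sum(flags) / len(flags) if flags else 0.0


if __name__ == "__main__":
    normal = [["open", "read", "write", "close"]] * 2
    patterns = mine_patterns(normal)
    print(anomaly_score(["open", "read", "write", "close"], patterns))
    print(anomaly_score(["open", "exec", "socket", "close"], patterns))
```

In this sketch a fixed-length scheme would correspond to setting `min_len == max_len`; allowing a range of lengths lets a long normal run be matched in one step while shorter fragments can still match around an unfamiliar call. How the per-call flags are turned into an alarm (the two decision schemes mentioned in the abstract) is not specified here and is left out.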