Chinese Journal of Network and Information Security ›› 2023, Vol. 9 ›› Issue (1): 130-139.doi: 10.11959/j.issn.2096-109x.2023005

• Papers • Previous Articles     Next Articles

IoT intrusion detection method for unbalanced samples

ANTONG P, Wen CHEN, Lifa WU   

  1. School of Computer Science, School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing 210003, China
  • Revised:2022-11-12 Online:2023-02-25 Published:2023-02-01
  • Supported by:
    TheNational Key R&D Program of China(2019YFB2101704)

Abstract:

In recent years, network traffic increases exponentially with the iteration of devices, while more and more attacks are launched against various applications.It is significant to identify and classify attacks at the traffic level.At the same time, with the explosion of Internet of Things (IoT) devices in recent years, attacks on IoT devices are also increasing, causing more and more damages.IoT intrusion detection is able to distinguish attack traffic from such a large volume of traffic, secure IoT devices at the traffic level, and stop the attack activity.In view of low detection accuracy of various attacks and sample imbalance at present, a random forest based intrusion detection method (Resample-RF) was proposed, which consisted of three specific methods: optimal sample selection algorithm, feature merging algorithm based on information entropy, and multi-classification greedy transformation algorithm.Aiming at the problem of unbalanced samples in the IoT environment, an optimal sample selection algorithm was proposed to increase the weight of small samples.Aiming at the low efficiency problem of random forest feature splitting, a feature merging method based on information entropy was proposed to improve the running efficiency.Aiming at the low accuracy problem of random forest multi-classification, a multi-classification greedy transformation method was proposed to further improve the accuracy.The method was evaluated on two public datasets.F1 reaches 0.99 on IoT-23 dataset and 1.0 on Kaggle dataset, both of which have good performance.The experimental results show that the proposed model can effectively identify the attack traffic from the massive traffic, better prevent the attack of hackers on the application, protect the IoT devices, and thus protect the related users.

Key words: traffic analysis, IoT, intrusion detection, random forest, unbalanced sample

CLC Number: 

No Suggested Reading articles found!