Optimizing Attack Detection for High Dimensionality and Imbalanced Data with SMOTE, Chi-Square and Random Forest Classifier

The IJICS (International Journal of Informatics and Computer Science)(2022)

Cited 0|Views1
No score
Abstract
The rapid growth of the network generates a very large and varied amount of traffic which has an impact on data and information security. This study resolves two common problems in attack detection, namely high dimensionality and high-class imbalance of the network traffic. This study used the ISCX CICIDS-2017 dataset. This study used the ISCX CICIDS-2017 dataset. The CICIDS-2017 dataset is imbalance that contains very diverse types of traffic including normal traffic and several types of attacks (multi-class). This study proposes a combination of the Chi-Square feature selection technique with the Tree-Based Classifier Random Forest. In the experiment first the Chi-Square Correlation Based feature selection technique was applied to the imbalance dataset. The selected features are then validated using several Random Forest algorithms. The test was also performed comparisons with other classification algorithms such as Naïve Bayes, Bayes Network, J48, REPTree, and Adaboost. This study also examines the implementation of SMOTE to overcome the problem of high calass imbalance. The test results also show that the proposed ensemble method has a very good performance from the Accuracy, TPR, FPR, Precision, F-Measure, and ROC values
More
Translated text
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined