Detail publikace

Comparison of Multiple Feature Selection Techniques for Machine Learning-Based Detection of IoT Attacks

PHAN, V. JEŘÁBEK, J. MALINA, L.

Originální název

Comparison of Multiple Feature Selection Techniques for Machine Learning-Based Detection of IoT Attacks

Typ

článek ve sborníku ve WoS nebo Scopus

Jazyk

angličtina

Originální abstrakt

The Internet of Things (IoT) has become increasingly practical in applications such as smart homes, autonomous vehicles, and environmental monitoring. However, this rapid expansion has led to significant cybersecurity threats. Detecting these threats is critical, and while machine learning techniques are valuable, they struggle with high-dimensional data. Feature selection helps by reducing computational costs while maintaining model generalization. Selecting the most effective feature selection method is a crucial task. This research addresses this gap by testing five feature selection methods: Random Forest (RF), Recursive Feature Elimination (RFE), Logistic Regression (LR), XGBoost Regression (XGBoost), and Information Gain (IG) using the CIC-IoT 2023 dataset. It evaluates these methods when being used with five machine learning models: Decision Tree (DT), Random Forest (RF), k-Nearest Neighbors (k-NN), Gradient Boosting (GB), and Multi-layer Perceptron (MLP) using metrics like accuracy, precision, recall, and F1-score across three datasets. The results show that RFE, especially with the RF model, achieves the highest accuracy (99.57%) with 30 features. RF is the most stable, with accuracy from 83% to 99.56%. Additionally, the 5-feature scheme is best for implementing IDS on resource-limited IoT devices, with RFE paired with the k-NN model being the optimal combination.

Klíčová slova

IoT; Anomaly Detection; IDS; Machine Learning; Feature Selection

Autoři

PHAN, V.; JEŘÁBEK, J.; MALINA, L.

Vydáno

30. 7. 2024

Nakladatel

Association for Computing Machinery

Místo

New York, NY, USA

ISBN

979-8-4007-1718-5

Kniha

ARES '24: Proceedings of the 19th International Conference on Availability, Reliability and Security

Strany od

1

Strany do

10

Strany počet

10

URL

BibTex

@inproceedings{BUT189196,
  author="Viet Anh {Phan} and Jan {Jeřábek} and Lukáš {Malina}",
  title="Comparison of Multiple Feature Selection Techniques for Machine Learning-Based Detection of IoT Attacks",
  booktitle="ARES '24: Proceedings of the 19th International Conference on Availability, Reliability and Security",
  year="2024",
  pages="1--10",
  publisher="Association for Computing Machinery",
  address="New York, NY, USA",
  doi="10.1145/3664476.3670440",
  isbn="979-8-4007-1718-5",
  url="https://dl.acm.org/doi/10.1145/3664476.3670440"
}