2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information

2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information

Technical Program

Paper Detail

Paper IDCHLG-2.1
Paper Title HYBRID MODEL FOR NETWORK ANOMALY DETECTION WITH GRADIENT BOOSTING DECISION TREES AND TABTRANSFORMER
Authors Xinyue Xu, Australian National University, Australia; Xiaolu Zheng, Beihang University, China
SessionCHLG-2: ZYELL - NCTUNetwork Anomaly Detection Challenge
LocationZoom
Session Time:Monday, 07 June, 13:00 - 14:45
Presentation Time:Monday, 07 June, 13:00 - 14:45
Presentation Poster
Topic Grand Challenge: ZYELL - NCTUNetwork Anomaly Detection Challenge
IEEE Xplore Open Preview  Click here to view in IEEE Xplore
Virtual Presentation  Click here to watch in the Virtual Conference
Abstract Last decades have witnessed the monumental growth of Internet applications, and the network security fields have attracted increasing interests as it has become a necessity due to proliferation of information technologies in everyday life. Network anomaly detection aims to detect the potential anomalous behavior in traffic data, which is critical for detecting and reacting to the presence of attacks in the network. In this paper, we present our solution for the ICASSP 2021 Network Anomaly Detection Challenge (NAD) challenge. Firstly, we present our feature extraction method in detecting network anomalies, as efficiently extracting the discriminate features from the traffic data over time can be greatly helpful to improve the anomaly detection performance. Secondly, both gradient boosting decision trees and TabTransformer-Based classification model are trained for the multi-class classification task. The TabTransformer is based on self-attention and can transform the categorical feature' embeddings into robust contextual embeddings, thus, can achieve higher prediction accuracy. Finally, we ensemble the results to improve the performance further. To demonstrate the effectiveness of our approach, extensive experiments have been conducted on the NAD datasets. Our approach achieves the score of 0.625 and ranked as $2nd$ place in the final leaderboard.