
The GECAD Network Intrusion Scenarios (GeNIS) dataset contains multiple sequential attack scenarios and different types of realistic normal network activity, recorded during advanced network simulations on the Airbus CyberRange platform. The raw network packets were analyzed to generate labelled network flows, with the computation of statistical features to represent the traffic patterns of local and remote attackers, normal users and administrators, and background traffic of an enterprise computer network. GeNIS follows a modular design, providing raw packet capture next generation (PCAPNG) files with over 37 million packets of each intermediate attack step to enable an in-depth analysis with different flow exporters, feature extraction, and feature selection tools, as well as filtered CSV files with over 2.8 million flows created with 5, 10, 30, and 60 second flow intervals. The flows were preprocessed to provide a reliable benchmark dataset with the most relevant features for the training, validation, and testing of robust machine learning and deep learning models.If you use this dataset, please cite the primary data article: https://doi.org/10.1016/j.dib.2025.111487
This work was partially supported by the “Cybers SeC IP” project (NORTE-01-0145-FEDER-000044) and “AIDA” project (EDF-2023-DA-CYBER-DAAI, grant no. 101168202), through the European Regional Development Fund (ERDF) and European Defence Fund (EDF). This work has also received funding from UIDB/00760/2020.
Cybersecurity, Packet capture, Machine learning, Attack classification, Anomaly detection, Network flow
Cybersecurity, Packet capture, Machine learning, Attack classification, Anomaly detection, Network flow
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
