
NOTE: Our work is under review, and this dataset is released for open science purposes. 💾 data/ # Datasets (AWS, AZURE, GCP)├── 1_raw_data/ # Original incident reports├── 2_clean_data/ # Processed clean data├── 3_sample_data/ # Sampled data by K-means clustering├── 4_label_data/ # Annotated data for evaluation└── data_process.py # Data process, clean, and sample ID Name Period #Rows #Labeled Avg.Words 1 AWS 2016-2022 774 150(19%) 151 2 AZURE 2019-2024 127 95(75%) 575 3 GCP 2016-2021 2,186 215(10%) 533 TOTAL TOTAL 2016-2024 3,087 460(15%) -
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
