
This dataset contains preprocessed earthquake event and noise waveforms, supplementary to the publication: "Towards Operational Earthquake Seismogram Denoising"by Nikolaj Dahmen, John Clinton, Men-Andrin Meier, Luca ScarabelloBulletin of the Seismological Society of America Waveform data are provided in physical units [m/s], Z12 components and with a sampling rate of 100 Hz. Event signals have been preprocessed according to the methods described in the publication to reduce background noise (see Figure 2). Event signals have 18k samples with P arrival aligned at sample 6000, noise signals have 8k samples. Dataset statistics for event and noise waveforms are summarized in Figures S1 and S2. The dataset is separated into training, validation, and testing dataset, and are here available for publicly accessible networks. Metadata includes information on recording station, earthquake, and identified phase arrivals. A subset of the test dataset with fixed event and noise waveform pairs is provided in the benchmark dataset. Data are provided in HDF5 format with accompanying metadata, compatible with SeisBench. Proposed dataset structure: Event waveforms: Event_data/waveforms.hdf5 Event_data/metadata.csv Noise waveforms: Noise_data/waveforms_noise.hdf5 Noise_data/metadata_noise.csv Importing dataset with SeisBench: import seisbench.data as sbd # load noise dataset noise_ds = sbd.WaveformDataset("/Noise_data/", sampling_rate=100, component_order="Z12") # Training examples: 274,240 # Development examples: 81,416 # Test examples: 40,096 # load event dataset event_ds = sbd.WaveformDataset("/Event_data/", sampling_rate=100, component_order="Z12") # Training examples: 19,022 # Development examples: 5,385 # Test examples: 2,695 Data Sources: Network Source Year Link CH National Seismic Networks of Switzerland (SED, ETH Zurich) 1983 Link 9S Landslide deployments, Switzerland 2012 Link 8D Aftershock deployments, Switzerland 2025 Link G2 GEOBEST geothermal project networks 2006 Link S Seismology at School Program 2008 Link C4 CERN Seismic Network 2016 Link 1I Matterhorn Cryosphere Observatory 2015 Link Z3 AlpArray Temporary Network 2015 Link GU Regional Network NW Italy 1967 Link OX Northeast Italy OGS 2016 Link IV Rete Sismica Nazionale (INGV) 2005 Link 8C Mont-Blanc & Vallorcine Monitoring (RESIF-SISMOB) 2020 Link FR EPOS-France Broad-Band Network (RLBP) 1995 Link RA RESIF-RAP French Accelerometric Network 1995 Link BW BayernNetz, Germany 2001 Link GR German Regional Seismic Network (GRSN) 1976 Link LE Erdbebendienst SW Baden-Württemberg & RP 2009 Link OE Austrian Seismic Network (ZAMG) 1987 Link MN Mediterranean Very Broadband Seismographic Network (MedNet) 1990 Link Dataset for benchmarking (Benchmark.zip), contains event and noise signals part of "test" partition in hdf5 files. Contains event and noise signal combinations as used in performance test in publication, with randomly sampled noise (TestDataSet.zip) and impulsive noise signals (TestDataSet_ImpulsiveNoise.zip). The ReadMe File documents test procedure. Related code
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 1 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
