
Sound Event Classification (SEC) has attracted considerable attention in recent times, with applications in a variety of fields, including environmental acoustics. In particular, there has been a notable increase in the level of interest in outdoor measurements, with a view to distinguishing the contributions of a particular source from the background noise. The utilisation of machine learning tools is contingent upon the availability of substantial datasets for the purposes of training or validation. DataSEC is an open-access dataset specifically designed for Sound Event Classification of environmental noise that can be listened in outdoor environments.The collection consists of 23 hours and 32 minutes of authentic, non-synthesized audio recordings, which have been meticulously gathered from two distinct sources: sound level measurements and online repositories. The authors have conducted a comprehensive analysis of the sound samples, encompassing a diverse range of environments, from urban to rural settings.DataSEC comprises a total of 5,024 mono-channel .wav audio samples, with a sampling rate of 44.1 kHz. Each sample represents a single event that has been classified into one of the following 22 defined sound classes. The utilisation of the symbol "/" serves to denote the subclasses. The identified classes and subclasses are: Bells; Birds; Cat fights and moans; Chicken coop; Cicadas and crickets; Crows seagulls and magpies; Dog barkings and howlings; Glass breaking; Horn; Jet aircrafts; Lawn mower brush cutter and olive shaker; Music; Propeller aircrafts \Airplanes, \Helicopters; Sirens and alarms; Thunder fireworks and gunshot; Train; Vacuum cleaner fan and hairdryer; Vehicle idling \Car-truck idling, \Motorbike idling; Vehicle pass-by \Car pass-by, \Motorbike pass-by, \Truck pass-by; Voices; Wind turbine; Workshop \Air compressor, \Drill, Grinder, \Jackhammer, \Saw.The minimum number of entries required for each class has been set at 50, with a minimum of 30 entries stipulated for each sub-class. A thorough examination of all samples has been conducted by the authors, with all files being meticulously preprocessed to eliminate silences and superfluous parts. This preprocessing involved the removal of irrelevant background activity and overlapping sounds.The authors expect that the dataset will contribute to future research in real-world sound event analysis and automated acoustic evaluation by means of machine learning.
Environmental Acoustics, Sound dataset, Deep-learning in Acoustic, Outdoor Noise Assessment
Environmental Acoustics, Sound dataset, Deep-learning in Acoustic, Outdoor Noise Assessment
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 1 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
