Wider or Deeper Neural Network Architecture for Acoustic Scene Classification with Mismatched Recording Devices

descriptionPublicationkeyboard_double_arrow_right Article , Conference object 13 Dec 2022Publisher:ACMJournal:Proceedings of the 4th ACM International Conference on Multimedia in Asia

Authors: Lam Pham; Khoa Tran; Dat Ngo; Hieu Tang; Son Phan; Alexander Schindler;

doi: 10.1145/3551626.3564962

Wider or Deeper Neural Network Architecture for Acoustic Scene Classification with Mismatched Recording Devices

- Summary
- Metrics

Abstract

In this paper, we present a robust and low complexity model for Acoustic Scene Classification (ASC), the task of identifying the scene of an audio recording. We firstly construct an ASC model in which a novel inception-residual-based network architecture is proposed to deal with the issue of mismatched recording devices. To further improve the model performance but still satisfy the low footprint, we apply two techniques of ensemble of multiple spectrograms and model compression to the proposed ASC model. By conducting extensive experiments on the benchmark DCASE 2020 Task 1A Development dataset, we achieve the best model performing an accuracy of 71.3% and a low complexity of 0.5 Million (M) trainable parameters, which is very competitive to the state-of-the-art systems and potential for real-life applications on edge devices.

Related Organizations

FPT University
Viet Nam
University of Essex
United Kingdom
Austrian Institute of Technology
Austria
University of Da Nang
Viet Nam

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	6
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%

Found an issue? Give us feedback

6

Top 10%

Green

hybrid