Adapting a ConvNeXt model to audio classification on AudioSet (pretrained models)

Name: Adapting a ConvNeXt model to audio classification on AudioSet (pretrained models)
Keywords: AudioSet, ConvNeXt, Audio tagging

appsOther research productkeyboard_double_arrow_right Other ORP type 09 Jun 2023Publisher:Zenodo

Authors: Pellegrini, Thomas; Khalfaoui-Hassani, Ismail; Labbé, Etienne; Masquelier, Timothée;

doi: 10.5281/zenodo.8020843 , 10.5281/zenodo.8020842

Adapting a ConvNeXt model to audio classification on AudioSet (pretrained models)

- Summary
- Subjects
- Metrics

Abstract

This deposit contains models checkpoints of our paper: Pellegrini, T., Khalfaoui-Hassani, I., Labbé, E., & Masquelier, T. (2023). Adapting a ConvNeXt model to audio classification on AudioSet. arXiv preprint arXiv:2306.00830 Please check our code: https://github.com/topel/audioset-convnext-inf Two checkpoints are provided, both a ConvNeXt-Tiny architecture adapted to AudioSet tagging: convnext_tiny_471mAP.pth --> trained on AudioSet unbalanced and balanced subsets. Training set size: 1921982 files --> mAP=0.471 on the test subset convnext_tiny_465mAP_BL_AC_70kit.pth --> the same but we removed the files from the AudioCaps dataset, from the AudioSet training set. AudioCaps is an audio captioning dataset, comprised of 57188 files coming from AudioSet. To avoid using a biased audio encoder, this checkpoint may be useful in audio-text retrieval and audio captioning experiments on AudioCaps. BL_AC : Black list of AudioCaps files.

{"references": ["Kim, C. D., Kim, B., Lee, H., & Kim, G. (2019, June). AudioCaps: Generating captions for audios in the wild."]}

Keywords

AudioSet, ConvNeXt, Audio tagging

Impact byBIP!

	citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Usage byUsageCounts

visibility	views	42
download	downloads	68

42
views
68
downloads
Powered by

Found an issue? Give us feedback

visibility

download

Average