
handle: 20.500.14243/552044
This work provides a comprehensive theoretical and empirical analysis of SwitchPath, a stochastic activation function that improves learning dynamics by probabilistically toggling between a neuron's standard activation and its negation. We develop theoretical foundations and demonstrate its impact in multiple scenarios. By maintaining gradient flow and injecting controlled stochasticity, the method improves generalization, uncertainty estimation, and training efficiency. Experiments in classification show consistent gains over ReLU and Leaky ReLU across CNNs and Vision Transformers, with reduced overfitting and higher test accuracy. In generative modeling, a novel two-phase training scheme significantly mitigates mode collapse and accelerates convergence. Our theoretical analysis reveals that SwitchPath introduces a form of multiplicative noise that acts as a structural regularizer. Additional empirical investigations show improved information propagation and reduced model complexity. These results establish this activation mechanism as a simple yet effective way to enhance exploration, regularization, and reliability in modern neural networks.
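As described in the abstract, toggling between an activation and its negation is equivalent to multiplying the activation by a random sign, which is the multiplicative noise the theoretical analysis refers to. Below is a minimal PyTorch sketch of that reading; the class name `SwitchPath`, the switching probability `p`, the per-element gating, and the deterministic eval-time behavior are all assumptions, since the paper's exact formulation is not given in this record.

```python
import torch
import torch.nn as nn

class SwitchPath(nn.Module):
    """Hypothetical sketch of a SwitchPath-style activation.

    With probability p, the output of the base activation is negated
    during training; at evaluation time the base activation is used
    unchanged. Details (granularity of the gate, eval behavior) are
    assumptions, not the paper's confirmed formulation.
    """

    def __init__(self, base: nn.Module | None = None, p: float = 0.1):
        super().__init__()
        self.base = base if base is not None else nn.ReLU()
        self.p = p

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        a = self.base(x)
        if not self.training:
            return a  # deterministic path at inference
        # Per-element random sign: +1 with prob 1 - p, -1 with prob p.
        # Gradients still flow through `a` on both branches.
        sign = 1.0 - 2.0 * torch.bernoulli(torch.full_like(a, self.p))
        return a * sign
```

Under this sketch, the random sign has mean 1 - 2p, so the expected output is a scaled copy of the base activation while every unit keeps a nonzero gradient path, analogous to dropout's multiplicative Bernoulli noise. This is consistent with the abstract's claims about maintained gradient flow and structural regularization.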
keywords: Deep learning, Neural network algorithms, Generative networks
| Indicator | Description | Value |
| --- | --- | --- |
| Selected citations | Citations derived from selected sources; an alternative to the "Influence" indicator, which reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 1 |
| Popularity | Reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average |
| Influence | Reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average |
| Impulse | Reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
