Convolutional Neural Networks With Dynamic Regularization

descriptionPublicationkeyboard_double_arrow_right Article , Preprint 01 May 2021Embargo end date: 01 Jan 2019 Singapore Publisher:Institute of Electrical and Electronics Engineers (IEEE)Journal:IEEE Transactions on Neural Networks and Learning Systems, volume 32, pages 2,299-2,304 (issn: 2162-237X, eissn: 2162-2388,

Copyright policy )

Authors: Yi Wang 0068; Zhen-Peng Bian; Junhui Hou; Lap-Pui Chau;

doi: 10.1109/tnnls.2020.2997044 , 10.48550/arxiv.1909.11862

pmid: 32511095

arXiv: 1909.11862

Convolutional Neural Networks With Dynamic Regularization

- Summary
- Subjects
- Related research
  (9)
- Metrics

Abstract

Regularization is commonly used for alleviating overfitting in machine learning. For convolutional neural networks (CNNs), regularization methods, such as DropBlock and Shake-Shake, have illustrated the improvement in the generalization performance. However, these methods lack a self-adaptive ability throughout training. That is, the regularization strength is fixed to a predefined schedule, and manual adjustments are required to adapt to various network architectures. In this paper, we propose a dynamic regularization method for CNNs. Specifically, we model the regularization strength as a function of the training loss. According to the change of the training loss, our method can dynamically adjust the regularization strength in the training procedure, thereby balancing the underfitting and overfitting of CNNs. With dynamic regularization, a large-scale model is automatically regularized by the strong perturbation, and vice versa. Experimental results show that the proposed method can improve the generalization capability on off-the-shelf network architectures and outperform state-of-the-art regularization methods.

7 pages. Accepted for Publication at IEEE TNNLS

Country

Singapore

Related Organizations

City University of Hong Kong
China (People's Republic of)
Nanyang Technological University
Nanyang Technological University
Nanyang Technological University
Singapore
Nanyang Technological University

View all View all

Keywords

FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), :Electrical and electronic engineering [Engineering], Generalization, Computer Science - Computer Vision and Pattern Recognition, Convolutional Neural Network

9 Research products, page 1 of 1

International Cooperation with Higher Vocational Qualifications: an Example
1996IsAmongTopNSimilarDocuments
Fundamental Bounds for Stabilizability of Continuous-Time Systems under Stochastic Multiplicative Uncertainty and Delay**This research was supported in part by the Hong Kong RGC under Projects CityU 111810, CityU 111511, in part by the City University of Hong Kong under Project 9380054
2016IsAmongTopNSimilarDocuments
Evaluation of the dielectric function of colloidal Cd1−xHgxTe quantum dot films by spectroscopic ellipsometry
2017IsAmongTopNSimilarDocuments
Deconstructing written genres in Undergraduate Biology
2013IsAmongTopNSimilarDocuments
Visibility degradation across Hong Kong: its components and their relative contributions
2001IsAmongTopNSimilarDocuments
G-quadruplex RNA motifs influence gene expression in the malaria parasite Plasmodium falciparum
2021IsAmongTopNSimilarDocuments
Automatic Recognition and Analysis of Balance Activity in Community-Dwelling Older Adults: Algorithm Validation
2021IsAmongTopNSimilarDocuments
Supporting Flipped and Gamified Learning With Augmented Reality in Higher Education
2021IsAmongTopNSimilarDocuments
Translational control by DHX36 binding to 5′UTR G-quadruplex is essential for muscle stem-cell regenerative functions
2021IsAmongTopNSimilarDocuments

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	20
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%

Found an issue? Give us feedback

20

Top 10%

Green

bronze

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering