Convergence of a Batch Gradient Algorithm with Adaptive Momentum for Neural Networks

descriptionPublicationkeyboard_double_arrow_right Article 22 Jul 2011 English Publisher:Springer Science and Business Media LLCJournal:Neural Processing Letters, volume 34, pages 221-228 (issn: 1370-4621, eissn: 1573-773X,

Copyright policy )

Authors: Hongmei Shao; Dongpo Xu; Gaofeng Zheng;

doi: 10.1007/s11063-011-9193-x

Convergence of a Batch Gradient Algorithm with Adaptive Momentum for Neural Networks

- Summary
- Metrics

Abstract

In this paper, a batch gradient algorithm with adaptive momentum is considered and a convergence theorem is presented when it is used for two-layer feedforward neural networks training. Simple but necessary sufficient conditions are offered to guarantee both weak and strong convergence. Compared with existing general requirements, we do not restrict the error function to be quadratic or uniformly convex. A numerical example is supplied to illustrate the performance of the algorithm and support our theoretical finding.

Related Organizations

Rakuten (Japan)
Japan
China University of Petroleum, Beijing
China (People's Republic of)
Harbin Engineering University
China (People's Republic of)
China University of Petroleum, East China
China (People's Republic of)

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	5
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average