
With the advance of deep learning, high-performance optimization algorithms for training neural networks are in strong demand. Learning algorithms with gradient normalization mechanisms have been investigated and shown to be effective, and in such algorithms the adaptation of the learning rate is a critical issue. Learning algorithms for neural networks are classified into batch learning and mini-batch learning; when the training data are vast, mini-batch learning is often used because of memory-size limitations and computational cost. Mini-batch learning algorithms with gradient normalization mechanisms have been investigated, but learning rate adaptation in such algorithms has not been studied well. This study introduces a new learning rate adaptation mechanism, based on the sign variation of the gradient, into a mini-batch learning algorithm with gradient normalization. The effectiveness of the proposed algorithm is verified through applications to learning problems of multi-layered neural networks and convolutional neural networks.
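The abstract does not specify the exact update rule, so the following is only a minimal sketch of the general idea it names: per-parameter learning rates adapted from the sign variation of successive mini-batch gradients (an Rprop-style rule), combined with gradient normalization. All names and constants here (`sign_adaptive_normalized_update`, `eta_up`, `eta_down`, the toy objective) are illustrative assumptions, not the authors' algorithm.

```python
import numpy as np

def sign_adaptive_normalized_update(w, grad, prev_grad, lr,
                                    eta_up=1.2, eta_down=0.5,
                                    lr_min=1e-6, lr_max=1.0, eps=1e-8):
    """One mini-batch step combining gradient normalization with
    per-parameter learning rates adapted from gradient sign variation.
    This is a generic Rprop-like sketch, not the paper's exact scheme."""
    # Normalize the mini-batch gradient so the step size is set by lr alone.
    g = grad / (np.linalg.norm(grad) + eps)

    # Sign agreement between consecutive mini-batch gradients:
    # same sign -> grow the learning rate, flipped sign -> shrink it.
    agree = np.sign(g) * np.sign(prev_grad)
    lr = np.where(agree > 0, np.minimum(lr * eta_up, lr_max),
         np.where(agree < 0, np.maximum(lr * eta_down, lr_min), lr))

    w = w - lr * g
    return w, g, lr

# Toy usage: minimize ||w||^2 with noisy "mini-batch" gradients.
rng = np.random.default_rng(0)
w = rng.normal(size=5)
lr = np.full_like(w, 0.05)
prev_g = np.zeros_like(w)
for step in range(100):
    grad = 2.0 * w + 0.1 * rng.normal(size=5)  # noisy mini-batch gradient
    w, prev_g, lr = sign_adaptive_normalized_update(w, grad, prev_g, lr)
print(np.round(w, 4))
```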
