Accelerated Doubly Stochastic Gradient Algorithm for Large-scale Empirical Risk Minimization

Publication type: Article, Preprint
Date: 01 Aug 2017
Embargo end date: 01 Jan 2023
Publisher: International Joint Conferences on Artificial Intelligence Organization
Journal: Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence

Authors: Shen, Zebang; Qian, Hui; Mu, Tongzhou; Zhang, Chao

doi: 10.24963/ijcai.2017/378, 10.48550/arxiv.2304.11665

arXiv: 2304.11665

Abstract

Algorithms with fast convergence, small memory footprints, and low per-iteration complexity are particularly favorable for artificial intelligence applications. In this paper, we propose a doubly stochastic algorithm with a novel accelerating multi-momentum technique to solve large-scale empirical risk minimization problems for learning tasks. While enjoying a provably superior convergence rate, the algorithm accesses only a mini-batch of samples and updates only a small block of variable coordinates in each iteration, which substantially reduces the number of memory references when both a massive sample size and ultra-high dimensionality are involved. Specifically, to obtain an ε-accurate solution, our algorithm requires only O(log(1/ε)/sqrt(ε)) overall computation in the general convex case and O((n + sqrt(nκ)) log(1/ε)) in the strongly convex case. Empirical studies on huge-scale datasets illustrate the efficiency of our method in practice.
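
To make the "doubly stochastic" idea concrete, the following is a minimal sketch of one plain (non-accelerated) step that samples both a mini-batch of rows and a block of coordinates. It is not the authors' algorithm: the multi-momentum acceleration from the abstract is omitted, and the ridge-regression objective, the function names (`objective`, `doubly_stochastic_step`, `run_demo`), the step size, and the batch and block sizes are all assumptions chosen for illustration.

```python
import numpy as np


def objective(w, X, y, lam=1e-3):
    """Ridge-regularized least squares (an assumed example of an ERM objective)."""
    return 0.5 * np.mean((X @ w - y) ** 2) + 0.5 * lam * np.dot(w, w)


def doubly_stochastic_step(w, X, y, rng, batch_size=8, block_size=4,
                           lr=0.1, lam=1e-3):
    """One plain doubly stochastic gradient step (illustrative, no momentum).

    Samples a mini-batch of rows AND a block of coordinates, then updates
    only the sampled coordinates using the mini-batch gradient estimate.
    """
    n, d = X.shape
    rows = rng.choice(n, size=batch_size, replace=False)  # sampled examples
    cols = rng.choice(d, size=block_size, replace=False)  # sampled coordinates

    # Residuals on the mini-batch. For simplicity this reads every coordinate
    # of the sampled rows; a memory-efficient implementation would maintain
    # the residuals incrementally instead.
    residual = X[rows] @ w - y[rows]

    # Partial derivatives of the mini-batch loss w.r.t. the sampled block.
    grad_block = X[rows][:, cols].T @ residual / batch_size + lam * w[cols]

    w_new = w.copy()
    w_new[cols] -= lr * grad_block  # all other coordinates stay untouched
    return w_new, cols


def run_demo(steps=2000, seed=0):
    """Run the step repeatedly on synthetic data; returns (start, end) objective."""
    rng = np.random.default_rng(seed)
    X = rng.standard_normal((200, 20))
    y = X @ rng.standard_normal(20)  # noiseless synthetic targets
    w = np.zeros(20)
    start = objective(w, X, y)
    for _ in range(steps):
        w, _ = doubly_stochastic_step(w, X, y, rng)
    return start, objective(w, X, y)
```

Calling `run_demo()` returns the objective value before and after the updates, which gives a quick check that the sampled-block updates make progress on this toy problem. Each step writes only `block_size` entries of `w`, which is the property the abstract ties to fewer memory references.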

Related Organizations

University of Hannover, Germany
Zhejiang Ocean University, China (People's Republic of)
Zhejiang University, China (People's Republic of)

Keywords

FOS: Computer and information sciences, Computer Science - Machine Learning, Machine Learning (cs.LG)

Impact by BIP!

selected citations: 4
popularity: Average
influence: Average
impulse: Average

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering