ParMAC: distributed optimisation of nested functions, with application to learning binary autoencoders

Name: ParMAC: distributed optimisation of nested functions, with application to learning binary autoencoders
Keywords: FOS: Computer and information sciences, Optimization and Control (math.OC), FOS: Mathematics, Machine Learning (stat.ML), Distributed, Parallel, and Cluster Computing (cs.DC), Neural and Evolutionary Computing (cs.NE), Machine Learning (cs.LG)

Carreira-Perpi����n, Miguel ��.; Alizadeh, Mehdi

Found an issue? Give us feedback

https://dx.doi.org/1...arrow_drop_down

https://dx.doi.org/10.48550/ar...

Article . 2016

License: arXiv Non-Exclusive Distribution

Data sources: Datacite

ParMAC: distributed optimisation of nested functions, with application to learning binary autoencoders

descriptionPublicationkeyboard_double_arrow_right Article 01 Jan 2016Embargo end date: 01 Jan 2016Publisher:arXiv

Authors: Carreira-Perpi��n, Miguel ��.; Alizadeh, Mehdi;

doi: 10.48550/arxiv.1605.09114

ParMAC: distributed optimisation of nested functions, with application to learning binary autoencoders

- Summary
- Subjects
- Related research
  (7)
- Metrics

Abstract

Many powerful machine learning models are based on the composition of multiple processing layers, such as deep nets, which gives rise to nonconvex objective functions. A general, recent approach to optimise such "nested" functions is the method of auxiliary coordinates (MAC). MAC introduces an auxiliary coordinate for each data point in order to decouple the nested model into independent submodels. This decomposes the optimisation into steps that alternate between training single layers and updating the coordinates. It has the advantage that it reuses existing single-layer algorithms, introduces parallelism, and does not need to use chain-rule gradients, so it works with nondifferentiable layers. With large-scale problems, or when distributing the computation is necessary for faster training, the dataset may not fit in a single machine. It is then essential to limit the amount of communication between machines so it does not obliterate the benefit of parallelism. We describe a general way to achieve this, ParMAC. ParMAC works on a cluster of processing machines with a circular topology and alternates two steps until convergence: one step trains the submodels in parallel using stochastic updates, and the other trains the coordinates in parallel. Only submodel parameters, no data or coordinates, are ever communicated between machines. ParMAC exhibits high parallelism, low communication overhead, and facilitates data shuffling, load balancing, fault tolerance and streaming data processing. We study the convergence of ParMAC and propose a theoretical model of its runtime and parallel speedup. We develop ParMAC to learn binary autoencoders for fast, approximate image retrieval. We implement it in MPI in a distributed system and demonstrate nearly perfect speedups in a 128-processor cluster with a training set of 100 million high-dimensional points.

40 pages, 13 figures. The abstract appearing here is slightly shorter than the one in the PDF file because of the arXiv's limitation of the abstract field to 1920 characters

Keywords

FOS: Computer and information sciences, Optimization and Control (math.OC), FOS: Mathematics, Machine Learning (stat.ML), Distributed, Parallel, and Cluster Computing (cs.DC), Neural and Evolutionary Computing (cs.NE), Machine Learning (cs.LG)

7 Research products, page 1 of 1

Shared-memory parallel programming in C++
1990IsAmongTopNSimilarDocuments
MPI: Past, Present and Future
2007IsAmongTopNSimilarDocuments
Xmipp: An Image Processing Package for Electron Microscopy
1996IsAmongTopNSimilarDocuments
STAVOVI PREMA OSOBAMA ISTOSPOLNE SEKSUALNE ORIJENTACIJE U SEKTORU ZDRAVSTVA I POLICIJE
2016IsAmongTopNSimilarDocuments
ParMAC: distributed optimisation of nested functions, with application to learning binary autoencoders
2016IsAmongTopNSimilarDocuments
Porting SPLASH-2 Benchmarks to the T3E
2001IsAmongTopNSimilarDocuments
Characterization of Changes in Particle Size Distribution by the PaRMAC Evaluation Method
1999IsAmongTopNSimilarDocuments

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

Upload OA version

Are you the author of this publication? Upload your Open Access version to Zenodo!

It’s fast and easy, just two clicks!

uploadUpload now

ParMAC: distributed optimisation of nested functions, with application to learning binary autoencoders

ParMAC: distributed optimisation of nested functions, with application to learning binary autoencoders

7 Research products, page 1 of 1

Shared-memory parallel programming in C++

MPI: Past, Present and Future

Xmipp: An Image Processing Package for Electron Microscopy

STAVOVI PREMA OSOBAMA ISTOSPOLNE SEKSUALNE ORIJENTACIJE U SEKTORU ZDRAVSTVA I POLICIJE

ParMAC: distributed optimisation of nested functions, with application to learning binary autoencoders

Porting SPLASH-2 Benchmarks to the T3E

Characterization of Changes in Particle Size Distribution by the PaRMAC Evaluation Method