Deep Neural Network Hyperparameter Optimization with Orthogonal Array Tuning

Publication type: Part of book or chapter of book, Preprint, Article, Conference object. Published: 01 Jan 2019. Embargo end date: 01 Jan 2019. Language: English. Publisher: Springer International Publishing

Authors: Xiang Zhang; Xiaocong Chen; Lina Yao; Chang Ge; Manqing Dong

doi: 10.1007/978-3-030-36808-1_31, 10.13140/rg.2.2.16378.44481, 10.48550/arxiv.1907.13359

arXiv: 1907.13359

Abstract

Deep learning algorithms have recently achieved excellent performance in a wide range of fields (e.g., computer vision). However, a severe challenge faced by deep learning is its high dependency on hyper-parameters: the results of an algorithm may fluctuate dramatically under different hyper-parameter configurations. To address this issue, this paper presents an efficient Orthogonal Array Tuning Method (OATM) for deep learning hyper-parameter tuning. We describe the OATM approach in five detailed steps and elaborate on it using two widely used deep neural network structures (Recurrent Neural Networks and Convolutional Neural Networks). The proposed method is compared to state-of-the-art hyper-parameter tuning methods, including manual ones (e.g., grid search and random search) and automatic ones (e.g., Bayesian Optimization). The experimental results show that OATM significantly reduces tuning time compared to the state-of-the-art methods while preserving satisfactory performance. The code is available on GitHub (https://github.com/xiangzhang1015/OATM).
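The abstract does not spell out the five steps, so the following is only a minimal sketch of the general orthogonal-array idea, not the authors' implementation. It assumes four hypothetical hyper-parameters with three levels each (the names and values in `factors` are illustrative and do not come from the paper) and builds a standard 9-run, 3-level array with modular arithmetic, so that 9 configurations are trained instead of the 81 a full grid would require.

```python
from itertools import combinations, product


def l9_orthogonal_array():
    """Build a 9-run, 4-column, 3-level orthogonal array of strength 2.

    Rows are indexed by (a, b) with a, b in {0, 1, 2}; the last two
    columns are linear combinations of a and b modulo 3.
    """
    return [(a, b, (a + b) % 3, (a + 2 * b) % 3)
            for a, b in product(range(3), repeat=2)]


def is_orthogonal(array, levels=3):
    """Check that every pair of columns contains each level pair equally often."""
    expected = len(array) // levels ** 2
    for i, j in combinations(range(len(array[0])), 2):
        pairs = [(row[i], row[j]) for row in array]
        for pair in product(range(levels), repeat=2):
            if pairs.count(pair) != expected:
                return False
    return True


# Hypothetical factors and levels, chosen only for illustration.
factors = {
    "learning_rate": [0.001, 0.005, 0.01],
    "hidden_units": [32, 64, 128],
    "num_layers": [1, 2, 3],
    "l2_penalty": [0.0, 0.001, 0.01],
}


def oa_configs(factors):
    """Map each row of the orthogonal array to a concrete configuration."""
    names = list(factors)
    return [{name: factors[name][level] for name, level in zip(names, row)}
            for row in l9_orthogonal_array()]


configs = oa_configs(factors)          # 9 configurations to train
full_grid_size = 3 ** len(factors)     # 81 configurations in a full grid search
```

Each configuration in `configs` would then be trained and scored; because every level of every factor appears in exactly three runs, per-level averages of the score can be compared to pick a level for each factor.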

Related Organizations

UNSW Sydney
Australia

Keywords

FOS: Computer and information sciences, Computer Science - Machine Learning, Statistics - Machine Learning, Machine Learning (stat.ML), Machine Learning (cs.LG)

Impact by BIP!

	selected citations: 57. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
	popularity: Top 1%. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
	influence: Top 10%. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
	impulse: Top 10%. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.

Open Access route: Green

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering