
arXiv: 1808.08618
AbstractIn this article, we review computational aspects of deep learning (DL). DL uses network architectures consisting of hierarchical layers of latent variables to construct predictors for high‐dimensional input–output models. Training a DL architecture is computationally intensive, and efficient linear algebra library is the key for training and inference. Stochastic gradient descent (SGD) optimization and batch sampling are used to learn from massive datasets.This article is categorized under:Statistical Learning and Exploratory Methods of the Data Sciences > Deep LearningStatistical Learning and Exploratory Methods of the Data Sciences > Modeling MethodsStatistical Learning and Exploratory Methods of the Data Sciences > Neural Networks
FOS: Computer and information sciences, Computer Science - Machine Learning, deep learning, Machine Learning (stat.ML), Statistics - Computation, Machine Learning (cs.LG), linear algebra, Statistics - Machine Learning, stochastic gradient descent, Computational methods for problems pertaining to statistics, Computation (stat.CO)
FOS: Computer and information sciences, Computer Science - Machine Learning, deep learning, Machine Learning (stat.ML), Statistics - Computation, Machine Learning (cs.LG), linear algebra, Statistics - Machine Learning, stochastic gradient descent, Computational methods for problems pertaining to statistics, Computation (stat.CO)
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 11 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
