
arXiv: 1809.08404
AbstractDeep learning is a machine learning methodology using a multilayer neural network. Let be mutually disjoint node sets (layers). A multilayer neural network can be regarded as a union of the complete bipartite graphs on consecutive two node sets and for . The edges of a bipartite graph function as weights which are represented as a matrix. The values of th layer are basically computed by multiplication of the weight matrix and values of th layer. Using mass training and teacher data, the weight parameters are estimated little by little. Overfitting (or overlearning) refers to a model that models the “training data” too well. It then becomes difficult for the model to generalize to new data which were not in the training set. The most popular method to avoid overfitting is called dropout. Dropout zeros out a random sample of activations (nodes) during the training process. A random sampling of nodes causes more irregular frequency of dropout edges. There is a similar sampling concept in the area of design of experiments. We propose a combinatorial design that drops out nodes from each layer. This design balances the edge frequencies. We analyze and construct such designs in this paper.
Learning and adaptive systems in artificial intelligence, deep learning, dropout, Combinatorial aspects of block designs, dropout design, split-block design, FOS: Mathematics, Applications of design theory to circuits and networks, Mathematics - Combinatorics, Combinatorics (math.CO), Artificial neural networks and deep learning
Learning and adaptive systems in artificial intelligence, deep learning, dropout, Combinatorial aspects of block designs, dropout design, split-block design, FOS: Mathematics, Applications of design theory to circuits and networks, Mathematics - Combinatorics, Combinatorics (math.CO), Artificial neural networks and deep learning
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 5 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
