Combinatorial designs for deep learning

Publication: Article, Preprint, Other literature type

04 May 2020; Embargo end date: 01 Jan 2018; English

Publisher: Wiley

Journal: Journal of Combinatorial Designs, volume 28, pages 633-657 (issn: 1063-8539, eissn: 1520-6610)

Authors: Shoko Chisaki; Ryoh Fuji‐Hara; Nobuko Miyamoto

doi: 10.1002/jcd.21720, 10.48550/arxiv.1809.08404

arXiv: 1809.08404

Abstract

Deep learning is a machine learning methodology that uses a multilayer neural network. Let $V_1, V_2, \ldots, V_n$ be mutually disjoint node sets (layers). A multilayer neural network can be regarded as a union of the complete bipartite graphs on each pair of consecutive node sets $V_i$ and $V_{i+1}$ for $i = 1, 2, \ldots, n-1$. The edges of each bipartite graph function as weights, which are represented as a matrix. The values of the $i$th layer are basically computed by multiplying the weight matrix by the values of the $(i-1)$th layer. Using large amounts of training and teacher data, the weight parameters are estimated little by little. Overfitting (or overlearning) refers to a model that fits the training data too well, which makes it difficult for the model to generalize to new data that were not in the training set. The most popular method to avoid overfitting is called dropout. Dropout zeros out a random sample of activations (nodes) during the training process. Random sampling of nodes leads to irregular frequencies of dropped-out edges. There is a similar sampling concept in the area of design of experiments. We propose a combinatorial design that drops out nodes from each layer. This design balances the edge frequencies. We analyze and construct such designs in this paper.
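The contrast between random node sampling and a balanced selection can be illustrated with a small sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the two toy layers `V1` and `V2`, the per-layer sample size `K`, the helper `edge_frequencies`, and the convention that an edge is counted whenever both of its endpoints are selected in the same round. The "balanced" family used here is simply the exhaustive product of all `K`-subsets of each layer, a naive stand-in rather than one of the constructions the authors analyze.

```python
import itertools
import random
from collections import Counter


def edge_frequencies(blocks):
    """For each edge (u, v), count the blocks that select both u and v."""
    freq = Counter()
    for c1, c2 in blocks:
        for u in c1:
            for v in c2:
                freq[(u, v)] += 1
    return freq


# Two hypothetical consecutive layers with four nodes each.
V1 = ["a0", "a1", "a2", "a3"]
V2 = ["b0", "b1", "b2", "b3"]
K = 2  # nodes selected per layer in each round (illustrative choice)

# Naive balanced family: every K-subset of V1 paired with every K-subset of V2.
balanced_blocks = [
    (c1, c2)
    for c1 in itertools.combinations(V1, K)
    for c2 in itertools.combinations(V2, K)
]

# Random sampling with the same number of rounds, seeded for repeatability.
rng = random.Random(0)
random_blocks = [
    (tuple(rng.sample(V1, K)), tuple(rng.sample(V2, K)))
    for _ in range(len(balanced_blocks))
]

balanced_freq = edge_frequencies(balanced_blocks)
random_freq = edge_frequencies(random_blocks)

# Distinct frequencies in the balanced family, then the per-edge counts
# under random sampling (which generally differ from edge to edge).
print("balanced:", sorted(set(balanced_freq.values())))
print("random:  ", sorted(random_freq.values()))
```

In the exhaustive family every one of the 16 edges is selected in exactly 9 of the 36 rounds, since each node lies in 3 of the 6 two-element subsets of its layer. The exhaustive family grows quickly with layer size, which is one reason a smaller design with the same balance property would be of interest.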

Keywords

Learning and adaptive systems in artificial intelligence, deep learning, dropout, Combinatorial aspects of block designs, dropout design, split-block design, FOS: Mathematics, Applications of design theory to circuits and networks, Mathematics - Combinatorics, Combinatorics (math.CO), Artificial neural networks and deep learning

Impact by BIP!

- selected citations: 5. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- popularity: Top 10%. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
- influence: Top 10%. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- impulse: Average. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.

Open access routes: Green, bronze
