WeNet: Weighted Networks for Recurrent Network Architecture Search

descriptionPublicationkeyboard_double_arrow_right Article , Preprint 01 Jan 2019Embargo end date: 01 Jan 2019Publisher:arXivJournal:CoRR, volume abs/1904.03819

Authors: Zhiheng Huang; Bing Xiang;

doi: 10.48550/arxiv.1904.03819

arXiv: 1904.03819

WeNet: Weighted Networks for Recurrent Network Architecture Search

- Summary
- Subjects
- Related research
  (4)
- Metrics

Abstract

In recent years, there has been increasing demand for automatic architecture search in deep learning. Numerous approaches have been proposed and led to state-of-the-art results in various applications, including image classification and language modeling. In this paper, we propose a novel way of architecture search by means of weighted networks (WeNet), which consist of a number of networks, with each assigned a weight. These weights are updated with back-propagation to reflect the importance of different networks. Such weighted networks bear similarity to mixture of experts. We conduct experiments on Penn Treebank and WikiText-2. We show that the proposed WeNet can find recurrent architectures which result in state-of-the-art performance.

Keywords

FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Science - Neural and Evolutionary Computing, Neural and Evolutionary Computing (cs.NE), Machine Learning (cs.LG)

4 Research products, page 1 of 1

Book Reviews: The Inner Work of Leaders Leadership as a Habit of Mind Barbara Mackoff and Gary Wenet New York, NY: Amacom, 2000 226 pp., $24.95 Hardcover
2001IsAmongTopNSimilarDocuments
Sur les peuples de nom «vénète» ou assimilé dans l’Occident européen
2003IsAmongTopNSimilarDocuments
WeNet 2.0: More Productive End-to-End Speech Recognition Toolkit
2022IsAmongTopNSimilarDocuments
WeNet: Production Oriented Streaming and Non-Streaming End-to-End Speech Recognition Toolkit
2021IsAmongTopNSimilarDocuments

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average