
The recently proposed context-dependent deep neural network hidden Markov models (CD-DNN-HMMs) have proved highly promising for large-vocabulary speech recognition. In this paper, we develop a more advanced type of DNN, which we call the deep tensor neural network (DTNN). The DTNN extends the conventional DNN by replacing one or more of its layers with a double-projection (DP) layer, in which each input vector is projected into two nonlinear subspaces, and a tensor layer, in which the two subspace projections interact with each other and jointly predict the next layer in the deep architecture. In addition, we describe an approach that maps the tensor layers to conventional sigmoid layers so that the former can be treated and trained in the same way as the latter. With this mapping, a DTNN can be viewed as a DNN augmented with DP layers, so that not only can the back-propagation (BP) learning algorithm of DTNNs be cleanly derived, but new types of DTNNs can also be developed more easily. Evaluation on Switchboard tasks indicates that DTNNs outperform the already high-performing DNNs, with 4-5% and 3% relative word error reduction on the 30-hr and 309-hr training sets, respectively.
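As an illustration not taken from the paper itself, the sketch below shows one plausible reading of a DP + tensor layer in NumPy, assuming sigmoid nonlinearities for both subspace projections and omitting bias terms; the function name, weight shapes, and dimensions are all hypothetical. The key point is the mapping the abstract mentions: because the entries of the Kronecker product of the two projections enumerate all of their pairwise products, the third-order weight tensor can be unfolded into an ordinary matrix, and the tensor layer then looks like a conventional sigmoid layer.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def dp_tensor_layer(v, W1, W2, U):
    """One double-projection (DP) + tensor layer (hypothetical sketch).

    v  : input vector, shape (d,)
    W1 : projection onto the first nonlinear subspace, shape (p, d)
    W2 : projection onto the second nonlinear subspace, shape (q, d)
    U  : third-order interaction tensor unfolded to a matrix, shape (k, p*q)
    """
    h1 = sigmoid(W1 @ v)   # first subspace projection
    h2 = sigmoid(W2 @ v)   # second subspace projection
    # np.kron(h1, h2) lists every product h1[i] * h2[j], so the tensor
    # interaction reduces to a plain matrix-vector product, i.e. the
    # "mapping to a conventional sigmoid layer" described in the abstract.
    return sigmoid(U @ np.kron(h1, h2))

# Toy usage with arbitrary dimensions.
rng = np.random.default_rng(0)
v = rng.standard_normal(40)                   # d = 40 input features
W1 = 0.1 * rng.standard_normal((16, 40))      # p = 16
W2 = 0.1 * rng.standard_normal((16, 40))      # q = 16
U = 0.1 * rng.standard_normal((64, 16 * 16))  # k = 64 output units
h = dp_tensor_layer(v, W1, W2, U)             # shape (64,)
```

Because the unfolded form is a standard affine map followed by a sigmoid, the usual BP derivation carries over to the tensor layer, which is the practical benefit the abstract highlights.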
