Information extraction and user intention identification are central topics in modern query understanding and recommendation systems. In this paper, we propose DeepProbe, a generic information-directed interaction framework which is built around an attention-based seque... View more
 J. Aron. How innovative is apple's new voice assistant, siri? New Scientist, 212(2836):24, 2011.
 T. M. Cover and J. A. Œomas. Elements of information theory. John Wiley & Sons, 2012.
 J. Duchi, E. Hazan, and Y. Singer. Adaptive subgradient methods for online learning and stochastic optimization. Journal of Machine Learning Research, 12(Jul):2121-2159, 2011.
 J. C. Hoch and A. S. Stern. Maximum entropy reconstruction. eMagRes, 1996.
 S. Khudanpur and J. Wu. A maximum entropy language model integrating n-grams and topic dependencies for conversational speech recognition. In Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on, volume 1, pages 553-556. IEEE, 1999.
 M.-T. Luong, H. Pham, and C. D. Manning. E‚ective approaches to aŠentionbased neural machine translation. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pages 1412-1421, Lisbon, Portugal, September 2015. Association for Computational Linguistics.
 M.-T. Luong, I. Sutskever, Q. V. Le, O. Vinyals, and W. Zaremba. Addressing the rare word problem in neural machine translation. arXiv preprint arXiv:1410.8206, 2014.
 V. Mnih, N. Heess, A. Graves, et al. Recurrent models of visual aŠention. In Advances in Neural Information Processing Systems, pages 2204-2212, 2014.
 J. Peters. Speech recognition system, training arrangement and method of calculating iteration values for free parameters of a maximum-entropy speech model, Mar. 7 2006. US Patent 7,010,486.