
arXiv: 1601.06615
Traditional architectures for solving computer vision problems and the degree of success they enjoyed have been heavily reliant on hand-crafted features. However, of late, deep learning techniques have offered a compelling alternative -- that of automatically learning problem-specific features. With this new paradigm, every problem in computer vision is now being re-examined from a deep learning perspective. Therefore, it has become important to understand what kind of deep networks are suitable for a given problem. Although general surveys of this fast-moving paradigm (i.e. deep-networks) exist, a survey specific to computer vision is missing. We specifically consider one form of deep networks widely used in computer vision - convolutional neural networks (CNNs). We start with "AlexNet" as our base CNN and then examine the broad variations proposed over time to suit different applications. We hope that our recipe-style survey will serve as a guide, particularly for novice practitioners intending to use deep-learning techniques for computer vision.
Published in Frontiers in Robotics and AI (http://goo.gl/6691Bm)
Robotics and AI, FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, deep learning, QA75.5-76.95, supervised learning, Machine Learning (cs.LG), Multimedia (cs.MM), Convolutional Neural Networks (CNN), Electronic computers. Computer science, convolutional neural networks, TJ1-1570, recurrent neural networks, object classification, Mechanical engineering and machinery, Computer Science - Multimedia
Robotics and AI, FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, deep learning, QA75.5-76.95, supervised learning, Machine Learning (cs.LG), Multimedia (cs.MM), Convolutional Neural Networks (CNN), Electronic computers. Computer science, convolutional neural networks, TJ1-1570, recurrent neural networks, object classification, Mechanical engineering and machinery, Computer Science - Multimedia
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 161 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 1% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 1% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 1% |
