publication . Preprint . Conference object . 2014

Permutohedral Lattice CNNs

Kiefel, M.; Jampani, V.; Gehler, P.;
Open Access English
  • Published: 20 Dec 2014
Abstract
This paper presents a convolutional layer that is able to process sparse input features. As an example, for image recognition problems this allows an efficient filtering of signals that do not lie on a dense grid (like pixel position), but of more general features (such as color values). The presented algorithm makes use of the permutohedral lattice data structure. The permutohedral lattice was introduced to efficiently implement a bilateral filter, a commonly used image processing operation. Its use allows for a generalization of the convolution type found in current (spatial) convolutional network architectures.
Persistent Identifiers
Subjects
ACM Computing Classification System: ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION
free text keywords: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Learning, Computer Science - Neural and Evolutionary Computing

Adams, Andrew, Gelfand, Natasha, Dolson, Jennifer, and Levoy, Marc. Gaussian kd-trees for fast high-dimensional filtering. In ACM SIGGRAPH 2009 Papers, SIGGRAPH '09, pp. 21:1-21:12, New York, NY, USA, 2009.

Adams, Andrew, Baek, Jongmin, and Davis, Myers Abraham. Fast high-dimensional filtering using the permutohedral lattice. Comput. Graph. Forum, 29(2):753-762, 2010.

Arbelez, Pablo, Maire, Michael, Fowlkes, Charless, and Malik, Jitendra. Contour detection and hierarchical image segmentation. IEEE Trans. Pattern Anal. Mach. Intell., 33(5):898-916, May 2011.

Aurich, Volker and Weule, Jo¨ rg. Non-linear Gaussian filters performing edge preserving diffusion. In Mustererkennung 1995, 17. DAGM-Symposium, Bielefeld, 13.-15. September 1995, Proceedings, pp. 538-545, 1995.

Jia, Yangqing, Shelhamer, Evan, Donahue, Jeff, Karayev, Sergey, Long, Jonathan, Girshick, Ross, Guadarrama, Sergio, and Darrell, Trevor. Caffe: Convolutional architecture for fast feature embedding. arXiv preprint arXiv:1408.5093, 2014. [OpenAIRE]

LeCun, Yann, Bottou, Le´on, Bengio, Yoshua, and Haffner, Patrick. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11):2278-2324, November 1998.

Mallat, Ste´phane. Group invariant scattering. Communications in Pure and Applied Mathematics, 65(10):1331-1398, 2012.

Paris, Sylvain and Durand, Fre´do. A fast approximation of the bilateral filter using a signal processing approach. International Journal of Compututer Vision, 81(1):24-52, January 2009.

Smith, Stephen M. and Brady, J. Michael. SUSAN - a new approach to low level image processing. Int. J. Comput. Vision, 23(1):45-78, May 1997. ISSN 0920-5691.

Tomasi, Carlo and Roberto, Manduchi. Bilateral filtering for gray and color images. In Proceedings of the Sixth International Conference on Computer Vision, ICCV '98, pp. 839-846, Washington, DC, USA, 1998. IEEE Computer Society.

Any information missing or wrong?Report an Issue