Video Scene Parsing with Predictive Feature Learning

Preprint English OPEN
Jin, Xiaojie; Li, Xin; Xiao, Huaxin; Shen, Xiaohui; Lin, Zhe; Yang, Jimei; Chen, Yunpeng; Dong, Jian; Liu, Luoqi; Jie, Zequn; Feng, Jiashi; Yan, Shuicheng;
(2016)
  • Subject: Computer Science - Computer Vision and Pattern Recognition
    acm: TheoryofComputation_MATHEMATICALLOGICANDFORMALLANGUAGES | ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION

In this work, we address the challenging video scene parsing problem by developing effective representation learning methods given limited parsing annotations. In particular, we contribute two novel methods that constitute a unified parsing framework. (1) \textbf{Predic... View more
  • References (44)
    44 references, page 1 of 5

    [1] G. J. Brostow, J. Shotton, J. Fauqueur, and R. Cipolla. Segmentation and recognition using structure from motion point clouds. In ECCV. 2008. 6

    [2] L. Chen, G. Papandreou, I. Kokkinos, K. Murphy, and A. L. Yuille. Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. CoRR, abs/1606.00915, 2016. 5, 8, 9

    [3] L.-C. Chen, G. Papandreou, I. Kokkinos, K. Murphy, and A. L. Yuille. Semantic image segmentation with deep convolutional nets and fully connected crfs. In ICLR, 2015. 1, 2, 6

    [4] M. Cordts, M. Omran, S. Ramos, T. Rehfeld, M. Enzweiler, R. Benenson, U. Franke, S. Roth, and B. Schiele. The cityscapes dataset for semantic urban scene understanding. arXiv preprint arXiv:1604.01685, 2016. 1, 6

    [5] C. Farabet, C. Couprie, L. Najman, and Y. LeCun. Learning hierarchical features for scene labeling. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 35(8):1915-1929, 2013. 1, 2, 7

    [6] G. Floros and B. Leibe. Joint 2d-3d temporally consistent semantic segmentation of street scenes. In CVPR, pages 2823-2830. IEEE, 2012. 2

    [7] G. Ghiasi and C. C. Fowlkes. Laplacian reconstruction and refinement for semantic segmentation. CoRR, abs/1605.02264, 2016. 8

    [8] I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville, and Y. Bengio. Generative adversarial nets. In NIPS, pages 2672- 2680, 2014. 3, 4

    [9] B. L. . X. H. . S. Gould. Multi-class semantic video segmentation with exemplar-based object reasoning. In WACV, 2016. 2, 3

    [10] K. He, X. Zhang, S. Ren, and J. Sun. Deep residual learning for image recognition. arXiv preprint arXiv:1512.03385, 2015. 2, 6

  • Metrics
Share - Bookmark