RGB-D datasets using Microsoft Kinect or similar sensors: a survey

Article English OPEN
Cai, Ziyun ; Han, Jungong ; Liu, Li ; Shao, Ling (2017)
  • Publisher: Springer Nature
  • Journal: Multimedia Tools and Applications
  • Related identifiers: doi: 10.1007/s11042-016-3374-6
  • Subject: G400 | Media Technology | Software | Computer Networks and Communications | Hardware and Architecture
    acm: ComputingMethodologies_COMPUTERGRAPHICS | ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION

RGB-D data has turned out to be a very useful representation of an indoor scene for solving fundamental computer vision problems. It combines the advantages of the color image, which provides appearance information about an object, with those of the depth image, which is immune to variations in color, illumination, rotation angle and scale. With the invention of the low-cost Microsoft Kinect sensor, which was initially used for gaming and later became a popular device for computer vision, high-quality RGB-D data can be acquired easily. In recent years, more and more RGB-D image/video datasets dedicated to various applications have become available, and they are of great importance for benchmarking the state of the art. In this paper, we systematically survey popular RGB-D datasets for different applications, including object recognition, scene classification, hand gesture recognition, 3D simultaneous localization and mapping, and pose estimation. We provide insights into the characteristics of each important dataset, and compare the popularity and difficulty of those datasets. Overall, the main goal of this survey is to give a comprehensive description of the available RGB-D datasets and thus to guide researchers in selecting suitable datasets for evaluating their algorithms.
