R. Alp Güler, N. Neverova, and I. Kokkinos, “Densepose: Dense human pose estimation in the wild,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018, pp. 7297-7306.
K. S. Arun, T. S. Huang, and S. D. Blostein, “Least-squares fitting of two 3-d point sets,” IEEE Transactions on pattern analysis and machine intelligence, no. 5, pp. 698-700, 1987.
P. Azad, D. Münch, T. Asfour, and R. Dillmann, “6-dof model-based tracking of arbitrarily shaped 3d objects,” in 2011 IEEE International Conference on Robotics and Automation, IEEE, 2011, pp. 5204-5209.
H. Bay, T. Tuytelaars, and L. Van Gool, “Surf: Speeded up robust features,” in European conference on computer vision, Springer, 2006, pp. 404-417.
Y. Bengio, A. Courville, and P. Vincent, “Representation learning: A review and new perspectives,” IEEE transactions on pattern analysis and machine intelligence, vol. 35, no. 8, pp. 1798-1828, 2013.
G. Billings and M. Johnson-Roberson, “Silhonet: An rgb method for 3d object pose estimation and grasp planning,” ArXiv preprint arXiv:1809.06893, 2018.
M. Calonder, V. Lepetit, C. Strecha, and P. Fua, “Brief: Binary robust independent elementary features,” in European conference on computer vision, Springer, 2010, pp. 778-792.
Huang, Z. Li, S. Savarese, M. Savva, S. Song, H. Su, et al., “Shapenet: An information-rich 3d model repository,” ArXiv preprint arXiv:1512.03012, 2015.
Fidler, and R. Urtasun, “3d object proposals for accurate object class detection,” in Advances in Neural Information Processing Systems, 2015, pp. 424-432.
Y. Chen, C. Shen, H. Chen, X.-S. Wei, L. Liu, and J. Yang, “Adversarial learning of structure-aware fully convolutional networks for landmark localization,” IEEE transactions on pattern analysis and machine intelligence, 2019.
C. Choi and H. I. Christensen, “Real-time 3d model-based tracking using edge and keypoint features for robotic manipulation,” in 2010 IEEE International Conference on Robotics and Automation, IEEE, 2010, pp. 4048-4055.
A. Collet, M. Martinez, and S. S. Srinivasa, “The moped framework: Object recognition and pose estimation for manipulation,” The International Journal of Robotics Research, vol. 30, no. 10, pp. 1284-1306, 2011.
X. Deng, A. Mousavian, Y. Xiang, F. Xia, T. Bretl, and D. Fox, “Poserbpf: A rao-blackwellized particle filter for 6d object pose tracking,” ArXiv preprint arXiv:1905.09304, 2019.
C. Garcia Cifuentes, J. Issac, M. Wüthrich, S. Schaal, and J. Bohg, “Probabilistic articulated real-time tracking for robot manipulation,” IEEE Robotics and Automation Letters (RAL), vol. 2, no. 2, pp. 577-584, Apr. 2017.
A. Geiger, P. Lenz, C. Stiller, and R. Urtasun, “Vision meets robotics: The kitti dataset,” The International Journal of Robotics Research, vol. 32, no. 11, pp. 1231-1237, 2013.