Activity recognition using a supervised non-parametric hierarchical HMM

Article English OPEN
Raman, Natraj ; Maybank, Stephen (2016)

The problem of classifying human activities occurring in depth image sequences is addressed. The 3D joint positions of a human skeleton and the local depth image pattern around these joint positions define the features. A two level hierarchical Hidden Markov Model (H-HMM), with independent Markov chains for the joint positions and depth image pattern, is used to model the features. The states corresponding to the H-HMM bottom level characterize the granular poses while the top level characterizes the coarser actions associated with the activities. Further, the H-HMM is based on a Hierarchical Dirichlet Process (HDP), and is fully non-parametric with the number of pose and action states inferred automatically from data. This is a significant advantage over classical HMM and its extensions. In order to perform classification, the relationships between the actions and the activity labels are captured using multinomial logistic regression. The proposed inference procedure ensures alignment of actions from activities with similar labels. Our construction enables information sharing, allows incorporation of unlabelled examples and provides a flexible factorized representation to include multiple data channels. Experiments with multiple real world datasets show the efficacy of our classification approach.
  • References (9)

    “Real-time human pose recognition in parts from single depth images”. CVPR (2011).

    Rabiner, L., & Juang, B. H. “An introduction to hidden Markov models”. ASSP Magazine, IEEE 3, 4- 16 (1986).

    Machine learning, 32(1), 41-62 (1998).

    Teh, Yee Whye, Michael I. Jordan, Matthew J. Beal, and David M. Blei, N. “Hierarchical dirichlet processes”. Journal of the American Statistical Association 101.476 (2006).

    Fox, Emily B., Erik B. Sudderth, Michael I. Jordan, and Alan S. Willsky. “An HDP-HMM for systems with state persistence”. Proceedings of the 25th international conference on Machine learning.

    ACM (2008).

    Krishnapuram, B., Carin, L., Figueiredo, M. A., & Hartemink, A. J. “Sparse multinomial logistic regression: Fast algorithms and generalization bounds”. Pattern Analysis and Machine Intelligence, IEEE 27(6), 957-968 (2005).

    Aggarwal, J. K., and Michael S. Ryoo. “Human activity analysis: A review”. ACM Computing Surveys (CSUR) 43.3: 16 (2011).

    Han, Jungong, Ling Shao, Dong Xu, and Jamie Shotton. “Enhanced Computer Vision with Microsoft Kinect Sensor: A Review”. IEEE Transactions on Cybernetics (2013).

  • Metrics
    views in OpenAIRE
    views in local repository
    downloads in local repository

    The information is available from the following content providers:

    From Number Of Views Number Of Downloads
    Birkbeck Institutional Research Online - IRUS-UK 0 5
Share - Bookmark