skip to main content
10.1145/2668956.2668961acmotherconferencesArticle/Chapter ViewAbstractPublication Pagessiggraph-asiaConference Proceedingsconference-collections
research-article

Activity recognition in unconstrained RGB-D video using 3D trajectories

Published:24 November 2014Publication History

ABSTRACT

Human activity recognition in unconstrained RGB--D videos has extensive applications in surveillance, multimedia data analytics, human-computer interaction, etc, but remains a challenging problem due to the background clutter, camera motion, viewpoint changes, etc. We develop a novel RGB--D activity recognition approach that leverages the dense trajectory feature in RGB videos. By mapping the 2D positions of the dense trajectories from RGB video to the corresponding positions in the depth video, we can recover the 3D trajectory of the tracked interest points, which captures important motion information along the depth direction. To characterize the 3D trajectories, we apply motion boundary histogram (MBH) to depth direction and propose 3D trajectory shape descriptors. Our proposed 3D trajectory feature is a good complementary to dense trajectory feature extracted from RGB video only. The performance evaluation on a challenging unconstrained RGB--D activity recognition dataset, i.e., Hollywood 3D, shows that our proposed method outperforms the baseline methods (STIP-based) significantly, and achieves the state-of-the-art performance.

References

  1. Chang, C.-C., and Lin, C.-J. 2011. LIBSVM: A library for support vector machines. ACM Trans. on Intell. Syst. Tech. 2, 27:1--27:27. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Farnebäck, G. 2003. Two-frame motion estimation based on polynomial expansion. In SCIA. 363--370. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Hadfield, S., and Bowden, R. 2013. Hollywood 3d: Recognizing actions in 3d natural scenes. In CVPR, 3398--3405. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Herbst, E., Ren, X., and Fox, D. 2013. Rgb-d flow: Dense 3-d motion estimation using color and depth. In ICRA, 2276--2282.Google ScholarGoogle Scholar
  5. Kliper-Gross, O., Gurovich, Y., Hassner, T., and Wolf, L. 2012. Motion interchange patterns for action recognition in unconstrained videos. In ECCV. 256--269. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Laptev, I., Marszalek, M., Schmid, C., and Rozenfeld, B. 2008. Learning realistic human actions from movies. In CVPR, 1--8.Google ScholarGoogle Scholar
  7. Laptev, I. 2005. On space-time interest points. Int. J. of Comput. Vision 64, 2-3, 107--123. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Liu, J., Luo, J., and Shah, M. 2009. Recognizing realistic actions from videos "in the wild". In CVPR, 1996--2003.Google ScholarGoogle Scholar
  9. Ni, B., Wang, G., and Moulin, P. 2013. Rgbd-hudaact: A color-depth video database for human daily activity recognition. In Consumer Depth Cameras for Computer Vision. 193--208.Google ScholarGoogle Scholar
  10. Perronnin, F., Sánchez, J., and Mensink, T. 2010. Improving the fisher kernel for large-scale image classification. In ECCV. 143--156. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Soomro, K., Zamir, A. R., and Shah, M. 2012. Ucf101: A dataset of 101 human actions classes from videos in the wild. arXiv preprint arXiv:1212.0402.Google ScholarGoogle Scholar
  12. Sung, J., Ponce, C., Selman, B., and Saxena, A. 2011. Human activity detection from rgbd images. AAAI Workshop on Plan, Activity, and Intent Recognition Proceedings 11, 16.Google ScholarGoogle Scholar
  13. Vedaldi, A., and Fulkerson, B. 2010. Vlfeat: An open and portable library of computer vision algorithms. In ACM Multimedia, 1469--1472. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Wang, H., and Schmid, C. 2013. Action recognition with improved trajectories. In ICCV. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Wang, H., Kläser, A., Schmid, C., and Liu, C.-L. 2013. Dense trajectories and motion boundary descriptors for action recognition. Int. J. of Comput. Vision 103, 1, 60--79.Google ScholarGoogle ScholarCross RefCross Ref
  16. Wu, J., Zhang, Y., and Lin, W. 2014. Towards good practices for action video encoding. In CVPR, 2577--2584. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Yuan, J., Liu, Z., and Wu, Y. 2011. Discriminative video pattern search for efficient action detection. IEEE Trans. Pattern Anal. Mach. Intell. 33, 9 (Sept.), 1728--1743. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Zhang, H., and Parker, L. E. 2011. 4-dimensional local spatio-temporal features for human activity recognition. In IROS, 2044--2049.Google ScholarGoogle Scholar

Index Terms

  1. Activity recognition in unconstrained RGB-D video using 3D trajectories

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      SA '14: SIGGRAPH Asia 2014 Autonomous Virtual Humans and Social Robot for Telepresence
      November 2014
      51 pages
      ISBN:9781450332439
      DOI:10.1145/2668956

      Copyright © 2014 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 24 November 2014

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate178of869submissions,20%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader