TR2020-166

Symbiotic Graph Neural Networks for 3D Skeleton-based Human Action Recognition and Motion Prediction


    •  Li, M., Chen, S., Chen, X., Zhang, Y., Wang, Y., Tian, Q., "Symbiotic Graph Neural Networks for 3D Skeleton-based Human Action Recognition and Motion Prediction", IEEE Transactions on Pattern Analysis and Machine Intelligence, December 2020.
      BibTeX TR2020-166 PDF
      • @article{Chen2020dec,
      • author = {Li, Maosen and Chen, Siheng and Chen, Xu and Zhang, Ya and Wang, Yanfeng and Tian, Qi},
      • title = {Symbiotic Graph Neural Networks for 3D Skeleton-based Human Action Recognition and Motion Prediction},
      • journal = {IEEE Transactions on Pattern Analysis and Machine Intelligence},
      • year = 2020,
      • month = dec,
      • url = {https://www.merl.com/publications/TR2020-166}
      • }
  • Research Areas:

    Computer Vision, Machine Learning

Abstract:

3D skeleton-based action recognition and motion prediction are two essential problems of human activity understanding. In many previous works: 1) they studied two tasks separately, neglecting internal correlations; 2) they did not capture sufficient relations inside the body. To address these issues, we propose a symbiotic model to handle two tasks jointly; and we propose two scales of graphs to explicitly capture relations among body-joints and body-parts. Together, we propose symbiotic graph neural networks, which contain a backbone, an action-recognition head, and a motion-prediction head. Two heads are trained jointly and enhance each other. For the backbone, we propose multi-branch multiscale graph convolution networks to extract spatial and temporal features. The multiscale graph convolution networks are based on joint-scale and part-scale graphs. The joint-scale graphs contain actional graphs, capturing action-based relations, and structural graphs, capturing physical constraints. The part-scale graphs integrate body-joints to form specific parts, representing high-level relations. Moreover, dual bone-based graphs and networks are proposed to learn complementary features. We conduct extensive experiments for skeleton-based action recognition and motion prediction with four datasets, NTU-RGB+D, Kinetics, Human3.6M, and CMU Mocap. Experiments show that our symbiotic graph neural networks achieve better performances on both tasks compared to the state-of-the-art methods.

 

  • Related Publication

  •  Li, M., Chen, S., Chen, X., Zhang, Y., Wang, Y., Tian, Q., "Symbiotic Graph Neural Networks for 3D Skeleton-based Human Action Recognition and Motion Prediction", arXiv, October 2020.
    BibTeX arXiv
    • @article{Li2020oct,
    • author = {Li, Maosen and Chen, Siheng and Chen, Xu and Zhang, Ya and Wang, Yanfeng and Tian, Qi},
    • title = {Symbiotic Graph Neural Networks for 3D Skeleton-based Human Action Recognition and Motion Prediction},
    • journal = {arXiv},
    • year = 2020,
    • month = oct,
    • url = {https://arxiv.org/abs/1910.02212}
    • }