University of Wollongong
Browse

Skeleton-based action recognition using LSTM and CNN

Download (452.41 kB)
conference contribution
posted on 2024-11-14, 11:19 authored by Chuankun Li, Pichao Wang, Shuangyin Wang, Yonghong Hou, Wanqing LiWanqing Li
Recent methods based on 3D skeleton data have achieved outstanding performance due to its conciseness, robustness, and view-independent representation. With the development of deep learning, Convolutional Neural Networks (CNN) and Long Short Term Memory (LSTM)-based learning methods have achieved promising performance for action recognition. However, for CNN-based methods, it is inevitable to loss temporal information when a sequence is encoded into images. In order to capture as much spatial-temporal information as possible, LSTM and CNN are adopted to conduct effective recognition with later score fusion. In addition, experimental results show that the score fusion between CNN and LSTM performs better than that between LSTM and LSTM for the same feature. Our method achieved state-of-the-art results on NTU RGB+D datasets for 3D human action analysis. The proposed method achieved 87.40% in terms of accuracy and ranked 1 st place in Large Scale 3D Human Activity Analysis Challenge in Depth Videos.

History

Citation

Li, C., Wang, P., Wang, S., Hou, Y. & Li, W. (2017). Skeleton-based action recognition using LSTM and CNN. IEEE International Conference on Multimedia and Expo Workshops (ICMEW) (pp. 585-590). IEEE Xplore: IEEE.

Parent title

2017 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2017

Pagination

585-590

Language

English

RIS ID

117031

Usage metrics

    Categories

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC