Scopus Harvesting Series

FT-HID: a large-scale RGB-D dataset for first- and third-person human interaction analysis

Zihui Guo, Tianjin University
Yonghong Hou, Tianjin University
Pichao Wang
Zhimin Gao, Zhengzhou University
Mingliang Xu, Zhengzhou University
Wanqing Li, University of Wollongong

Publication Name

Neural Computing and Applications

Abstract

Analysis of human interaction is one important research topic of human motion analysis. It has been studied either using first-person vision (FPV) or third-person vision (TPV). However, the joint learning of both types of vision has so far attracted little attention. One of the reasons is the lack of suitable datasets that cover both FPV and TPV. In addition, existing benchmark datasets of either FPV or TPV have several limitations, including the limited number of samples, participant subjects, interaction categories, and modalities. In this work, we contribute a large-scale human interaction dataset, namely FT-HID dataset. FT-HID contains pair-aligned samples of first-person and third-person visions. The dataset was collected from 109 distinct subjects and has more than 90K samples for three modalities. The dataset has been validated by using several existing action recognition methods. In addition, we introduce a novel multi-view interaction mechanism for skeleton sequences, and a joint learning multi-stream framework for first-person and third-person visions. Both methods yield promising results on the FT-HID dataset. It is expected that the introduction of this vision-aligned large-scale dataset will promote the development of both FPV and TPV, and their joint learning techniques for human action analysis.

Open Access Status

This publication may be available as open access

Volume

Issue

First Page

2007

Last Page

2024

Funding Number

61906173

Funding Sponsor

National Natural Science Foundation of China

Link to Full Text

COinS

Link to publisher version (DOI)

http://dx.doi.org/10.1007/s00521-022-07826-w

Scopus Harvesting Series

FT-HID: a large-scale RGB-D dataset for first- and third-person human interaction analysis

Publication Name

Abstract

Open Access Status

Volume

Issue

First Page

Last Page

Funding Number

Funding Sponsor

Link to publisher version (DOI)

Search

Browse

Links

Scopus Harvesting Series

FT-HID: a large-scale RGB-D dataset for first- and third-person human interaction analysis

Authors

Publication Name

Abstract

Open Access Status

Volume

Issue

First Page

Last Page

Funding Number

Funding Sponsor

Share

Link to publisher version (DOI)

Search

Browse

Links