Faculty of Engineering and Information Sciences - Papers: Part A

Violent scene detection using a super descriptor tensor decomposition

Muhammad Rizwan Khokher, University of Wollongong
Abdesselam Bouzerdoum, University of Wollongong
Son Lam Phung, University of Wollongong

RIS ID

106885

Publication Details

M. Khokher, A. Bouzerdoum & S. L. Phung, "Violent scene detection using a super descriptor tensor decomposition," in Digital Image Computing: Techniques and Applications (DICTA), 2015 International Conference on, 2015, pp. 1-8.

Abstract

This article presents a new method for violent scene detection using super descriptor tensor decomposition. Multi-modal local features, comprising auditory and visual features, are extracted from Mel-frequency cepstral coefficients (including first- and second-order derivatives) and refined dense trajectories. A video sequence usually yields a large number of dense trajectories; some of these are unnecessary and can reduce accuracy. We propose to refine the dense trajectories by selecting only the discriminative trajectories in the region of interest. Visual descriptors consisting of histograms of oriented gradients and motion boundary histograms are computed along the refined dense trajectories. In traditional bag-of-visual-words techniques, the feature descriptors are concatenated to form a single large feature vector for classification. This destroys the spatio-temporal interactions among features extracted from multi-modal data. To address this problem, a super descriptor tensor decomposition is proposed. The extracted feature descriptors are first encoded using the super descriptor vector method. The encoded features are then arranged as tensors so as to retain the spatio-temporal structure of the features. To obtain a compact set of features for classification, the Tucker-3 decomposition is applied to the super descriptor tensors, followed by feature selection using Fisher feature ranking. The selected features are fed to a support vector machine classifier. Experimental evaluation is performed on the violence detection benchmark dataset MediaEval VSD2014. The proposed method outperforms most state-of-the-art methods, achieving MAP2014 scores of 60.2% and 67.8% on two subsets of the dataset.
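The tensor compression and feature ranking steps named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a truncated higher-order SVD as a simple, non-iterative way to obtain a Tucker-3 model, a two-class Fisher score, random data in place of encoded descriptors, and hypothetical tensor sizes and ranks. Decomposing each sample independently is also a simplification, since the abstract does not specify how the decomposition is shared across videos. All function names are illustrative.

```python
import numpy as np


def unfold(tensor, mode):
    """Mode-n unfolding: rows index `mode`, columns index all other modes."""
    return np.moveaxis(tensor, mode, 0).reshape(tensor.shape[mode], -1)


def mode_product(tensor, matrix, mode):
    """Multiply `tensor` along `mode` by `matrix` (shape: new_dim x old_dim)."""
    return np.moveaxis(np.tensordot(matrix, tensor, axes=(1, mode)), 0, mode)


def tucker3_hosvd(tensor, ranks):
    """Truncated HOSVD: returns a core tensor and one factor matrix per mode."""
    factors = []
    for mode, rank in enumerate(ranks):
        # Leading left singular vectors of the unfolding span that mode's subspace.
        u, _, _ = np.linalg.svd(unfold(tensor, mode), full_matrices=False)
        factors.append(u[:, :rank])
    core = tensor
    for mode, u in enumerate(factors):
        # Project each mode onto its factor basis to obtain the compact core.
        core = mode_product(core, u.T, mode)
    return core, factors


def reconstruct(core, factors):
    """Map a core tensor back to the original space using the factor matrices."""
    out = core
    for mode, u in enumerate(factors):
        out = mode_product(out, u, mode)
    return out


def fisher_scores(features, labels):
    """Two-class Fisher score per feature column (higher = more discriminative)."""
    pos, neg = features[labels == 1], features[labels == 0]
    between = (pos.mean(axis=0) - neg.mean(axis=0)) ** 2
    within = pos.var(axis=0) + neg.var(axis=0) + 1e-12  # avoid division by zero
    return between / within


# Hypothetical data: 20 samples, each a 4 x 6 x 5 tensor of encoded features.
rng = np.random.default_rng(0)
tensors = rng.normal(size=(20, 4, 6, 5))
labels = np.array([0, 1] * 10)  # assumed labels: 1 = violent, 0 = non-violent

# Compress each tensor to a 2 x 3 x 2 core and flatten it into a feature vector.
cores = [tucker3_hosvd(t, ranks=(2, 3, 2))[0] for t in tensors]
features = np.stack([c.ravel() for c in cores])

# Keep the five highest-ranked features; these would then go to a classifier.
top_k = np.argsort(fisher_scores(features, labels))[::-1][:5]
selected = features[:, top_k]
```

Flattening the core rather than the original tensor is what gives the compact representation: here each sample shrinks from 120 values to 12 before ranking reduces it further to 5.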

Grant Number

ARC/DP140101833

Additional Grant Number

http://purl.org/au-research/grants/ARC/DP140101833

Included in

Engineering Commons, Science and Technology Studies Commons

Link to publisher version (DOI)

http://dx.doi.org/10.1109/DICTA.2015.7371320

Grant Link

http://purl.org/au-research/grants/ARC/DP140101833
