Abstract

In this paper, we propose a novel temporal spiking recurrent neural network (TSRNN) for robust action recognition in videos. TSRNN employs a spiking architecture that uses local discriminative features from high-confidence, reliable frames as spiking signals. Conventional CNN-RNNs for this task treat all frames as equally important, which makes them error-prone on noisy frames. TSRNN addresses this with a temporal pooling architecture that helps the RNN select sparse, reliable frames and strengthens its ability to model long-range temporal information. In addition, a message-passing bridge connects the spiking signals to the recurrent unit, so the spiking signals can guide the RNN to protect its long-term memory from contamination by noisy frames with distracting factors (e.g., occlusion, rapid scene transitions). With these two novel components, TSRNN achieves competitive performance against state-of-the-art CNN-RNN architectures on two large-scale public benchmarks, UCF101 and HMDB51.
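As a rough illustration of the idea described above, the following NumPy sketch shows how confidence-gated signals from reliable frames could correct a recurrent memory. It is not the authors' implementation: the dimensions, random weights, confidence threshold, and the additive form of the bridge are all hypothetical stand-ins.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

# Hypothetical dimensions (not from the paper).
d_feat, d_hid, n_frames = 8, 16, 10

# Stand-ins for per-frame CNN features and per-frame reliability scores.
feats = rng.standard_normal((n_frames, d_feat))
conf = sigmoid(rng.standard_normal(n_frames))  # reliability in (0, 1)

# Plain RNN weights plus a "bridge" matrix injecting spiking signals.
W_x = 0.1 * rng.standard_normal((d_hid, d_feat))
W_h = 0.1 * rng.standard_normal((d_hid, d_hid))
W_s = 0.1 * rng.standard_normal((d_hid, d_feat))  # message-passing bridge

threshold = 0.5  # only high-confidence frames emit a spiking signal
h = np.zeros(d_hid)
for t in range(n_frames):
    # Ordinary recurrent update over every frame, reliable or not.
    h = np.tanh(W_x @ feats[t] + W_h @ h)
    # Sparse correction: reliable frames nudge the memory via the bridge.
    if conf[t] > threshold:
        h = h + conf[t] * np.tanh(W_s @ feats[t])

print(h.shape)
```

The key design point this sketch mirrors is that noisy frames still pass through the recurrence, but only frames above the confidence threshold contribute a corrective signal to the long-term memory.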
