Two-Stream SR-CNNs for Action Recognition in Videos

Wang Yifan, Jie Song, Limin Wang, Luc Van Gool, Otmar Hilliges

Type

Publication

British Machine Vision Conference 2016

Abstract

We propose a new deep architecture by incorporating object/human detection results into the framework for action recognition, called two-stream semantic region based CNNs (SR-CNNs). We perform experiments on UCF101 dataset and demonstrate its superior performance to the original two-stream CNNs.

Two-Stream SR-CNNs for Action Recognition in Videos

Abstract

Network Architecture