Dynamic Image Networks for Action Recognition

Open Access
Authors
  • S. Gould
Publication date 2016
Book title Proceedings 29th IEEE Conference on Computer Vision and Pattern Recognition : CVPR 2016
Book subtitle 26 June-1 July 2016, Las Vegas, Nevada
ISBN
  • 9781467388528
ISBN (electronic)
  • 9781509014385
  • 9781467388511
  • 9781467388504
Event 29th IEEE Conference on Computer Vision and Pattern Recognition
Pages (from-to) 3034-3042
Publisher Los Alamitos, California: IEEE Computer Society
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
We introduce the concept of dynamic image, a novel compact representation of videos useful for video analysis especially when convolutional neural networks (CNNs) are used. The dynamic image is based on the rank pooling concept and is obtained through the parameters of a ranking machine that encodes the temporal evolution of the frames of the video. Dynamic images are obtained by directly applying rank pooling on the raw image pixels of a video producing a single RGB image per video. This idea is simple but powerful as it enables the use of existing CNN models directly on video data with fine-tuning. We present an efficient and effective approximate rank pooling operator, speeding it up orders of magnitude compared to rank pooling. Our new approximate rank pooling CNN layer allows us to generalize dynamic images to dynamic feature maps and we demonstrate the power of our new representations on standard benchmarks in action recognition achieving state-of-the-art performance.
Document type Conference contribution
Language English
Published at https://doi.org/10.1109/CVPR.2016.331
Other links https://ivi.fnwi.uva.nl/isis/publications/2016/BilenCVPR2016
Downloads
cvpr2016bilen (Accepted author manuscript)
Permalink to this page
Back