Using phase instead of optical flow for action recognition

Authors
Publication date 2019
Host editors
  • L. Leal-Taixé
  • S. Roth
Book title Computer Vision – ECCV 2018 Workshops
Book subtitle Munich, Germany, September 8-14, 2018 : proceedings
ISBN
  • 9783030110239
ISBN (electronic)
  • 9783030110246
Series Lecture Notes in Computer Science
Event 15th European Conference on Computer Vision, Workshops
Volume | Issue number VI
Pages (from-to) 678-691
Publisher Cham: Springer
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
Currently, the most common motion representation for action recognition is optical flow. Optical flow is based on particle tracking which adheres to a Lagrangian perspective on dynamics. In contrast to the Lagrangian perspective, the Eulerian model of dynamics does not track, but describes local changes. For video, an Eulerian phase-based motion representation, using complex steerable filters, has been successfully employed recently for motion magnification and video frame interpolation. Inspired by these previous works, here, we proposes learning Eulerian motion representations in a deep architecture for action recognition. We learn filters in the complex domain in an end-to-end manner. We design these complex filters to resemble complex Gabor filters, typically employed for phase-information extraction. We propose a phase-information extraction module, based on these complex filters, that can be used in any network architecture for extracting Eulerian representations. We experimentally analyze the added value of Eulerian motion representations, as extracted by our proposed phase extraction module, and compare with existing motion representations based on optical flow, on the UCF101 dataset.
Document type Conference contribution
Language English
Published at https://doi.org/10.1007/978-3-030-11024-6_51
Other links https://ivi.fnwi.uva.nl/isis/publications/2018/HommosECCV2018
Permalink to this page
Back