Few-Shot Ensemble Learning for Video Classification with SlowFast Memory Networks

Authors
  • Y. Yang
  • J. Luo
Publication date 2020
Book title MM '20
Book subtitle proceedings of the 28th ACM International Conference on Multimedia : October 12-16, 2020, Virtual Event, USA
ISBN (electronic)
  • 9781450379885
Event 28th ACM International Conference on Multimedia
Pages (from-to) 3007-3015
Publisher New York, NY: Association for Computing Machinery
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
In the era of big data, few-shot learning has recently received much attention in multimedia analysis and computer vision due to its appealing ability of learning from scarce labeled data. However, it has been largely underdeveloped in the video domain, which is even more challenging due to the huge spatial-temporal variability of video data. In this paper, we address few-shot video classification by learning an ensemble of SlowFast networks augmented with memory units. Specifically, we introduce a family of few-shot learners based on SlowFast networks which are used to extract informative features at multiple rates, and we incorporate a memory unit into each network to enable encoding and retrieving crucial information instantly. Furthermore, we propose a choice controller network to leverage the diversity of few-shot learners by learning to adaptively assign a confidence score to each SlowFast memory network, leading to a strong classifier for enhanced prediction. Experimental results on two widely-adopted video datasets demonstrate the effectiveness of the proposed method, as well as its superior performance over the state-of-the-art approaches.
Document type Conference contribution
Note With supplemental material
Language English
Published at https://doi.org/10.1145/3394171.3416269
Permalink to this page
Back