DeepProposals: Hunting objects and actions by cascading deep convolutional layers

Open Access
Authors
  • L. Van Gool
Publication date 09-2017
Journal International Journal of Computer Vision
Volume | Issue number 124 | 2
Pages (from-to) 115-131
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
In this paper, a new method for generating object and action proposals in images and videos is proposed. It builds on activations of different convolutional layers of a pretrained CNN, combining the localization accuracy of the early layers with the high informativeness (and hence recall) of the later layers. To this end, we build an inverse cascade that, going backward from the later to the earlier convolutional layers of the CNN, selects the most promising locations and refines them in a coarse-to-fine manner. The method is efficient, because (i) it re-uses the same features extracted for detection, (ii) it aggregates features using integral images, and (iii) it avoids a dense evaluation of the proposals thanks to the use of the inverse coarse-to-fine cascade. The method is also accurate. We show that DeepProposals outperform most of the previous object proposal and action proposal approaches and, when plugged into a CNN-based object detector, produce state-of-the-art detection performance.
Document type Article
Language English
Published at https://doi.org/10.1007/s11263-017-1006-x
Other links https://ivi.fnwi.uva.nl/isis/publications/2017/GhodratiIJCV2017
Downloads
DeepProposals (Final published version)
Permalink to this page
Back