Audio-Adaptive Activity Recognition Across Video Domains

Open Access
Authors
Publication date 2022
Book title 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition
Book subtitle New Orleans, Louisiana, 19-24 June 2022 : proceedings
ISBN
  • 9781665469470
ISBN (electronic)
  • 9781665469463
Series CVPR
Event 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022
Pages (from-to) 13781-13790
Publisher Los Alamitos, California: IEEE Computer Society
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
This paper strives for activity recognition under domain shift, for example caused by change of scenery or camera viewpoint. The leading approaches reduce the shift in activity appearance by adversarial training and self-supervised learning. Different from these vision-focused works we leverage activity sounds for domain adaptation as they have less variance across domains and can reliably indicate which activities are not happening. We propose an audio-adaptive encoder and associated learning methods that discriminatively adjust the visual feature representation as well as addressing shifts in the semantic distribution. To further eliminate domain-specific features and include domain-invariant activity sounds for recognition, an audio-infused recognizer is proposed, which effectively models the cross-modal interaction across domains. We also introduce the new task of actor shift, with a corresponding audio-visual dataset, to challenge our method with situations where the activity appearance changes dramatically. Experiments on this dataset, EPIC-Kitchens and CharadesEgo show the effectiveness of our approach.
Document type Conference contribution
Note With supplementary file
Language English
Related dataset ActorShift.zip
Published at https://doi.org/10.48550/arXiv.2203.14240 https://doi.org/10.1109/CVPR52688.2022.01342
Published at https://openaccess.thecvf.com/content/CVPR2022/html/Zhang_Audio-Adaptive_Activity_Recognition_Across_Video_Domains_CVPR_2022_paper.html
Other links https://xiaobai1217.github.io/DomainAdaptation https://www.proceedings.com/65666.html
Downloads
Supplementary materials
Permalink to this page
Back