Background no more: Action recognition across domains by causal interventions

Authors	S. Rastegar, H. Doughty, C.G.M. Snoek
Publication date	05-2024
Journal	Computer Vision and Image Understanding
Article number	103975
Volume | Issue number	242
Number of pages	9
Organisations	Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract	We aim to recognize actions under an appearance distribution shift between a source training domain and a target test domain. To enable such video domain generalization, our key idea is to intervene on the action to remove the confounding effect of the domain background on the class label using causal inference. Towards this, we propose to learn a causally debiased model on a source domain that intervenes on the action through three possible do-operators, which separate the action and background. To better align the source and target distributions, we also introduce a test-time action intervention. Experiments on two challenging video domain generalization benchmarks reveal that causal inference is a promising tool for action recognition as it already achieves state-of-the-art results on Kinetics2Mimetics, the benchmark with the largest domain shift.
Document type	Article
Language	English
Published at	https://doi.org/10.1016/j.cviu.2024.103975 (Final published version)
Downloads	Background no more (Final published version)
Supplementary materials	Supplementary material