Skeleton-Contrastive 3D Action Representation Learning

F.M. Thoker; H. Doughty; C.G.M. Snoek

doi:https://doi.org/10.1145/3474085.3475307

Skeleton-Contrastive 3D Action Representation Learning

Authors	F.M. Thoker H. Doughty C.G.M. Snoek
Publication date	2021
Book title	MM '21
Book subtitle	Proceedings of the 29th ACM International Conference on Multimedia : October 20-24, 2021, Virtual Event, China
ISBN (electronic)	9781450386517
Event	29th ACM International Conference on Multimedia, MM 2021
Pages (from-to)	1655-1663
Publisher	New York, NY: Association for Computing Machinery
Organisations	Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract	This paper strives for self-supervised learning of a feature space suitable for skeleton-based action recognition. Our proposal is built upon learning invariances to input skeleton representations and various skeleton augmentations via a noise contrastive estimation. In particular, we propose inter-skeleton contrastive learning, which learns from multiple different input skeleton representations in a cross-contrastive manner. In addition, we contribute several skeleton-specific spatial and temporal augmentations which further encourage the model to learn the spatio-temporal dynamics of skeleton data. By learning similarities between different skeleton representations as well as augmented views of the same sequence, the network is encouraged to learn higher-level semantics of the skeleton data than when only using the augmented views. Our approach achieves state-of-the-art performance for self-supervised learning from skeleton data on the challenging PKU and NTU datasets with multiple downstream tasks, including action recognition, action retrieval and semi-supervised learning. Code is available at https://github.com/fmthoker/skeleton-contrast.
Document type	Conference contribution
Note	With supplemental material.
Language	English
Published at	https://doi.org/10.1145/3474085.3475307 (Final published version)
Other links	https://github.com/fmthoker/skeleton-contrast
Downloads	3474085.3475307 (Final published version)
Supplementary materials	mfp0802aux
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

Skeleton-Contrastive 3D Action Representation Learning