Unsupervised Human Action Recognition with Skeletal Graph Laplacian and Self-Supervised Viewpoints Invariance

This paper presents a novel end-to-end method for the problem of skeleton-based unsupervised human action recognition. We propose a new architecture with a convolu- tional autoencoder that uses graph Laplacian regularization to model the skeletal geom- etry across the temporal dynamics of actions. Our approach is robust towards viewpoint variations by including a self-supervised gradient reverse layer that ensures generaliza- tion across camera views. The proposed method is validated on NTU-60 and NTU-120 large-scale datasets in which it outperforms all prior unsupervised skeleton-based ap- proaches on the cross-subject, cross-view, and cross-setup protocols. Although unsuper- vised, our learnable representation allows our method even to surpass a few supervised skeleton-based action recognition methods. The code is available in: www.github. com/IIT-PAVIS/UHAR_Skeletal_Laplacian

(2021). Unsupervised Human Action Recognition with Skeletal Graph Laplacian and Self-Supervised Viewpoints Invariance . Retrieved from https://hdl.handle.net/10446/260627

Unsupervised Human Action Recognition with Skeletal Graph Laplacian and Self-Supervised Viewpoints Invariance

Paoletti, Giancarlo;Cavazza, Jacopo;Beyan, Cigdem;Del Bue, Alessio

2021-01-01

Abstract

This paper presents a novel end-to-end method for the problem of skeleton-based unsupervised human action recognition. We propose a new architecture with a convolu- tional autoencoder that uses graph Laplacian regularization to model the skeletal geom- etry across the temporal dynamics of actions. Our approach is robust towards viewpoint variations by including a self-supervised gradient reverse layer that ensures generaliza- tion across camera views. The proposed method is validated on NTU-60 and NTU-120 large-scale datasets in which it outperforms all prior unsupervised skeleton-based ap- proaches on the cross-subject, cross-view, and cross-setup protocols. Although unsuper- vised, our learnable representation allows our method even to surpass a few supervised skeleton-based action recognition methods. The code is available in: www.github. com/IIT-PAVIS/UHAR_Skeletal_Laplacian

Scheda breve

Scheda completa

Scheda completa (DC)

	Data di pubblicazione
	
				2021
			
	Tutti gli autori
	
						Paoletti, Giancarlo; Cavazza, Jacopo; Beyan, Cigdem; Del Bue, Alessio
					
	Nelle collezioni:
	
				1.4.01 Contributi in atti di convegno - Conference presentations

File allegato/i alla scheda:

File	Dimensione del file	Formato
IC22_Unsupervised Human Action Recognition.pdf accesso aperto Versione: publisher's version - versione editoriale Licenza: Licenza default Aisberg Dimensione del file 8.58 MB Formato Adobe PDF Visualizza/Apri	8.58 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

Aisberg ©2008 Servizi bibliotecari, Università degli studi di Bergamo | Terms of use/Condizioni di utilizzo

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10446/260627

Citazioni

12

ND

social impact