Modeling Multiple Temporal Scales of Full-Body Movements for Emotion Classification

This article investigates classification of emotions from full-body movements by using a novel Convolutional Neural Network-based architecture. The model is composed of two shallow networks processing in parallel when the 8-bit RGB images obtained from time intervals of 3D-positional data are the inputs. One network performs a coarse-grained modelling in the time domain while the other one applies a fine-grained modelling. We show that combining different temporal scales into a single architecture improves the classification results of a dataset composed of short excerpts of the performances of professional dancers who interpreted four affective states: anger, happiness, sadness, and insecurity. Additionally, we investigate the effect of data chunk duration, overlapping, the size of the input images and the contribution of several data augmentation strategies for our proposed method. Better recognition results were obtained when the duration of a data chunk was longer, and this was further improved by applying balanced data augmentation. Moreover, we test our method on other existing motion capture datasets and compare the results with prior art. In all experiments, our results surpassed the state-of-the-art approaches, showing that this method generalizes across diverse settings and contexts.

(2023). Modeling Multiple Temporal Scales of Full-Body Movements for Emotion Classification [journal article - articolo]. In IEEE TRANSACTIONS ON AFFECTIVE COMPUTING. Retrieved from https://hdl.handle.net/10446/260533

Modeling Multiple Temporal Scales of Full-Body Movements for Emotion Classification

Beyan, Cigdem;Karumuri, Sukumar;Volpe, Gualtiero;Camurri, Antonio;Niewiadomski, Radoslaw

2023-01-01

Abstract

This article investigates classification of emotions from full-body movements by using a novel Convolutional Neural Network-based architecture. The model is composed of two shallow networks processing in parallel when the 8-bit RGB images obtained from time intervals of 3D-positional data are the inputs. One network performs a coarse-grained modelling in the time domain while the other one applies a fine-grained modelling. We show that combining different temporal scales into a single architecture improves the classification results of a dataset composed of short excerpts of the performances of professional dancers who interpreted four affective states: anger, happiness, sadness, and insecurity. Additionally, we investigate the effect of data chunk duration, overlapping, the size of the input images and the contribution of several data augmentation strategies for our proposed method. Better recognition results were obtained when the duration of a data chunk was longer, and this was further improved by applying balanced data augmentation. Moreover, we test our method on other existing motion capture datasets and compare the results with prior art. In all experiments, our results surpassed the state-of-the-art approaches, showing that this method generalizes across diverse settings and contexts.

Scheda breve

Scheda completa

Scheda completa (DC)

	Tipo di articolo
	
				articolo
			
	Data di pubblicazione
	
				2023
			
	Rivista in ANCE
	
				IEEE TRANSACTIONS ON AFFECTIVE COMPUTING
			
	Tutti gli autori
	
						Beyan, Cigdem; Karumuri, Sukumar; Volpe, Gualtiero; Camurri, Antonio; Niewiadomski, Radoslaw
					
	Citazione
	
				(2023). Modeling Multiple Temporal Scales of Full-Body Movements for Emotion Classification  [journal article - articolo]. In IEEE TRANSACTIONS ON AFFECTIVE COMPUTING. Retrieved from https://hdl.handle.net/10446/260533
			
	Nelle collezioni:
	
				1.1.01 Articoli/Saggi in rivista - Journal Articles/Essays

File allegato/i alla scheda:

File	Dimensione del file	Formato
Modeling_Multiple_Temporal_Scales_of_Full-Body_Movements_for_Emotion_Classification.pdf accesso aperto Versione: publisher's version - versione editoriale Licenza: Creative commons Dimensione del file 1.54 MB Formato Adobe PDF Visualizza/Apri	1.54 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

Aisberg ©2008 Servizi bibliotecari, Università degli studi di Bergamo | Terms of use/Condizioni di utilizzo

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10446/260533

Citazioni

13

13

social impact