Unsupervised Video Anomaly Detection with Diffusion Models Conditioned on Compact Motion Representations

This paper addresses the unsupervised video anomaly detection (VAD) problem, which involves classifying each frame in a video as normal or abnormal without any access to labels. To accomplish this, the proposed method employs conditional diffusion models, where the input data are spatiotemporal features extracted from a pre-trained network, and the condition is a set of features extracted from compact motion representations that summarize a given video segment in terms of its motion and appearance. Our method uses a data-driven threshold and treats a high reconstruction error as an indicator of anomalous events. This study is the first to use compact motion representations for VAD, and experiments on two large-scale VAD benchmarks demonstrate that they supply relevant information to the diffusion model and consequently improve VAD performance with respect to the prior art. Importantly, our method exhibits better generalization performance across different datasets, notably outperforming both state-of-the-art and baseline methods. The code of our method is publicly available.
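The decision rule described in the abstract (a high reconstruction error, compared against a data-driven threshold, flags an anomaly) can be illustrated with a minimal sketch. This is not the authors' implementation: the diffusion model is replaced by hard-coded toy reconstructions, and the threshold rule (mean plus `k` standard deviations of the observed errors) is an assumption chosen only for illustration, as are all function and variable names.

```python
from statistics import mean, pstdev


def reconstruction_errors(features, reconstructions):
    """Mean squared error between each feature vector and its reconstruction."""
    errors = []
    for x, x_hat in zip(features, reconstructions):
        errors.append(sum((a - b) ** 2 for a, b in zip(x, x_hat)) / len(x))
    return errors


def data_driven_threshold(errors, k=1.0):
    """Hypothetical rule: threshold derived from the error distribution itself.

    Assumption for illustration only; the abstract does not specify the rule.
    """
    return mean(errors) + k * pstdev(errors)


def label_segments(errors, threshold):
    """True marks a segment whose error exceeds the threshold (anomalous)."""
    return [e > threshold for e in errors]


# Toy stand-ins for features of four video segments and for the outputs of a
# conditional generative model; the last segment is reconstructed poorly.
features = [[0.0, 1.0], [1.0, 1.0], [0.5, 0.5], [2.0, 2.0]]
reconstructions = [[0.0, 1.0], [1.0, 0.9], [0.5, 0.6], [0.0, 0.0]]

errors = reconstruction_errors(features, reconstructions)
threshold = data_driven_threshold(errors)
labels = label_segments(errors, threshold)
print(labels)
```

No labels are used at any point: the threshold is computed from the unlabeled errors alone, which is what makes the setting unsupervised.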

Tur, A. O., Dall'Asen, N., Beyan, C., & Ricci, E. (2023). Unsupervised Video Anomaly Detection with Diffusion Models Conditioned on Compact Motion Representations. Retrieved from https://hdl.handle.net/10446/260622


Tur, Anil Osman;Dall'Asen, Nicola;Beyan, Cigdem;Ricci, Elisa

2023-01-01



Publication date: 2023
All authors: Tur, Anil Osman; Dall'Asen, Nicola; Beyan, Cigdem; Ricci, Elisa
In collections: 1.4.01 Contributi in atti di convegno - Conference presentations

Files attached to this record:

File	File size	Format
IC27_Unsupervised Video Anomaly Detection with Diffusion Models.pdf (publisher's version; access: repository managers only; license: Aisberg default license)	795.13 kB	Adobe PDF



Use this identifier to cite or link to this document: https://hdl.handle.net/10446/260622
