One-shot Unsupervised Domain Adaptation with Personalized Diffusion Models

Adapting a segmentation model from a labeled source domain to a target domain, where a single unlabeled datum is available, is one of the most challenging problems in domain adaptation and is otherwise known as one-shot unsupervised domain adaptation (OSUDA). Most of the prior works have addressed the problem by relying on style transfer techniques, where the source images are stylized to have the appearance of the target domain. Departing from the common notion of transferring only the target "texture" information, we leverage text-to-image diffusion models (e.g., Stable Diffusion) to generate a synthetic target dataset with photo-realistic images that not only faithfully depict the style of the target domain, but are also characterized by novel scenes in diverse contexts. The text interface in our method Data AugmenTation with diffUsion Models (DATUM) endows us with the possibility of guiding the generation of images towards desired semantic concepts while respecting the original spatial context of a single training image, which is not possible in existing OSUDA methods. Extensive experiments on standard benchmarks show that our DATUM surpasses the state-of-the-art OSUDA methods by up to +7.1%. The implementation is available at : https://github.com/yasserben/DATUM

(2023). One-shot Unsupervised Domain Adaptation with Personalized Diffusion Models . Retrieved from https://hdl.handle.net/10446/311033

One-shot Unsupervised Domain Adaptation with Personalized Diffusion Models

Benigmim Y.;Roy S.;Essid S.;Kalogeiton V.;Lathuilière S.

2023-01-01

Abstract

Adapting a segmentation model from a labeled source domain to a target domain, where a single unlabeled datum is available, is one of the most challenging problems in domain adaptation and is otherwise known as one-shot unsupervised domain adaptation (OSUDA). Most of the prior works have addressed the problem by relying on style transfer techniques, where the source images are stylized to have the appearance of the target domain. Departing from the common notion of transferring only the target "texture" information, we leverage text-to-image diffusion models (e.g., Stable Diffusion) to generate a synthetic target dataset with photo-realistic images that not only faithfully depict the style of the target domain, but are also characterized by novel scenes in diverse contexts. The text interface in our method Data AugmenTation with diffUsion Models (DATUM) endows us with the possibility of guiding the generation of images towards desired semantic concepts while respecting the original spatial context of a single training image, which is not possible in existing OSUDA methods. Extensive experiments on standard benchmarks show that our DATUM surpasses the state-of-the-art OSUDA methods by up to +7.1%. The implementation is available at : https://github.com/yasserben/DATUM

Scheda breve

Scheda completa

Scheda completa (DC)

	DOI del contributo
	
				https://dx.doi.org/10.1109/CVPRW59228.2023.00077
			
	Identificativo ISI
	
				WOS:001055056500073
			
	Identificativo SCOPUS
	
				2-s2.0-85170826419
			
	Data di pubblicazione
	
				2023
			
	Lingua/e del contenuto
	
				Inglese
			
	Titolo del volume/Fascicolo monografico/Collezione online
	
				2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
			
	ISBN degli Atti
	
				979-8-3503-0250-9
			
	ISBN della versione online
	
				979-8-3503-0249-3
			
	Serie/collana in ANCE
	
				IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS
			
	URL degli Atti
	
				https://ieeexplore.ieee.org/xpl/conhome/10208270/proceeding
			
	Volume rivista o collana
	
				2023
			
	Pag. iniziale
	
				698
			
	Pag. finale
	
				708
			
	Formato
	
				cartaceo
online
			
	Paese di pubblicazione
	
				United States
			
	Città di pubblicazione
	
				Piscataway
			
	Editore
	
				IEEE (Institute of Electrical and Electronics Engineers)
			
	Nome del convegno
	
				CVPRW 2023: Conference on Computer Vision and Pattern Recognition Workshops, Vancouver, Canada, 18-22 June 2023
			
	Luogo del convegno
	
				Vancouver, Canada
			
	Periodo del convegno
	
				18-22 June 2023
			
	Rilevanza del convegno
	
				internazionale
			
	Tipo di intervento
	
				contributo
			
	Settore scientifico-disciplinare (validi dal 09/05/2024)
	
				Settore IINF-05/A - Sistemi di elaborazione delle informazioni
			
	Keywords
	
				Domain Adaptation
			
	Tipologia
	
				info:eu-repo/semantics/conferenceObject
			
	Numero autori
	
				5
			
	Tutti gli autori
	
						Benigmim, Y.; Roy, Subhankar; Essid, S.; Kalogeiton, V.; Lathuilière, S.
					
	Tipologia
	
				1.4 Contributi in atti di convegno - Contributions in conference proceedings::1.4.01 Contributi in atti di convegno - Conference presentations
			
	Fulltext
	
				reserved
			
	description.file
	
				Non definito
			
	Tipologia sito docente
	
				273
			
	Citazione
	
				(2023). One-shot Unsupervised Domain Adaptation with Personalized Diffusion Models . Retrieved from https://hdl.handle.net/10446/311033
			
	Nelle collezioni:
	
				1.4.01 Contributi in atti di convegno - Conference presentations

File allegato/i alla scheda:

File	Dimensione del file	Formato
Benigmim_One-Shot_Unsupervised.pdf Solo gestori di archivio Versione: publisher's version - versione editoriale Licenza: Licenza default Aisberg Dimensione del file 8.92 MB Formato Adobe PDF Visualizza/Apri	8.92 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

Aisberg ©2008 Servizi bibliotecari, Università degli studi di Bergamo | Terms of use/Condizioni di utilizzo

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10446/311033

Citazioni

45

35

social impact