Riva, S. G., Cazzaniga, P., & Tangherloni, A. (2021). Integration of Multiple scRNA-Seq Datasets on the Autoencoder Latent Space. Retrieved from http://hdl.handle.net/10446/202988

Integration of Multiple scRNA-Seq Datasets on the Autoencoder Latent Space

Riva, Simone G.; Cazzaniga, Paolo; Tangherloni, Andrea
2021-01-01

Abstract

The application of single-cell transcriptomic sequencing technologies, such as single-cell RNA sequencing (scRNA-Seq), has increased dramatically in recent years, allowing for the elucidation of the molecular processes driving both normal cell development and the onset of pathologies. In particular, scRNA-Seq can be exploited to investigate cell heterogeneity at single-cell resolution and to identify the variety of known and putatively novel cell populations, which can potentially have different functional roles in different contexts. However, the heterogeneity among cells of the same cell type can make the integration of multiple scRNA-Seq datasets a challenging task. In this context, non-negligible technical batch effects in the datasets, which may arise from the sequencing technology employed and from the size of the experiment, must be taken into account to integrate the data correctly. In this work, we present a novel strategy based on Autoencoders (AEs) for the integration of multiple scRNA-Seq datasets. We compare its performance with that of other integration strategies that do not use a batch-effect removal step, as such a step might introduce artifacts in the datasets. Our results, obtained on three different datasets, suggest that AEs are a suitable strategy for the integration of scRNA-Seq datasets, achieving better performance than the other approaches considered, namely Scanorama, Ingest, and Seurat, in most cases.
2021
Riva, Simone G.; Cazzaniga, Paolo; Tangherloni, Andrea