The collection of data during the routine delivery of care is changing the healthcare sector. Indeed, only from the clinical trial data it is difficult to obtain such a complete picture of the status of a patient as that provided by real-world data. However, the creation of valuable real-word evidence requires the adoption of an appropriate solution to ingest, store, and process the enormous amount of information coming from all the involved, typically heterogeneous data sources. Data lake technologies are depicted as promising solutions for enhancing data management and analysis capabilities in the healthcare domain: we can rely on them to manage the complexity of big data volume and variety, providing data analysts with a self-service environment in which advanced analytics can be applied. In this paper we envision the adoption of a data lake federation through which organizations could achieve further benefits by sharing data. Exchanging data adds new research challenges related to guaranteeing data reliability and sovereignty. For instance, the collected data should be accurately described in order to document their quality, facilitate their discovery, define security and privacy policies. On the basis of the experience in Health Big Data, we are going to present an architecture for gathering real-world evidence, also identifying the research challenges from an IT perspective.

(2023). Enabling Real-World Medicine with Data Lake Federation: A Research Perspective . Retrieved from https://hdl.handle.net/10446/294369

Enabling Real-World Medicine with Data Lake Federation: A Research Perspective

Salnitri, Mattia;
2023-01-01

Abstract

The collection of data during the routine delivery of care is changing the healthcare sector. Indeed, only from the clinical trial data it is difficult to obtain such a complete picture of the status of a patient as that provided by real-world data. However, the creation of valuable real-word evidence requires the adoption of an appropriate solution to ingest, store, and process the enormous amount of information coming from all the involved, typically heterogeneous data sources. Data lake technologies are depicted as promising solutions for enhancing data management and analysis capabilities in the healthcare domain: we can rely on them to manage the complexity of big data volume and variety, providing data analysts with a self-service environment in which advanced analytics can be applied. In this paper we envision the adoption of a data lake federation through which organizations could achieve further benefits by sharing data. Exchanging data adds new research challenges related to guaranteeing data reliability and sovereignty. For instance, the collected data should be accurately described in order to document their quality, facilitate their discovery, define security and privacy policies. On the basis of the experience in Health Big Data, we are going to present an architecture for gathering real-world evidence, also identifying the research challenges from an IT perspective.
2023
Cappiello, Cinzia; Gribaudo, Marco; Plebani, Pierluigi; Salnitri, Mattia; Tanca, Letizia
File allegato/i alla scheda:
File Dimensione del file Formato  
978-3-031-23905-2_4.pdf

Solo gestori di archivio

Descrizione: Arcticolo completo
Versione: publisher's version - versione editoriale
Licenza: Licenza default Aisberg
Dimensione del file 1.5 MB
Formato Adobe PDF
1.5 MB Adobe PDF   Visualizza/Apri
Pubblicazioni consigliate

Aisberg ©2008 Servizi bibliotecari, Università degli studi di Bergamo | Terms of use/Condizioni di utilizzo

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10446/294369
Citazioni
  • Scopus 4
  • ???jsp.display-item.citation.isi??? ND
social impact