The collection of data during the routine delivery of care is changing the healthcare sector. Indeed, only from the clinical trial data it is difficult to obtain such a complete picture of the status of a patient as that provided by real-world data. However, the creation of valuable real-word evidence requires the adoption of an appropriate solution to ingest, store, and process the enormous amount of information coming from all the involved, typically heterogeneous data sources. Data lake technologies are depicted as promising solutions for enhancing data management and analysis capabilities in the healthcare domain: we can rely on them to manage the complexity of big data volume and variety, providing data analysts with a self-service environment in which advanced analytics can be applied. In this paper we envision the adoption of a data lake federation through which organizations could achieve further benefits by sharing data. Exchanging data adds new research challenges related to guaranteeing data reliability and sovereignty. For instance, the collected data should be accurately described in order to document their quality, facilitate their discovery, define security and privacy policies. On the basis of the experience in Health Big Data, we are going to present an architecture for gathering real-world evidence, also identifying the research challenges from an IT perspective.
(2023). Enabling Real-World Medicine with Data Lake Federation: A Research Perspective . Retrieved from https://hdl.handle.net/10446/294369
Enabling Real-World Medicine with Data Lake Federation: A Research Perspective
Salnitri, Mattia;
2023-01-01
Abstract
The collection of data during the routine delivery of care is changing the healthcare sector. Indeed, only from the clinical trial data it is difficult to obtain such a complete picture of the status of a patient as that provided by real-world data. However, the creation of valuable real-word evidence requires the adoption of an appropriate solution to ingest, store, and process the enormous amount of information coming from all the involved, typically heterogeneous data sources. Data lake technologies are depicted as promising solutions for enhancing data management and analysis capabilities in the healthcare domain: we can rely on them to manage the complexity of big data volume and variety, providing data analysts with a self-service environment in which advanced analytics can be applied. In this paper we envision the adoption of a data lake federation through which organizations could achieve further benefits by sharing data. Exchanging data adds new research challenges related to guaranteeing data reliability and sovereignty. For instance, the collected data should be accurately described in order to document their quality, facilitate their discovery, define security and privacy policies. On the basis of the experience in Health Big Data, we are going to present an architecture for gathering real-world evidence, also identifying the research challenges from an IT perspective.File | Dimensione del file | Formato | |
---|---|---|---|
978-3-031-23905-2_4.pdf
Solo gestori di archivio
Descrizione: Arcticolo completo
Versione:
publisher's version - versione editoriale
Licenza:
Licenza default Aisberg
Dimensione del file
1.5 MB
Formato
Adobe PDF
|
1.5 MB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
Aisberg ©2008 Servizi bibliotecari, Università degli studi di Bergamo | Terms of use/Condizioni di utilizzo