Air pollution is a serious threat for almost all European countries. Recently in Northern Italy PM10, PM2.5 and NO2 limits set by the EU were often overtaken. In order to fight this trend, it is crucial to analyse air pollutants. This work uses two machine learning techniques (Natural Cubic Splines and Bagging) to predict PM10 and NO2 daily concentration for the years 2016 to 2019. We assess the methods accuracy by means of the Root Mean Square Error, the Weighted Mean Absolute Percentage Error and the Pearson’s correlation between predictions and observed values. Bagging slightly outperforms Natural Cubic Splines and per-formances are better for NO2 than for PM10. We are able to provide estimates even when actual on field measurements are not available. Moreover, we could use these models to forecast pollutant levels for the next few days by plugging in estimates of our predictors. This approach provides national and local govern-ments with tools to study the air pollution problem, shifting from ex-post meas-urements and sanctions to ex-ante strategies and corrections.

(2025). Machine Learning and Air Pollution: an Empirical Application for Northern Italy . Retrieved from https://hdl.handle.net/10446/275729

Machine Learning and Air Pollution: an Empirical Application for Northern Italy

Toninelli, Daniele
2025-01-01

Abstract

Air pollution is a serious threat for almost all European countries. Recently in Northern Italy PM10, PM2.5 and NO2 limits set by the EU were often overtaken. In order to fight this trend, it is crucial to analyse air pollutants. This work uses two machine learning techniques (Natural Cubic Splines and Bagging) to predict PM10 and NO2 daily concentration for the years 2016 to 2019. We assess the methods accuracy by means of the Root Mean Square Error, the Weighted Mean Absolute Percentage Error and the Pearson’s correlation between predictions and observed values. Bagging slightly outperforms Natural Cubic Splines and per-formances are better for NO2 than for PM10. We are able to provide estimates even when actual on field measurements are not available. Moreover, we could use these models to forecast pollutant levels for the next few days by plugging in estimates of our predictors. This approach provides national and local govern-ments with tools to study the air pollution problem, shifting from ex-post meas-urements and sanctions to ex-ante strategies and corrections.
daniele.toninelli@unibg.it
2025
Inglese
Methodological and Applied Statistics and Demography IV. SIS 2024, Short Papers, Contributed Sessions 2
Pollice, Alessio; Mariani, Paolo
978-3-031-64430-6
4
269
275
cartaceo
online
Switzerland
Springer
esperti anonimi
(SIS 2024) 52nd Scientific Meeting of the Italian Statistical Society. Bari, Italy, 17-20 June 2024
52
Bari (Italy)
17-20 June, 2024
SIS - Società Italiana di Statistica
internazionale
contributo
Settore SECS-S/03 - Statistica Economica
Air pollution assessment; machine learning; Bagging; Natural Cubic Splines
info:eu-repo/semantics/conferenceObject
2
Mimmo, Angelo; Toninelli, Daniele
1.4 Contributi in atti di convegno - Contributions in conference proceedings::1.4.01 Contributi in atti di convegno - Conference presentations
reserved
Non definito
273
(2025). Machine Learning and Air Pollution: an Empirical Application for Northern Italy . Retrieved from https://hdl.handle.net/10446/275729
File allegato/i alla scheda:
File Dimensione del file Formato  
1_merged.pdf

Solo gestori di archivio

Descrizione: Frontmatter + paper file
Versione: postprint - versione referata/accettata senza referaggio
Licenza: Licenza default Aisberg
Dimensione del file 345.24 kB
Formato Adobe PDF
345.24 kB Adobe PDF   Visualizza/Apri
Pubblicazioni consigliate

Aisberg ©2008 Servizi bibliotecari, Università degli studi di Bergamo | Terms of use/Condizioni di utilizzo

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10446/275729
Citazioni
  • Scopus ND
  • ???jsp.display-item.citation.isi??? 0
social impact