Modal clustering asymptotics with applications to bandwidth selection

Casa, Alessandro; Chacón, José E.; Menardi, Giovanna
2020-01-01

Abstract

Density-based clustering relies on the idea of linking groups to specific features of the probability distribution underlying the data. The reference to a true, yet unknown, population structure allows the clustering problem to be framed in a standard inferential setting, where the ideal population clustering is defined as the partition induced by the true density function. The nonparametric formulation of this approach, known as modal clustering, draws a correspondence between the groups and the domains of attraction of the density modes. Operationally, a nonparametric density estimate is required, and a proper selection of the amount of smoothing, which governs the shape of the density and hence possibly the modal structure, is crucial to identify the final partition. In this work, we address the issue of density estimation for modal clustering from an asymptotic perspective. A natural and easy-to-interpret metric for measuring the distance between density-based partitions is discussed, and its asymptotic approximation is explored and employed to study the problem of bandwidth selection for nonparametric modal clustering.
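As a rough illustration of the idea described in the abstract, the sketch below assigns each observation to the mode of a Gaussian kernel density estimate that it climbs to, using the mean-shift iteration. Mean shift is a common way to carry out modal clustering, but the record does not state which algorithm the authors use, so treat it as an assumption. The function names `mean_shift_modes` and `modal_clusters`, the parameters `bandwidth`, `n_iter`, `tol` and `merge_tol`, and the toy data are all hypothetical and chosen only for this example.

```python
import numpy as np


def mean_shift_modes(data, bandwidth, n_iter=200, tol=1e-6):
    """Move every point uphill on a Gaussian kernel density estimate.

    Assumption: a single scalar bandwidth and a Gaussian kernel.
    `data` is an (n, d) array; the result has the same shape.
    """
    data = np.asarray(data, dtype=float)
    points = data.copy()
    for _ in range(n_iter):
        # Pairwise differences between current positions and the sample
        diffs = points[:, None, :] - data[None, :, :]
        weights = np.exp(-0.5 * np.sum(diffs ** 2, axis=2) / bandwidth ** 2)
        # Mean-shift update: kernel-weighted average of the sample
        shifted = weights @ data / weights.sum(axis=1, keepdims=True)
        converged = np.max(np.abs(shifted - points)) < tol
        points = shifted
        if converged:
            break
    return points


def modal_clusters(data, bandwidth, merge_tol=1e-2):
    """Group observations whose mean-shift paths end at the same mode."""
    modes = mean_shift_modes(data, bandwidth)
    labels = np.empty(len(modes), dtype=int)
    centers = []
    for i, mode in enumerate(modes):
        for k, center in enumerate(centers):
            if np.linalg.norm(mode - center) < merge_tol:
                labels[i] = k
                break
        else:
            # No existing center is close enough: this is a new mode
            centers.append(mode)
            labels[i] = len(centers) - 1
    return labels, np.array(centers)


if __name__ == "__main__":
    # Toy one-dimensional sample with two tight groups (illustrative only)
    sample = np.array([[-3.1], [-3.0], [-2.9], [2.9], [3.0], [3.1]])
    for h in (0.5, 5.0):
        labels, centers = modal_clusters(sample, bandwidth=h)
        print(f"bandwidth={h}: {len(centers)} cluster(s), labels={labels.tolist()}")
```

On this toy sample the small bandwidth keeps the two groups apart, while the large one smooths the estimate into a single mode and merges them. This is the sense in which the amount of smoothing governs the modal structure and therefore the resulting partition.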
journal article
2020
Casa, Alessandro; Chacón, José E.; Menardi, Giovanna
(2020). Modal clustering asymptotics with applications to bandwidth selection [journal article]. In ELECTRONIC JOURNAL OF STATISTICS. Retrieved from https://hdl.handle.net/10446/269553
Files attached to this record:
File: 20-EJS1679.pdf
Access: open access
Version: publisher's version
License: Creative Commons
File size: 421.61 kB
Format: Adobe PDF

Use this identifier to cite or link to this document: https://hdl.handle.net/10446/269553