The increasing availability of data gatherable from various sources and in several contexts, is forcing practitioners to find affordable ways to manage and exploit datasets. Within this context, machine learning (ML) - which can be described as a set of algorithms to analyse and process data to extract relevant features for clusterization, classification or prediction - emerged as one of the most investigated area providing powerful tools. Indeed, in literature it is possible to find a considerable number of articles dealing with ML algorithms and describing their real-world applications. This considerable number of works, depicting a wide variety of algorithms and widespread applications, creates an extensive knowledge on the topic. At the same time, it may also generate disorientation in the selection of the right approach. Thus, the need of synthesis and guidelines to drive the selection of the most suitable algorithm for a specific scope arises. To provide a response to such a necessity, the authors propose a ML algorithm selection tool. As a starting point, authors analysed several ML algorithms investigating their scope, their characteristics, and their typical fields of application, including also real examples. According to this exploration, authors identified two decision layers: the first one concerns the nature of the learning activity (supervised, unsupervised, etc.) while the second one is related to the characteristics of the ML algorithms (type of response, data size and type they can manage, etc.). Starting from a pool of algorithms, the first layer enables the users to narrow this pool depending on their scope. Then, the second layer guides the final selection, fitting the users’ constraints, the previously mentioned algorithms features, and the data characteristics.

(2018). How to select a suitable machine learning algorithm: A feature-based, scope-oriented selection framework . In ...SUMMER SCHOOL FRANCESCO TURCO. PROCEEDINGS. Retrieved from http://hdl.handle.net/10446/132120

How to select a suitable machine learning algorithm: A feature-based, scope-oriented selection framework

Sala, R.;Zambetti, M.;Pirola, F.;Pinto, R.
2018-01-01

Abstract

The increasing availability of data gatherable from various sources and in several contexts, is forcing practitioners to find affordable ways to manage and exploit datasets. Within this context, machine learning (ML) - which can be described as a set of algorithms to analyse and process data to extract relevant features for clusterization, classification or prediction - emerged as one of the most investigated area providing powerful tools. Indeed, in literature it is possible to find a considerable number of articles dealing with ML algorithms and describing their real-world applications. This considerable number of works, depicting a wide variety of algorithms and widespread applications, creates an extensive knowledge on the topic. At the same time, it may also generate disorientation in the selection of the right approach. Thus, the need of synthesis and guidelines to drive the selection of the most suitable algorithm for a specific scope arises. To provide a response to such a necessity, the authors propose a ML algorithm selection tool. As a starting point, authors analysed several ML algorithms investigating their scope, their characteristics, and their typical fields of application, including also real examples. According to this exploration, authors identified two decision layers: the first one concerns the nature of the learning activity (supervised, unsupervised, etc.) while the second one is related to the characteristics of the ML algorithms (type of response, data size and type they can manage, etc.). Starting from a pool of algorithms, the first layer enables the users to narrow this pool depending on their scope. Then, the second layer guides the final selection, fitting the users’ constraints, the previously mentioned algorithms features, and the data characteristics.
2018
Sala, Roberto; Zambetti, Michela Giuseppina; Pirola, Fabiana; Pinto, Roberto
File allegato/i alla scheda:
File Dimensione del file Formato  
How to select a suitable machine learning algorithm.pdf

accesso aperto

Versione: publisher's version - versione editoriale
Licenza: Licenza default Aisberg
Dimensione del file 351.54 kB
Formato Adobe PDF
351.54 kB Adobe PDF Visualizza/Apri
TOC 2018.pdf

accesso aperto

Versione: publisher's version - versione editoriale
Licenza: Licenza default Aisberg
Dimensione del file 117.14 kB
Formato Adobe PDF
117.14 kB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

Aisberg ©2008 Servizi bibliotecari, Università degli studi di Bergamo | Terms of use/Condizioni di utilizzo

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10446/132120
Citazioni
  • Scopus 10
  • ???jsp.display-item.citation.isi??? ND
social impact