How to select a suitable machine learning algorithm: A feature-based, scope-oriented selection framework

The increasing availability of data gatherable from various sources and in several contexts, is forcing practitioners to find affordable ways to manage and exploit datasets. Within this context, machine learning (ML) - which can be described as a set of algorithms to analyse and process data to extract relevant features for clusterization, classification or prediction - emerged as one of the most investigated area providing powerful tools. Indeed, in literature it is possible to find a considerable number of articles dealing with ML algorithms and describing their real-world applications. This considerable number of works, depicting a wide variety of algorithms and widespread applications, creates an extensive knowledge on the topic. At the same time, it may also generate disorientation in the selection of the right approach. Thus, the need of synthesis and guidelines to drive the selection of the most suitable algorithm for a specific scope arises. To provide a response to such a necessity, the authors propose a ML algorithm selection tool. As a starting point, authors analysed several ML algorithms investigating their scope, their characteristics, and their typical fields of application, including also real examples. According to this exploration, authors identified two decision layers: the first one concerns the nature of the learning activity (supervised, unsupervised, etc.) while the second one is related to the characteristics of the ML algorithms (type of response, data size and type they can manage, etc.). Starting from a pool of algorithms, the first layer enables the users to narrow this pool depending on their scope. Then, the second layer guides the final selection, fitting the users’ constraints, the previously mentioned algorithms features, and the data characteristics.

(2018). How to select a suitable machine learning algorithm: A feature-based, scope-oriented selection framework . In ...SUMMER SCHOOL FRANCESCO TURCO. PROCEEDINGS. Retrieved from http://hdl.handle.net/10446/132120

How to select a suitable machine learning algorithm: A feature-based, scope-oriented selection framework

Sala, R.;Zambetti, M.;Pirola, F.;Pinto, R.

2018-01-01

Abstract

The increasing availability of data gatherable from various sources and in several contexts, is forcing practitioners to find affordable ways to manage and exploit datasets. Within this context, machine learning (ML) - which can be described as a set of algorithms to analyse and process data to extract relevant features for clusterization, classification or prediction - emerged as one of the most investigated area providing powerful tools. Indeed, in literature it is possible to find a considerable number of articles dealing with ML algorithms and describing their real-world applications. This considerable number of works, depicting a wide variety of algorithms and widespread applications, creates an extensive knowledge on the topic. At the same time, it may also generate disorientation in the selection of the right approach. Thus, the need of synthesis and guidelines to drive the selection of the most suitable algorithm for a specific scope arises. To provide a response to such a necessity, the authors propose a ML algorithm selection tool. As a starting point, authors analysed several ML algorithms investigating their scope, their characteristics, and their typical fields of application, including also real examples. According to this exploration, authors identified two decision layers: the first one concerns the nature of the learning activity (supervised, unsupervised, etc.) while the second one is related to the characteristics of the ML algorithms (type of response, data size and type they can manage, etc.). Starting from a pool of algorithms, the first layer enables the users to narrow this pool depending on their scope. Then, the second layer guides the final selection, fitting the users’ constraints, the previously mentioned algorithms features, and the data characteristics.

Scheda breve

Scheda completa

Scheda completa (DC)

	Data di pubblicazione
	
				2018
			
	Tutti gli autori
	
						Sala, Roberto; Zambetti, Michela Giuseppina; Pirola, Fabiana; Pinto, Roberto
					
	Nelle collezioni:
	
				1.4.01 Contributi in atti di convegno - Conference presentations

File allegato/i alla scheda:

File	Dimensione del file	Formato
How to select a suitable machine learning algorithm.pdf accesso aperto Versione: publisher's version - versione editoriale Licenza: Licenza default Aisberg Dimensione del file 351.54 kB Formato Adobe PDF Visualizza/Apri	351.54 kB	Adobe PDF	Visualizza/Apri
TOC 2018.pdf accesso aperto Versione: publisher's version - versione editoriale Licenza: Licenza default Aisberg Dimensione del file 117.14 kB Formato Adobe PDF Visualizza/Apri	117.14 kB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

Aisberg ©2008 Servizi bibliotecari, Università degli studi di Bergamo | Terms of use/Condizioni di utilizzo

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10446/132120

Citazioni

10

ND

social impact