Artificial effect

Real-effort tasks, in which participants perform cognitively costly activities whose outcomes depend on actual performance, are widely used in experimental economics. Their validity, however, rests on the assumption that a human performs them. We study whether this assumption still holds in the era of Artificial Intelligence (AI) and Large Language Models (LLMs). Using 8 canonical real-effort tasks and 23 LLMs from three major providers, we show that most tasks can now be solved accurately and at a negligible cost, while only a few resist automation. Performance improves with each model generation, and midtier models are rapidly closing the gap with frontier ones, broadening the set of widely accessible models that can automate these tasks. Additionally, we show that verbally offering monetary incentives has no effect on LLM performance. Our f indings establish a boundary condition for the use of real-effort tasks in unsupervised settings: when participants can cheaply outsource task completion to an LLM, observed performance may no longer reflect genuine human effort.

Belotti, Federico, Coniglio, Stefano, Cosma, Antonio, Fallucchi, Francesco, (2026). Artificial effect (WORKING PAPERS OF DEPARTMENT OF ECONOMICS 41). Bergamo: Retrieved from https://hdl.handle.net/10446/326826 Retrieved from http://dx.doi.org/10.13122/WPEconomics_41

Artificial effect

Belotti, Federico;Coniglio, Stefano;Cosma, Antonio;Fallucchi, Francesco

2026-04-24

Abstract

Real-effort tasks, in which participants perform cognitively costly activities whose outcomes depend on actual performance, are widely used in experimental economics. Their validity, however, rests on the assumption that a human performs them. We study whether this assumption still holds in the era of Artificial Intelligence (AI) and Large Language Models (LLMs). Using 8 canonical real-effort tasks and 23 LLMs from three major providers, we show that most tasks can now be solved accurately and at a negligible cost, while only a few resist automation. Performance improves with each model generation, and midtier models are rapidly closing the gap with frontier ones, broadening the set of widely accessible models that can automate these tasks. Additionally, we show that verbally offering monetary incentives has no effect on LLM performance. Our f indings establish a boundary condition for the use of real-effort tasks in unsupervised settings: when participants can cheaply outsource task completion to an LLM, observed performance may no longer reflect genuine human effort.

Scheda breve

Scheda completa

Scheda completa (DC)

	Data di pubblicazione
	
				24-apr-2026
			
	Tutti gli autori
	
						Belotti, Federico; Coniglio, Stefano; Cosma, Antonio; Fallucchi, Francesco Pio
					
	Nelle collezioni:
	
				Working Papers of Department of Economics

File allegato/i alla scheda:

File	Dimensione del file	Formato
WPEconomics_41.pdf accesso aperto Versione: publisher's version - versione editoriale Licenza: Creative commons Dimensione del file 1.13 MB Formato Adobe PDF Visualizza/Apri	1.13 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

Aisberg ©2008 Servizi bibliotecari, Università degli studi di Bergamo | Terms of use/Condizioni di utilizzo

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10446/326826

Citazioni

ND

ND

social impact