Optimal dynamic fixed-mix portfolios based on reinforcement learning with second order stochastic dominance

We propose a reinforcement learning (RL) approach to address a multiperiod optimization problem in which a portfolio manager seeks an optimal constant proportion portfolio strategy by minimizing a tail risk measure consistent with second order stochastic dominance (SSD) principles. As a risk measure, we consider in particular the Interval Conditional Value -at -Risk (ICVaR) shown to be mathematically related to SSD principles. By including the ICVaR in the reward function of an RL method we show that an optimal fixed -mix policy can be derived as solution of short- to medium -term allocation problems through an accurate specification of the learning parameters under general statistical assumptions. The financial optimization problem, thus, carries several novel features and the article details the required steps to accommodate those features within a reinforcement learning architecture. The methodology is tested in- and out -of -sample on market data showing good performance relative to the SP500, adopted as benchmark policy.

(2024). Optimal dynamic fixed-mix portfolios based on reinforcement learning with second order stochastic dominance [journal article - articolo]. In ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE. Retrieved from https://hdl.handle.net/10446/279469

Optimal dynamic fixed-mix portfolios based on reinforcement learning with second order stochastic dominance

Consigli, Giorgio;Gomez, Alvaro A.;Zubelli, Jorge P.

2024-01-01

Abstract

We propose a reinforcement learning (RL) approach to address a multiperiod optimization problem in which a portfolio manager seeks an optimal constant proportion portfolio strategy by minimizing a tail risk measure consistent with second order stochastic dominance (SSD) principles. As a risk measure, we consider in particular the Interval Conditional Value -at -Risk (ICVaR) shown to be mathematically related to SSD principles. By including the ICVaR in the reward function of an RL method we show that an optimal fixed -mix policy can be derived as solution of short- to medium -term allocation problems through an accurate specification of the learning parameters under general statistical assumptions. The financial optimization problem, thus, carries several novel features and the article details the required steps to accommodate those features within a reinforcement learning architecture. The methodology is tested in- and out -of -sample on market data showing good performance relative to the SP500, adopted as benchmark policy.

Scheda breve

Scheda completa

Scheda completa (DC)

	Tipo di articolo
	
				articolo
			
	Data di pubblicazione
	
				2024
			
	Rivista in ANCE
	
				ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE
			
	Tutti gli autori
	
						Consigli, Giorgio; Gomez, Alvaro A.; Zubelli, Jorge P.
					
	Citazione
	
				(2024). Optimal dynamic fixed-mix portfolios based on reinforcement learning with second order stochastic dominance  [journal article - articolo]. In ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE. Retrieved from https://hdl.handle.net/10446/279469
			
	Nelle collezioni:
	
				1.1.01 Articoli/Saggi in rivista - Journal Articles/Essays

File allegato/i alla scheda:

File	Dimensione del file	Formato
EAAI_2024_OptimalFixMixRLPflios-main.pdf accesso aperto Versione: publisher's version - versione editoriale Licenza: Creative commons Dimensione del file 2.63 MB Formato Adobe PDF Visualizza/Apri	2.63 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

Aisberg ©2008 Servizi bibliotecari, Università degli studi di Bergamo | Terms of use/Condizioni di utilizzo

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10446/279469

Citazioni

4

4

social impact