This study investigates the effectiveness of Large Language Models (LLMs) in simplifying Italian administrative texts compared to human informants. This research evaluates the performance of several well-known LLMs, including GPT-3.5-Turbo, GPT-4, LLaMA 3, and Phi 3, in simplifying a corpus of Italian administrative documents (s-ItaIst), a representative corpus of Italian administrative texts. To accurately compare the simplification abilities of humans and LLMs, six parallel corpora of a subsection of ItaIst are collected. These parallel corpora were analyzed using both complexity and similarity metrics to assess the outcomes of LLMs and human participants. Our findings indicate that while LLMs perform comparably to humans in many aspects, there are notable differences in structural and semantic changes. The results of our study underscore the potential and limitations of using AI for administrative text simplification, highlighting areas where LLMs need improvement to achieve human-level proficiency.

(2024). AI vs. Human: Effectiveness of LLMs in Simplifying Italian Administrative Documents . Retrieved from https://hdl.handle.net/10446/298077

AI vs. Human: Effectiveness of LLMs in Simplifying Italian Administrative Documents

Ganfi Vittorio;
2024-01-01

Abstract

This study investigates the effectiveness of Large Language Models (LLMs) in simplifying Italian administrative texts compared to human informants. This research evaluates the performance of several well-known LLMs, including GPT-3.5-Turbo, GPT-4, LLaMA 3, and Phi 3, in simplifying a corpus of Italian administrative documents (s-ItaIst), a representative corpus of Italian administrative texts. To accurately compare the simplification abilities of humans and LLMs, six parallel corpora of a subsection of ItaIst are collected. These parallel corpora were analyzed using both complexity and similarity metrics to assess the outcomes of LLMs and human participants. Our findings indicate that while LLMs perform comparably to humans in many aspects, there are notable differences in structural and semantic changes. The results of our study underscore the potential and limitations of using AI for administrative text simplification, highlighting areas where LLMs need improvement to achieve human-level proficiency.
2024
Italiano
Proceedings of the Tenth Italian Conference on Computational Linguistics (CLiC-it 2024)
Dell'Orletta, F.; Lenci, A.; Montemagni, S.; Sprugnoli, R:,
3878
1
12
online
Italy
AIxIA
comitato scientifico
CLiC-it 2024: Italian Conference on Computational Linguistics, Pisa, Italy, December 4-6, 2024.
Pisa (Italy)
4-6 dicembre 2024
internazionale
contributo
Settore GLOT-01/A - Glottologia e linguistica
Automatic Text Simplification; Large Language Models; Italian Administrative language
info:eu-repo/semantics/conferenceObject
4
Russodivito, Marco; Ganfi, Vittorio; Fiorentino, Giuliana; Oliveto, Rocco
1.4 Contributi in atti di convegno - Contributions in conference proceedings::1.4.01 Contributi in atti di convegno - Conference presentations
open
Non definito
273
(2024). AI vs. Human: Effectiveness of LLMs in Simplifying Italian Administrative Documents . Retrieved from https://hdl.handle.net/10446/298077
File allegato/i alla scheda:
File Dimensione del file Formato  
91_main_long.pdf

accesso aperto

Versione: publisher's version - versione editoriale
Licenza: Creative commons
Dimensione del file 1.17 MB
Formato Adobe PDF
1.17 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

Aisberg ©2008 Servizi bibliotecari, Università degli studi di Bergamo | Terms of use/Condizioni di utilizzo

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10446/298077
Citazioni
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact