Mitigating the Impact of Reference Quality on Evaluation of Summarization Systems with Reference-Free Metrics

Théo Gigant; Camille Guinaudeau; Marc Decombas; Frédéric Dufaux

Conference paper, Year: 2024

Théo Gigant

Role: Author
PersonId: 1261551
IdHAL: gigant
ORCID: 0009-0003-6392-8519

Laboratoire des signaux et systèmes

JustAI

Camille Guinaudeau

Role: Author
PersonId: 20609
IdHAL: camille-guinaudeau
ORCID: 0000-0001-7249-8715
IdRef: 173844340

Sciences et Technologies des Langues - LISN

Laboratoire Interdisciplinaire des Sciences du Numérique

Marc Decombas

Role: Author
PersonId: 931123

JustAI

Frédéric Dufaux

Role: Author
PersonId: 11239
IdHAL: fdufaux
ORCID: 0000-0001-6388-4112
IdRef: 169586170

Laboratoire des signaux et systèmes

Abstract

Automatic metrics are used as proxies to evaluate abstractive summarization systems when human annotations are too expensive. To be useful, these metrics should be fine-grained, show a high correlation with human annotations, and ideally be independent of reference quality. However, most standard evaluation metrics for summarization are reference-based, and existing reference-free metrics correlate poorly with relevance, especially on summaries of longer documents. In this paper, we introduce a reference-free metric that correlates well with human-evaluated relevance while being very cheap to compute. We show that this metric can also be used alongside reference-based metrics to improve their robustness in low-quality reference settings.

Keywords

Evaluation metric; Evaluation of summarization

Domains

Artificial Intelligence [cs.AI]; Multimedia [cs.MM]; Document and Text Processing

Main file

acl_latex.pdf (996.95 KB)

Origin: Files produced by the author(s)

Contributor: Théo Gigant

https://hal.science/hal-04720645

Submitted on: Tuesday, October 8, 2024, 03:48:18

Last modified on: Tuesday, October 8, 2024, 18:40:38

Dates and versions

hal-04720645, version 1 (2024-10-08)

Identifiers

HAL Id: hal-04720645, version 1
arXiv: 2410.10867

Cite

Théo Gigant, Camille Guinaudeau, Marc Decombas, Frédéric Dufaux. Mitigating the Impact of Reference Quality on Evaluation of Summarization Systems with Reference-Free Metrics. The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024), Nov 2024, Miami (FL), United States. ⟨hal-04720645⟩

Collections

CNRS INRIA SUP_LSS SUP_TELECOMS CENTRALESUPELEC UNIV-PARIS-SACLAY LISN GS-COMPUTER-SCIENCE GS-SPORT-HUMAN-MOVEMENT HUB-IA
