Measuring text readability with machine comprehension: a pilot study

Marc Benzahra 1 François Yvon 1
1 TLP - Traitement du Langage Parlé
LIMSI - Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur : 247329
Abstract: This article studies the relationship between text readability indices and automatic machine understanding systems. Our hypothesis is that the simpler a text is, the better it should be understood by a machine. We thus expect a strong correlation between readability levels on the one hand, and the performance of automatic reading systems on the other hand. We test this hypothesis with several understanding systems based on language models of varying strengths, measuring this correlation on two corpora of journalistic texts. Our results suggest that this correlation is rather small and that existing comprehension systems are far from reproducing the gradual improvement in performance expected on texts of decreasing complexity.
Document type:
Conference paper

Cited literature [46 references]

https://hal.archives-ouvertes.fr/hal-02267546
Contributor: Limsi Publications
Submitted on: Monday, 19 August 2019 - 14:18:36
Last modified on: Thursday, 2 January 2020 - 14:58:04
Long-term archiving on: Thursday, 9 January 2020 - 14:55:29

File

document(5).pdf
Publisher files allowed on an open archive

Identifiers

  • HAL Id : hal-02267546, version 1

Citation

Marc Benzahra, François Yvon. Measuring text readability with machine comprehension: a pilot study. Workshop on Building Educational Applications Using NLP, Aug 2019, Florence, Italy. pp. 412-422. ⟨hal-02267546⟩
