Backward explanations via redefinition of predicates

History eXplanation based on Predicates (HXP), studies the behavior of a Reinforcement Learning (RL) agent in a sequence of agent's interactions with the environment (a history), through the prism of an arbitrary predicate. To this end, an action importance score is computed for each action in the history. The explanation consists in displaying the most important actions to the user. As the calculation of an action's importance is #W[1]-hard, it is necessary for long histories to approximate the scores, at the expense of their quality. We therefore propose a new HXP method, called Backward-HXP, to provide explanations for these histories without having to approximate scores. Experiments show the ability of B-HXP to summarise long histories.

Mots clés

Machine Learning Reinforcement Learning Explainability Interpretability History Explanation Importance score

Domaines

Intelligence artificielle [cs.AI] Apprentissage [cs.LG]

Fichier principal

BHXP_ECAI.pdf (3.86 Mo)

Origine	Fichiers produits par l'(les) auteur(s)

Florence Dupin de Saint-Cyr : Connectez-vous pour contacter le contributeur

https://hal.science/hal-04669413

Soumis le : jeudi 8 août 2024-16:41:56

Dernière modification le : mardi 17 septembre 2024-12:26:45

Dates et versions

hal-04669413 , version 1 (08-08-2024)

hal-04669413 , version 2 (08-08-2024)

Identifiants

HAL Id : hal-04669413 , version 2

Citer

Léo Saulières, Martin Cooper, Florence Dupin de Saint-Cyr. Backward explanations via redefinition of predicates. 27th European Conference on Artifical Intelligence (ECAI 2024), European Association for Artificial Intelligence (EurAI); Spanish Artificial Intelligence Society (AEPIA), Oct 2024, Saint Jacques De Compostelle, Spain. à paraître. ⟨hal-04669413v2⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-BREST UNIV-TLSE2 CNRS LAB-STICC_UBO ENIB UT1-CAPITOLE LAB-STICC IRIT IRIT-ADRIA IRIT-IA LAB-STICC_COMMEDIA LAB-STICC_INTERACTION TOULOUSE-INP UNIV-UT3 UT3-TOULOUSEINP

416 Consultations

64 Téléchargements