Formalisation of metamorph Reinforcement Learning

Iago Bonnici 1, Abdelkader Gouaich 1, Fabien Michel 1
1 SMILE - Système Multi-agent, Interaction, Langage, Evolution
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier
Abstract : This technical report describes the formalisation of a particular Reinforcement Learning (RL) situation that we call "metamorph" RL (mRL). In this situation, the signature of the learner agent, i.e. its set of input, output and feedback slots, can change over the course of learning. RL can be viewed as signal processing, because the learner agent transforms the input and feedback signals it is continuously fed into output signals. The following formalisation is therefore concerned with the description of signals and the transformation of one signal into another. Since the signature of the agent is expected to change, we are also concerned with defining what a "signature" and a "signature change" are. In the first part, we describe the mRL learning context, i.e. how the metamorph agent is embedded in its environment and interacts with it. In the second part, we describe one generic example of a metamorph learner agent: a dynamical computational graph that could theoretically be used to control the agent. In the last part, we reformulate the classical RL problem, a.k.a. "maximizing feedback", in terms of this formalised mRL.
Document type :
Reports

https://hal-lara.archives-ouvertes.fr/hal-01924642
Contributor : Iago Bonnici
Submitted on : Wednesday, December 5, 2018 - 1:27:18 PM
Last modification on : Thursday, February 7, 2019 - 4:55:42 PM
Long-term archiving on : Wednesday, March 6, 2019 - 12:34:03 PM

Identifiers

  • HAL Id : hal-01924642, version 1

Citation

Iago Bonnici, Abdelkader Gouaich, Fabien Michel. Formalisation of metamorph Reinforcement Learning. [Technical Report] LIRMM (UM, CNRS). 2018. ⟨hal-01924642⟩
