Skip to Main content Skip to Navigation
New interface
Reports (Technical report)

Formalisation of metamorph Reinforcement Learning

Iago Bonnici 1 Abdelkader Gouaich 1 Fabien Michel 1 
1 SMILE - Système Multi-agent, Interaction, Langage, Evolution
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier
Abstract : This technical report describes the formalisation of a particular Reinforcement Learning (RL) situation that we call "metamorph" (mRL). In this situation, the signature of the learner agent, i.e. its set of inputs, outputs and feedback slots, can change over the course of learning. RL can be viewed as signal processing, because the learner agent transforms the inputs/feedbacks signals it is continuously fed with into output signals. The following formalisation is therefore concerned with signals description and the transformation from one signal to another. Also, since the signature of the agent is expected to change, we get concerned in the definition of what is a "signature" and a "signature change". In the first part, we describe mRL learning context, or how the metamorph agent is embedded into its environment and interacts with it. In the second part, we describe one generic example of a metamorph learner agent: a dynamical computational graph that could theoretically be used in controlling the agent. In the last part, we reformulate the classical problem of RL, a.k.a. "maximizing feedback" in terms of this formalised mRL. 1
Document type :
Reports (Technical report)
Complete list of metadata
Contributor : Iago Bonnici Connect in order to contact the contributor
Submitted on : Wednesday, December 5, 2018 - 1:27:18 PM
Last modification on : Wednesday, October 26, 2022 - 8:13:55 AM
Long-term archiving on: : Wednesday, March 6, 2019 - 12:34:03 PM


  • HAL Id : hal-01924642, version 1



Iago Bonnici, Abdelkader Gouaich, Fabien Michel. Formalisation of metamorph Reinforcement Learning. [Technical Report] LIRMM (UM, CNRS). 2018. ⟨hal-01924642⟩



Record views


Files downloads