Reinforcement learning with function approximation for 3-spheres swimmer
Abstract
We study the swimming strategies that maximize the speed of the three-sphere swimmer using reinforcement learning methods. First of all, we ensure that for a simple model with few actions, the Q-learning method converges. However, this latter method does not fit a more complex framework (for instance the presence of boundary) where states or actions have to be continuous to obtain all directions in the swimmer's reachable set. To overcome this issue, we investigate another method from reinforcement learning which uses function approximation, and benchmark its results in absence of walls.
Origin | Files produced by the author(s) |
---|