RL Insights Policy-Based Methods

    Policy-Based Methods

    • gradient-free methods: easy to scale, but they tend to work poorly when the policy has too many parameters

    • policy gradient methods
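
To make the gradient-free idea concrete, here is a minimal sketch of a (1+λ)-style evolutionary search. It is only an illustration under stated assumptions: `episode_return`, its `target` vector, and all hyperparameters are hypothetical stand-ins for "run the policy with parameters theta and report the return", not something taken from a real environment.

```python
import random


def episode_return(theta):
    """Hypothetical black-box return: highest when theta matches `target`.

    In a real setting this would run one or more episodes with the policy
    parameterized by theta and report the collected reward.
    """
    target = [0.5, -0.3, 0.8]  # assumed optimum, for illustration only
    return -sum((t - g) ** 2 for t, g in zip(theta, target))


def evolve(n_params=3, population=20, sigma=0.1, generations=200, seed=0):
    """Keep one parent, sample noisy children, keep the best if it improves."""
    rng = random.Random(seed)
    parent = [0.0] * n_params
    parent_return = episode_return(parent)
    for _ in range(generations):
        # Each child is evaluated independently of the others, which is
        # why this kind of search is easy to spread over many workers.
        children = [
            [p + rng.gauss(0.0, sigma) for p in parent]
            for _ in range(population)
        ]
        best_child = max(children, key=episode_return)
        best_return = episode_return(best_child)
        if best_return > parent_return:
            parent, parent_return = best_child, best_return
    return parent, parent_return


theta, final_return = evolve()
```

No gradient of the return is ever computed: the search only compares returns of perturbed parameter vectors. With only three parameters this converges quickly; as the number of parameters grows, random perturbations become a much less efficient way to find an improving direction.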

    Different ways to do policy optimization (https://youtu.be/KHZVXao4qXs?t=1532):

    • gradient-free, e.g. evolution methods
    • gradient-based, e.g. using gradient descent; see policy gradient methods
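
As a rough illustration of the gradient-based route, the sketch below applies a REINFORCE-style update to a softmax policy on a two-armed bandit. The bandit, its `arm_means`, the running-average baseline, and the step size are all assumptions chosen for illustration. Note that the update takes ascent steps on the expected return, which is the same as gradient descent on its negative.

```python
import math
import random


def softmax(prefs):
    """Turn action preferences into a probability distribution."""
    m = max(prefs)  # subtract the max for numerical stability
    exps = [math.exp(p - m) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]


def reinforce_bandit(arm_means=(0.2, 0.8), steps=2000, lr=0.1, seed=0):
    """REINFORCE-style updates on a hypothetical two-armed Gaussian bandit."""
    rng = random.Random(seed)
    theta = [0.0] * len(arm_means)  # one preference per action
    baseline = 0.0                  # running mean of rewards seen so far
    for t in range(1, steps + 1):
        probs = softmax(theta)
        action = rng.choices(range(len(theta)), weights=probs)[0]
        reward = rng.gauss(arm_means[action], 0.1)
        advantage = reward - baseline
        for a in range(len(theta)):
            # d/d theta[a] of log pi(action) for a softmax policy
            grad_log_pi = (1.0 if a == action else 0.0) - probs[a]
            theta[a] += lr * advantage * grad_log_pi
        baseline += (reward - baseline) / t
    return softmax(theta)


probs = reinforce_bandit()
```

Unlike a gradient-free search, each update uses the gradient of the log-probability of the sampled action, so the policy parameters move in a direction that, in expectation, increases the return.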


    Last update: April 9, 2020
    Copyright © 2020 Florian Laurent