Implementing Deep Reinforcement Learning Algorithms in Python for use in the MuJoCo Physics Simulator
Updated Dec 18, 2021 - TeX
Contains expert trajectories for various Gym environments, for use in state-only imitation learning.
Training an agent in the Gym Pendulum-v1 environment using an actor-critic algorithm.
This project investigates to what extent evolutionary methods, such as the Cross-Entropy Method and Evolution Strategies, can be used to optimize a neural policy, compared with a REINFORCE baseline.
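For readers unfamiliar with the Cross-Entropy Method mentioned above, here is a minimal sketch of the general idea. It is not taken from the project itself. The names `cross_entropy_method` and `toy_return`, the hyperparameters, and the quadratic stand-in for an episode return are all assumptions made for illustration; a real experiment would score each parameter vector by rolling out a neural policy in an environment such as Pendulum-v1.

```python
import random
import statistics


def cross_entropy_method(objective, dim, iterations=50, population=50,
                         elite_frac=0.2, init_std=3.0, seed=0):
    """Maximize `objective` over a `dim`-dimensional parameter vector.

    Keeps a diagonal Gaussian over parameters, samples a population,
    and refits the Gaussian to the best-scoring ("elite") samples.
    """
    rng = random.Random(seed)
    mean = [0.0] * dim
    std = [init_std] * dim
    n_elite = max(2, int(population * elite_frac))

    for _ in range(iterations):
        # Sample candidate parameter vectors from the current Gaussian.
        samples = [[rng.gauss(m, s) for m, s in zip(mean, std)]
                   for _ in range(population)]
        # Rank candidates by score, highest first, and keep the elites.
        samples.sort(key=objective, reverse=True)
        elites = samples[:n_elite]
        # Refit mean and standard deviation per dimension to the elites.
        # The small constant keeps the search from collapsing entirely.
        mean = [statistics.fmean(col) for col in zip(*elites)]
        std = [statistics.pstdev(col) + 1e-3 for col in zip(*elites)]

    return mean


def toy_return(params):
    # Hypothetical stand-in for an episode return: the score is highest
    # when params match an arbitrary target vector.
    target = [1.0, -2.0, 0.5]
    return -sum((p - t) ** 2 for p, t in zip(params, target))


best = cross_entropy_method(toy_return, dim=3)
print(best)
```

Unlike REINFORCE, this loop never computes a gradient: it only needs a scalar score per candidate, which is why such methods are often used as gradient-free alternatives for policy search.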