Reinforcement Learning in Robotic and Simulated Environments

  • Environment: Hopper-v5
  • Algorithm: PPO
  • Training steps: 1M
  • Average reward: 2200
  • Network layer size: 256

Both the .pth file (neural network weights) and the .npz file (observation preprocessing) must be loaded for the model to work correctly.
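A minimal sketch of how the two files might be loaded together. The layout of the .npz archive is not documented here, so the key names `mean` and `var`, the epsilon and clipping values, and all three helper functions are assumptions for illustration only. Inspect `np.load(path).files` to see what the archive really contains. The .pth file is assumed to be readable with `torch.load`. If it holds a state dict, it still has to be loaded into a network with the matching architecture.

```python
import numpy as np


def load_obs_stats(npz_path, mean_key="mean", var_key="var"):
    """Read observation statistics from an .npz archive.

    The key names are assumptions, not taken from the model card.
    """
    with np.load(npz_path) as data:
        mean = data[mean_key].astype(np.float64)
        var = data[var_key].astype(np.float64)
    return mean, var


def normalize_obs(obs, mean, var, eps=1e-8, clip=10.0):
    """Standardize an observation and clip it to [-clip, clip].

    This is the common running-mean scheme; the values of eps and clip
    are assumed, not confirmed for this model.
    """
    return np.clip((obs - mean) / np.sqrt(var + eps), -clip, clip)


def load_policy_state(pth_path):
    """Load the saved network parameters onto the CPU (needs PyTorch)."""
    import torch  # imported lazily so the NumPy helpers work without it

    return torch.load(pth_path, map_location="cpu")
```

At inference time, each raw observation from the environment would pass through `normalize_obs` before it reaches the network. Skipping that step feeds the policy inputs on a different scale from the one it was trained on, which is the likely reason both files are required.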


Collection including ItsTSV/ppo_hopper