ItsTSV
/

ppo_hopper

Reinforcement Learning

Model card Files Files and versions

Reinforcement Learning in Robotic and Simulated Environments

Environment: Hopper-v5
Algorithm: PPO
Steps: 1M
Average reward: 2200
Network layer size: 256

Both .pth (for neural network) and .npz (for observation preprocessing) files need to be loaded in order to work correctly.

Downloads last month: -; Downloads are not tracked for this model. How to track

Video Preview

Reinforcement Learning

loading

Collection including ItsTSV/ppo_hopper

Reinforcement Learning in Robotic and Simulated Environments

All trained models that are result of training using my implementations of various Deep Reinforcement Learning algorithms. • 2 items • Updated 25 days ago