PPO Agent Playing gymnasium_env/UnitreeGO2-v0

This is a trained model of a PPO agent playing gymnasium_env/UnitreeGO2-v0.

Hyperparameters

Downloads last month

-

Downloads are not tracked for this model. How to track
Video Preview
loading

Evaluation results

  • mean_reward on gymnasium_env/UnitreeGO2-v0
    self-reported
    -0.11 +/- 0.12