Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
sErial03
/
UnitreeGO2-v0-cleanrl_ppo-seed1
like
0
Reinforcement Learning
TensorBoard
gymnasium_env/UnitreeGO2-v0
ppo
deep-reinforcement-learning
custom-implementation
deep-rl-course
Eval Results
Model card
Files
Files and versions
Metrics
Training metrics
Community
PPO Agent Playing gymnasium_env/UnitreeGO2-v0
Hyperparameters
PPO Agent Playing gymnasium_env/UnitreeGO2-v0
This is a trained model of a PPO agent playing gymnasium_env/UnitreeGO2-v0.
Hyperparameters
Downloads last month
-
Downloads are not tracked for this model.
How to track
Video Preview
Reinforcement Learning
loading
Evaluation results
mean_reward
on gymnasium_env/UnitreeGO2-v0
self-reported
-0.11 +/- 0.12
View on Papers With Code