
step 6220000 . Checkpoint from initial model taken and trained further at a lower learning rate 2nd
c002a26
library_name: stable-baselines3 | |
tags: | |
- Pixelcopter-PLE-v0 | |
- deep-reinforcement-learning | |
- reinforcement-learning | |
- stable-baselines3 | |
model-index: | |
- name: PPO | |
results: | |
- task: | |
type: reinforcement-learning | |
name: reinforcement-learning | |
dataset: | |
name: Pixelcopter-PLE-v0 | |
type: Pixelcopter-PLE-v0 | |
metrics: | |
- type: mean_reward | |
value: 440.40 +/- 169.58 | |
name: mean_reward | |
verified: false | |
# **PPO** Agent playing **Pixelcopter-PLE-v0** | |
This is a trained model of a **PPO** agent playing **Pixelcopter-PLE-v0** | |
using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3). | |
## Usage (with Stable-baselines3) | |
TODO: Add your code | |
```python | |
from stable_baselines3 import ... | |
from huggingface_sb3 import load_from_hub | |
... | |
``` | |