vtorg
/

rl-course-unit-1-ppo-lunarlander-v2

Reinforcement Learning

stable-baselines3

deep-reinforcement-learning

Model card Files Files and versions Community

VikramTiwari commited on May 5, 2022

Commit

3889df0

•

1 Parent(s): 062f46b

Update README.md

Files changed (1) hide show

README.md +17 -1

README.md CHANGED Viewed

@@ -24,5 +24,21 @@ model-index:
   This is a trained model of a **PPO** agent playing **LunarLander-v2** using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
   ## Usage (with Stable-baselines3)
-  TODO: Add your code

   This is a trained model of a **PPO** agent playing **LunarLander-v2** using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
   ## Usage (with Stable-baselines3)
+  ```
+  model = PPO(
+    policy = 'MlpPolicy',
+    env = env,
+    n_steps = 2048,
+    batch_size = 512,
+    n_epochs = 4,
+    gamma = 0.099,
+    gae_lambda = 0.98,
+    ent_coef = 0.01,
+    learning_rate=0.00001,
+    verbose=1,
+    tensorboard_log="./ppo_tensorboard/")
+  model.learn(total_timesteps=int(10e6))
+  ```