VikramTiwari committed on
Commit 3889df0
1 Parent(s): 062f46b

Update README.md

Files changed (1)
  1. README.md +17 -1
README.md CHANGED
@@ -24,5 +24,21 @@ model-index:
  This is a trained model of a **PPO** agent playing **LunarLander-v2** using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
 
  ## Usage (with Stable-baselines3)
- TODO: Add your code
+
+ ```
+ model = PPO(
+ policy = 'MlpPolicy',
+ env = env,
+ n_steps = 2048,
+ batch_size = 512,
+ n_epochs = 4,
+ gamma = 0.099,
+ gae_lambda = 0.98,
+ ent_coef = 0.01,
+ learning_rate=0.00001,
+ verbose=1,
+ tensorboard_log="./ppo_tensorboard/")
+
+ model.learn(total_timesteps=int(10e6))
+ ```
 
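Review note: the snippet added in this commit uses `PPO` and `env` without importing or defining them, so it will not run as pasted. Below is a minimal sketch of a self-contained version, not the author's actual training script. The environment ID, the `n_envs=1` default, the use of `make_vec_env`, and the helper names `train`, `minibatches_per_epoch`, and `effective_horizon` are assumptions for illustration. The hyperparameters are copied unchanged from the commit. Note that `gamma = 0.099` gives an effective horizon of roughly one step under the common `1 / (1 - gamma)` heuristic, so it may be a typo for `0.999`; that is a guess, and the committed value is left as is.

```python
# Hyperparameters copied verbatim from the committed README snippet.
PPO_KWARGS = dict(
    policy="MlpPolicy",
    n_steps=2048,
    batch_size=512,
    n_epochs=4,
    gamma=0.099,  # as committed; possibly intended to be 0.999 (assumption)
    gae_lambda=0.98,
    ent_coef=0.01,
    learning_rate=0.00001,
    verbose=1,
    tensorboard_log="./ppo_tensorboard/",
)
TOTAL_TIMESTEPS = int(10e6)  # 10 million environment steps


def minibatches_per_epoch(n_envs=1):
    """Minibatches per PPO epoch: rollout size (n_steps * n_envs) // batch_size."""
    return (PPO_KWARGS["n_steps"] * n_envs) // PPO_KWARGS["batch_size"]


def effective_horizon(gamma):
    """Rough number of future steps that matter under discount factor gamma."""
    return 1.0 / (1.0 - gamma)


def train(env_id="LunarLander-v2", n_envs=1):
    """Train PPO with the settings above; env_id and n_envs are assumptions."""
    # Imported lazily so the constants above can be inspected without
    # stable-baselines3 (and the Box2D extras LunarLander needs) installed.
    from stable_baselines3 import PPO
    from stable_baselines3.common.env_util import make_vec_env

    env = make_vec_env(env_id, n_envs=n_envs)
    model = PPO(env=env, **PPO_KWARGS)
    model.learn(total_timesteps=TOTAL_TIMESTEPS)
    return model
```

Calling `train()` starts the full run. Depending on the installed Gym or Gymnasium version, the environment may be registered under a different ID than `LunarLander-v2`, so `env_id` may need adjusting.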