DeepRLCourse2022/bguan_ppo_lunarlander2/_stable_baselines3_version
bguan's lunar lander model #2 using PPO trained for 500K timesteps
5498d2e
1.5.0