Initial Push
Browse files- README.md +37 -0
- config.json +1 -0
- replay.mp4 +0 -0
- results.json +1 -0
- sac-PandaSlide-v3.zip +3 -0
- sac-PandaSlide-v3/_stable_baselines3_version +1 -0
- sac-PandaSlide-v3/actor.optimizer.pth +3 -0
- sac-PandaSlide-v3/critic.optimizer.pth +3 -0
- sac-PandaSlide-v3/data +126 -0
- sac-PandaSlide-v3/ent_coef_optimizer.pth +3 -0
- sac-PandaSlide-v3/policy.pth +3 -0
- sac-PandaSlide-v3/pytorch_variables.pth +3 -0
- sac-PandaSlide-v3/system_info.txt +9 -0
- vec_normalize.pkl +3 -0
README.md
ADDED
@@ -0,0 +1,37 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
library_name: stable-baselines3
|
3 |
+
tags:
|
4 |
+
- PandaSlide-v3
|
5 |
+
- deep-reinforcement-learning
|
6 |
+
- reinforcement-learning
|
7 |
+
- stable-baselines3
|
8 |
+
model-index:
|
9 |
+
- name: SAC
|
10 |
+
results:
|
11 |
+
- task:
|
12 |
+
type: reinforcement-learning
|
13 |
+
name: reinforcement-learning
|
14 |
+
dataset:
|
15 |
+
name: PandaSlide-v3
|
16 |
+
type: PandaSlide-v3
|
17 |
+
metrics:
|
18 |
+
- type: mean_reward
|
19 |
+
value: -22.40 +/- 9.56
|
20 |
+
name: mean_reward
|
21 |
+
verified: false
|
22 |
+
---
|
23 |
+
|
24 |
+
# **SAC** Agent playing **PandaSlide-v3**
|
25 |
+
This is a trained model of a **SAC** agent playing **PandaSlide-v3**
|
26 |
+
using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
|
27 |
+
|
28 |
+
## Usage (with Stable-baselines3)
|
29 |
+
TODO: Add your code
|
30 |
+
|
31 |
+
|
32 |
+
```python
|
33 |
+
from stable_baselines3 import ...
|
34 |
+
from huggingface_sb3 import load_from_hub
|
35 |
+
|
36 |
+
...
|
37 |
+
```
|
config.json
ADDED
@@ -0,0 +1 @@
|
|
|
|
|
1 |
+
{"policy_class": {":type:": "<class 'abc.ABCMeta'>", ":serialized:": "gAWVNwAAAAAAAACMHnN0YWJsZV9iYXNlbGluZXMzLnNhYy5wb2xpY2llc5SMEE11bHRpSW5wdXRQb2xpY3mUk5Qu", "__module__": "stable_baselines3.sac.policies", "__doc__": "\n Policy class (with both actor and critic) for SAC.\n\n :param observation_space: Observation space\n :param action_space: Action space\n :param lr_schedule: Learning rate schedule (could be constant)\n :param net_arch: The specification of the policy and value networks.\n :param activation_fn: Activation function\n :param use_sde: Whether to use State Dependent Exploration or not\n :param log_std_init: Initial value for the log standard deviation\n :param use_expln: Use ``expln()`` function instead of ``exp()`` when using gSDE to ensure\n a positive standard deviation (cf paper). It allows to keep variance\n above zero and prevent it from growing too fast. In practice, ``exp()`` is usually enough.\n :param clip_mean: Clip the mean output when using gSDE to avoid numerical instability.\n :param features_extractor_class: Features extractor to use.\n :param normalize_images: Whether to normalize images or not,\n dividing by 255.0 (True by default)\n :param optimizer_class: The optimizer to use,\n ``th.optim.Adam`` by default\n :param optimizer_kwargs: Additional keyword arguments,\n excluding the learning rate, to pass to the optimizer\n :param n_critics: Number of critic networks to create.\n :param share_features_extractor: Whether to share or not the features extractor\n between the actor and the critic (this saves computation time)\n ", "__init__": "<function MultiInputPolicy.__init__ at 0x7ff631973d00>", "__abstractmethods__": "frozenset()", "_abc_impl": "<_abc._abc_data object at 0x7ff631986c00>"}, "verbose": 1, "policy_kwargs": {"net_arch": {"pi": [512, 512, 512], "qf": [512, 512, 512]}, "use_sde": false}, "num_timesteps": 3004656, "_total_timesteps": 5000000, "_num_timesteps_at_start": 0, "seed": null, "action_noise": null, "start_time": 1718199802391038128, "learning_rate": 0.0003, "tensorboard_log": "./sacPandaSlide-v3/", "_last_obs": {":type:": "<class 'collections.OrderedDict'>", ":serialized:": "gAWV+wYAAAAAAACMC2NvbGxlY3Rpb25zlIwLT3JkZXJlZERpY3SUk5QpUpQojA1hY2hpZXZlZF9nb2FslIwSbnVtcHkuY29yZS5udW1lcmljlIwLX2Zyb21idWZmZXKUk5QolsAAAAAAAAAAczcAP4Dupz5bOzk+ao7NPDPEFD8y7Dw+6eV+Pg1jzT46so8+G87qvpvo1D4YQjQ+K0nyPuRC/j5VCjM+sbdKPqSN9L6wyDM+gGKYPRagzr6di8s+5TO/vth/N78O7Tw+3McevrHMOD7FHDU+iahwvwcW4r6WVDc+eL21PwNbHz6w2Dg+GdPlPjO7y77j0jk+jw2qPyJLXL5lWzo+uXSSP6qJ8j7yhTo+y+MqP2M6Nr63LDQ+QuiLv+BI1b4b6Tw+lIwFbnVtcHmUjAVkdHlwZZSTlIwCZjSUiYiHlFKUKEsDjAE8lE5OTkr/////Sv////9LAHSUYksQSwOGlIwBQ5R0lFKUjAxkZXNpcmVkX2dvYWyUaAcolsAAAAAAAAAAODpHPy98zD8a0yYyFfpavgbmUb8a0yYyilWLvpyO674a0yYy4rimvpQtub8a0yYy/51/v+Kj+L4a0yYyCXmnvnM45T4a0yYyaprnPWnJcb8a0yYy+DuuPkG9Hb8a0yYyWgaYvsbBqT4a0yYyob6av3Gy1z8a0yYyhD1zP+/vOT8a0yYyY0QnPykp074a0yYyjkYqPkwsST4a0yYyoSu+P2mijz8a0yYy3FTfv8A3br8a0yYy8QUCP3fXyr8a0yYylGgOSxBLA4aUaBJ0lFKUjAtvYnNlcnZhdGlvbpRoByiWgAQAAAAAAABWdHA/ZVQEP8ixHr8lfRY/0ZKfP6qD8j1zNwA/gO6nPls7OT4pMzo7cid/vdL7CD4/ruU+Q153Pu41QL7L3cy/Vr6Rv+WGk7+rOqs+uLqKP+M43L7ipOC+aQkKP4lADMBqjs08M8QUPzLsPD50OU48AJBEu9OfUD45pgq/cG7qvTXhIT5rm3Y8RmVdvFnrpD4juTA/xezAPWJ3nD+h/l4/k8o2QMzTJ8Dp5X4+DWPNPjqyjz6SJVE8REJSu1gYTz5tpQq/YBjqvTDW/78tWmM89RhbvFvspD4lHVq+jtViP6wvHr89Wls79Xkuv8BPXz4bzuq+m+jUPhhCND4Zcvw7ZTm5vEaTKb7/wEg+9KvJvwjbGT41pa47T0xGvzDMEMAdzl4/L96UP6Wr6L7lqEo/3G0YQN6V0D8rSfI+5EL+PlUKMz54tJ87WtcTPIdtdT7kwBQ/iJ2dv8sGNz6g2C6/JbS+vmU66D5xeAE/S8Ibv9pdHr+x2o4/myLXP4gttz2xt0o+pI30vrDIMz7bOe87bq2OvPNAJT1coLi+PnONP48QIT77GjG/W3z2Pdl0ib4nu+o+5YeBvZ+uqz+jej8+J3dau3ApND6AYpg9FqDOvp2Lyz6SJVE8REJSu1gYTz5tpQq/YBjqvTzNIT4tWmM89RhbvFvspD5u9qu9tLWWv+HQH78v+vk+JWoePDAdMj7lM7++2H83vw7tPD7QRVE84edSu8M5UD5RpQq/1BTqvTfPIT4ojWI8Ir9avFvspD7bqqG8g4A2P/rBIr8V8aw/vIncPt4lHz7cxx6+scw4PsUcNT7eB1W7OIv4vMNy0T6OVAtAUFofv+wLHz5wUXm/ZCQWP/oCiD/ECEC/oGVRv52TG7/kIjc+nHgkPRcsXz6JqHC/BxbivpZUNz7wgy48Sr7RPASNd70rZsG+tIUEPzgZxz3sXzo/wey0Pq7Xg73EOcI+2dgbvifRIb+9eAg/mkEyP5gePD54vbU/A1sfPrDYOD4J68q8muPYvTdbKMBErRJAMUZnPjchKT55WKi/nhOrv16/T8C0Da0+xgkqvy/CGr/lCmc+y+l5vwj1Yz4Z0+U+M7vLvuPSOT6RxYc8UN8+vqrmFb8LDB5Aw1OBPll2Gj53qwDAgr2bPdfVIMCkYRc/Ib7vvoM8HL/Brns9x+8DvZ54OD6PDao/IktcvmVbOj43oX0996joPQf9kr/bSA1AkTZpP7HaFj66tcs+/iAGQN3pG8A03wo/l9WCP/aIvb6ZvIE/cCShv0rB9z+5dJI/qonyPvKFOj5OzRw8o0lOPus3S77dWApAYr9/vs1EJj45vwHARYFHP75sHb9jMJY/icCQP0F+G79wuvy7GDnAP0/wPj7L4yo/Yzo2vrcsND73Nqc8BtzNvGb2Gz88WTQ+ekp9v8H6Hz5gQFE/N5mjPvcLYj9jf9S+QE07vPxnKz/fFfW/G+vTPql3er9C6Iu/4EjVvhvpPD6P3lA8qupPuxN1UD5DoQq/2avpvZjrIT7mykQ8X/hFvPIkpT6UaA5LEEsShpRoEnSUUpR1Lg==", "achieved_goal": "[[ 0.5008461 0.3279915 0.18089049]\n [ 0.02509232 0.58111876 0.18449476]\n [ 0.24892391 0.40114632 0.28065664]\n [-0.4586037 0.41583714 0.17603338]\n [ 0.47321448 0.49660408 0.1748441 ]\n [ 0.19796635 -0.47764313 0.17557025]\n [ 0.07440662 -0.4035651 0.39754954]\n [-0.3734428 -0.7167945 0.18449804]\n [-0.15505928 0.18046834 0.17686756]\n [-0.94007164 -0.4415743 0.1790336 ]\n [ 1.4198446 0.15562062 0.1805141 ]\n [ 0.44887617 -0.3979126 0.18146853]\n [ 1.3285388 -0.21513036 0.18198927]\n [ 1.1441871 0.47370654 0.18215159]\n [ 0.66753834 -0.1779571 0.17595182]\n [-1.0930254 -0.41657162 0.18448298]]", "desired_goal": "[[ 7.7823210e-01 1.5975398e+00 9.7104706e-09]\n [-2.1384461e-01 -8.1991613e-01 9.7104706e-09]\n [-2.7213699e-01 -4.6007240e-01 9.7104706e-09]\n [-3.2562929e-01 -1.4467034e+00 9.7104706e-09]\n [-9.9850458e-01 -4.8562533e-01 9.7104706e-09]\n [-3.2709530e-01 4.4769630e-01 9.7104706e-09]\n [ 1.1308749e-01 -9.4447953e-01 9.7104706e-09]\n [ 3.4030128e-01 -6.1616904e-01 9.7104706e-09]\n [-2.9692346e-01 3.3155650e-01 9.7104706e-09]\n [-1.2089425e+00 1.6851331e+00 9.7104706e-09]\n [ 9.5015740e-01 7.2631735e-01 9.7104706e-09]\n [ 6.5338725e-01 -4.1242340e-01 9.7104706e-09]\n [ 1.6628477e-01 1.9645804e-01 9.7104706e-09]\n [ 1.4857064e+00 1.1221439e+00 9.7104706e-09]\n [-1.7447772e+00 -9.3053818e-01 9.7104706e-09]\n [ 5.0790316e-01 -1.5847005e+00 9.7104706e-09]]", "observation": "[[ 9.39275146e-01 5.16912758e-01 -6.19900227e-01 5.87847054e-01\n 1.24666798e+00 1.18415192e-01 5.00846088e-01 3.27991486e-01\n 1.80890486e-01 2.84118415e-03 -6.22934774e-02 1.33773118e-01\n 4.48595017e-01 2.41570517e-01 -1.87705725e-01 -1.60051858e+00\n -1.13862109e+00 -1.15255415e+00]\n [ 3.34431976e-01 1.08382320e+00 -4.30121511e-01 -4.38757956e-01\n 5.39206088e-01 -2.19143891e+00 2.50923224e-02 5.81118762e-01\n 1.84494764e-01 1.25869401e-02 -2.99930573e-03 2.03734681e-01\n -5.41598856e-01 -1.14468455e-01 1.58085659e-01 1.50517030e-02\n -1.35129150e-02 3.22108060e-01]\n [ 6.90324962e-01 9.42016020e-02 1.22239327e+00 8.71072829e-01\n 2.85611415e+00 -2.62230206e+00 2.48923913e-01 4.01146322e-01\n 2.80656636e-01 1.27653051e-02 -3.20829544e-03 2.02241302e-01\n -5.41586697e-01 -1.14304304e-01 -1.99872398e+00 1.38764801e-02\n -1.33726494e-02 3.22115749e-01]\n [-2.13001803e-01 8.86071086e-01 -6.17914915e-01 3.34705343e-03\n -6.81548417e-01 2.18077660e-01 -4.58603710e-01 4.15837139e-01\n 1.76033378e-01 7.70403119e-03 -2.26103757e-02 -1.65600866e-01\n 1.96048722e-01 -1.57556009e+00 1.50249600e-01 5.32975281e-03\n -7.74601877e-01 -2.26246262e+00]\n [ 8.70332539e-01 1.16303051e+00 -4.54434544e-01 7.91639626e-01\n 2.38170528e+00 1.62957358e+00 4.73214477e-01 4.96604085e-01\n 1.74844101e-01 4.87380847e-03 9.02351178e-03 2.39675626e-01\n 5.81068277e-01 -1.23136997e+00 1.78736851e-01 -6.82992935e-01\n -3.72468144e-01 4.53570515e-01]\n [ 5.05744040e-01 -6.08433425e-01 -6.18619561e-01 1.11604893e+00\n 1.68074358e+00 8.94423127e-02 1.97966352e-01 -4.77643132e-01\n 1.75570250e-01 7.30059808e-03 -1.74166821e-02 4.03451435e-02\n -3.60598445e-01 1.10507941e+00 1.57289729e-01 -6.91817939e-01\n 1.20354377e-01 -2.68469602e-01]\n [ 4.58459109e-01 -6.32474795e-02 1.34126651e+00 1.86991259e-01\n -3.33351805e-03 1.75939322e-01 7.44066238e-02 -4.03565109e-01\n 3.97549540e-01 1.27653051e-02 -3.20829544e-03 2.02241302e-01\n -5.41586697e-01 -1.14304304e-01 1.58009470e-01 1.38764801e-02\n -1.33726494e-02 3.22115749e-01]\n [-8.39661211e-02 -1.17742014e+00 -6.24280989e-01 4.88236874e-01\n 9.66886152e-03 1.73939466e-01 -3.73442799e-01 -7.16794491e-01\n 1.84498042e-01 1.27729923e-02 -3.21816676e-03 2.03345343e-01\n -5.41585028e-01 -1.14297539e-01 1.58017024e-01 1.38275996e-02\n -1.33512337e-02 3.22115749e-01]\n [-1.97347905e-02 7.12898433e-01 -6.35772347e-01 1.35110724e+00\n 4.30738330e-01 1.55417889e-01 -1.55059278e-01 1.80468336e-01\n 1.76867560e-01 -3.25059099e-03 -3.03398222e-02 4.09078687e-01\n 2.17703581e+00 -6.22471809e-01 1.55318916e-01 -9.73898888e-01\n 5.86492777e-01 1.06259084e+00]\n [-7.50133753e-01 -8.17956924e-01 -6.07721150e-01 1.78844035e-01\n 4.01540846e-02 2.17941627e-01 -9.40071642e-01 -4.41574305e-01\n 1.79033607e-01 1.06515735e-02 2.56034322e-02 -6.04372174e-02\n -3.77732605e-01 5.17665148e-01 9.72160697e-02 7.28026152e-01\n 3.53368789e-01 -6.43762201e-02]\n [ 3.79346967e-01 -1.52194396e-01 -6.32097661e-01 5.33092320e-01\n 6.96313500e-01 1.83710456e-01 1.41984463e+00 1.55620620e-01\n 1.80514097e-01 -2.47702766e-02 -1.05902866e-01 -2.63056731e+00\n 2.29182529e+00 2.25853696e-01 1.65165767e-01 -1.31519997e+00\n -1.33653617e+00 -3.24605513e+00]\n [ 3.37995172e-01 -6.64211631e-01 -6.04525506e-01 2.25627497e-01\n -9.76223648e-01 2.22614408e-01 4.48876172e-01 -3.97912592e-01\n 1.81468531e-01 1.65736992e-02 -1.86398745e-01 -5.85550904e-01\n 2.46948504e+00 2.52592176e-01 1.50842085e-01 -2.01046538e+00\n 7.60450512e-02 -2.51305175e+00]\n [ 5.91333628e-01 -4.68247443e-01 -6.10298336e-01 6.14459552e-02\n -3.22110914e-02 1.80147618e-01 1.32853878e+00 -2.15130359e-01\n 1.81989267e-01 6.19213246e-02 1.13603525e-01 -1.14834678e+00\n 2.20757174e+00 9.10988867e-01 1.47318617e-01 3.97870839e-01\n 2.09576368e+00 -2.43614888e+00]\n [ 5.42468309e-01 1.02214324e+00 -3.70185554e-01 1.01356804e+00\n -1.25892448e+00 1.93558621e+00 1.14418709e+00 4.73706543e-01\n 1.82151586e-01 9.57043283e-03 2.01452777e-01 -1.98455498e-01\n 2.16167378e+00 -2.49753505e-01 1.62371829e-01 -2.02729630e+00\n 7.79316247e-01 -6.14940524e-01]\n [ 1.17335165e+00 1.13087571e+00 -6.07395232e-01 -7.71265477e-03\n 1.50174236e+00 1.86463580e-01 6.67538345e-01 -1.77957103e-01\n 1.75951824e-01 2.04119515e-02 -2.51293294e-02 6.09228492e-01\n 1.76121652e-01 -9.89417672e-01 1.56229988e-01 8.17388535e-01\n 3.19528311e-01 8.82995069e-01]\n [-4.15034384e-01 -1.14319921e-02 6.69555426e-01 -1.91472995e+00\n 4.13903087e-01 -9.78388369e-01 -1.09302545e+00 -4.16571617e-01\n 1.84482977e-01 1.27483746e-02 -3.17255640e-03 2.03571603e-01\n -5.41523159e-01 -1.14097305e-01 1.58125281e-01 1.20112654e-02\n -1.20831421e-02 3.22547495e-01]]"}, "_last_episode_starts": {":type:": "<class 'numpy.ndarray'>", ":serialized:": "gAWVgwAAAAAAAACMEm51bXB5LmNvcmUubnVtZXJpY5SMC19mcm9tYnVmZmVylJOUKJYQAAAAAAAAAAEBAQEBAQEBAQEBAQEBAQGUjAVudW1weZSMBWR0eXBllJOUjAJiMZSJiIeUUpQoSwOMAXyUTk5OSv////9K/////0sAdJRiSxCFlIwBQ5R0lFKULg=="}, "_last_original_obs": {":type:": "<class 'collections.OrderedDict'>", ":serialized:": "gAWV+wYAAAAAAACMC2NvbGxlY3Rpb25zlIwLT3JkZXJlZERpY3SUk5QpUpQojA1hY2hpZXZlZF9nb2FslIwSbnVtcHkuY29yZS5udW1lcmljlIwLX2Zyb21idWZmZXKUk5QolsAAAAAAAAAAzNoePpeDsj10bnE8BCiKPUbkDT6El3U8zaHePfTv0D2oS7I8qpOxvP8L1z0t02s896MZPsai+D2/c2o8eGbLPYyHnL1TSms8lcScPcxxe72PwvU8tA/Eu5P8/717mHU83FQMPVpTaj2tyWw8lBrivYOHjb2+SW889yKmPpOoVT07/3A8OgwVPji+dr1DGXI8joWdPnttvbwks3I8tiCMPgId7z0b43I8ZU8+Pgguf7wUu2s8NeoNvqAhg70IlHU8lIwFbnVtcHmUjAVkdHlwZZSTlIwCZjSUiYiHlFKUKEsDjAE8lE5OTkr/////Sv////9LAHSUYksQSwOGlIwBQ5R0lFKUjAxkZXNpcmVkX2dvYWyUaAcolsAAAAAAAAAADA/yPpzSED6PwvU8UaDFPjgemL2PwvU89gPDPpy4LL2PwvU8pJ7APglUBb6PwvU8vXuiPmcPNr2PwvU81Y3APl8MHz2PwvU8yETUPnHhrr2PwvU863HePmHFZb2PwvU8xefBPh406TyPwvU88g6ZPnjTGD6PwvU8RcL5Pmhwgj2PwvU8oXfsPnVOG72PwvU8uKbWPttzhjyPwvU8U98IP6PFyj2PwvU8Sw+BPkBVrL2PwvU8kvPlPt7vEb6PwvU8lGgOSxBLA4aUaBJ0lFKUjAtvYnNlcnZhdGlvbpRoByiWgAQAAAAAAADUceQ9ose7PXMrmTvALwU+0v8BP8tz0LzM2h4+l4OyPXRucTwdHjW7cKioux8X87xRybY+0BmWPU+ygL0uakTAmm8JwAdhnL9oaqA8s6I5PjUSvzyZ6E++67FhPgULhr8EKIo9RuQNPoSXdTxhVVC4ELiYN4CqKTpDxpK2tMsNuPJZaDcpahI76FeMuUAN1ra07JY9cdjLPPQsPj4zSmM+t7CUP1xwnr/Nod499O/QPahLsjwAAAAAAAAAgAAAAAAAAAAAAAAAAKG3yL4AAAAAAAAAAAAAAAA4XIK96KEZPhqQnzsxEXS9WRGNvr2ymDyqk7G8/wvXPS3Tazwtvbi6NIjdurs/I76pKog+AhWavo3guLpKGYW8VPO5v+sJCcAN+s49O3RGPo5/qzzR5Ug+2gp4P7OcJD/3oxk+xqL4Pb9zajyRBRC7hqmLOkLohDylPc8+8ZNrvo7pdjuXkam/8W8vv94J3z2S5To9sXWwvTZLnTudV5o+kiQvP6C4HL14Zss9jIecvVNKazzodse6Dzuiuhazj70RpIU975OAPoMuCblRt6u/YaqCPr+C+r7qch09GWwarEMjSj4AAAAAAAAAgAAAAACVxJw9zHF7vY/C9TwAAAAAAAAAgAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA3XDS9wE80vioQiztWLsg9JxatOyjnZ7q0D8S7k/z/vXuYdTx4qA82kNRmtVzd+jk5kSQ1ihO7NUxBuDXE3sK4gGwrOHdahC7IWwy93Db7PVUeTDtWZME+nJI0Poa6FLzcVAw9WlNqPa3JbDyiJZK7rOQau+qWtz1i7Xo/t1XWvTk0ALrJWvC/eYiSP4ULHT/qzhS+pkb0vWVjwDvYPi27zbmQPIw0mDyUGuK9g4eNvb5JbzzyTRq6SHwkO4An6b1s+nE9p0YFPmYMNbwExq0/NywzP5Dwo74OXNg8mFBmvODIYzvp/OU9j4aRPrNIYTv3IqY+k6hVPTv/cDwlQiu8+pESvI8mnb/awoI/x3iPPSx/qjqbsyHA5JshwMQwPcCT2qQ8boPCvbqtyjvCZE08HFzKvmAjqTw6DBU+OL52vUMZcjwZAos6j7qCvM3Prr7I9Yo/4r+aPdXCqrpxSnbAwb0uPk5TFsAqM3A9/RWDvdYWuDs62ia9NjVAvM7+8zqOhZ0+e229vCSzcjz/RmA8D7gmPCnZFb/ovn0/WDlYPvi0/rrG3zo/nc2AQHU/EsCMxFE9U6cvPlxS7zw8UYk+opQCv/lDRz+2IIw+Ah3vPRvjcjyVOmm6+AySPHPUMb5mgnk/24TkvD3dTzqoVnjAp6LBP768Rr+mqhY+EUBBPg9wwTsEYoG94IYcP2aMmDtlTz4+CC5/vBS7azycjQs7GEv6uh+fND72fIQ+d404vleVqbmahMM/zaMiPwTp7T5MRcG9IisGPDeLBj68ky6/vJEtPta3Ar816g2+oCGDvQiUdTztNJ62L+5QNoMjFzrQIMA3IM4yOFmTsDd8YGi7PUchO1mGuzmUaA5LEEsShpRoEnSUUpR1Lg==", "achieved_goal": "[[ 0.15513152 0.08716505 0.01473581]\n [ 0.06745914 0.1385661 0.01498974]\n [ 0.10870705 0.10202017 0.02176459]\n [-0.02167686 0.10500335 0.01439361]\n [ 0.15003954 0.12140422 0.01430982]\n [ 0.09931654 -0.07643041 0.01436098]\n [ 0.07654683 -0.06138782 0.03 ]\n [-0.00598332 -0.12499347 0.01498997]\n [ 0.03426062 0.0572084 0.01445238]\n [-0.11040226 -0.06910612 0.01460498]\n [ 0.3244855 0.05216272 0.01470929]\n [ 0.14555445 -0.06024 0.01477653]\n [ 0.30765957 -0.02312349 0.01481322]\n [ 0.27368706 0.11675455 0.01482465]\n [ 0.18584974 -0.01557494 0.01438786]\n [-0.13858874 -0.06402898 0.01498891]]", "desired_goal": "[[ 0.47277105 0.14142841 0.03 ]\n [ 0.38598874 -0.07427639 0.03 ]\n [ 0.3808896 -0.04216824 0.03 ]\n [ 0.37621033 -0.13020338 0.03 ]\n [ 0.3173503 -0.04444828 0.03 ]\n [ 0.3760821 0.03883016 0.03 ]\n [ 0.41458726 -0.08539093 0.03 ]\n [ 0.43446288 -0.05609644 0.03 ]\n [ 0.3787214 0.02846723 0.03 ]\n [ 0.29894215 0.14924419 0.03 ]\n [ 0.48781028 0.06369096 0.03 ]\n [ 0.4618502 -0.03791662 0.03 ]\n [ 0.4192407 0.01641267 0.03 ]\n [ 0.53465766 0.09900977 0.03 ]\n [ 0.2520698 -0.08414698 0.03 ]\n [ 0.44912392 -0.14251658 0.03 ]]", "observation": "[[ 1.11545235e-01 9.16893631e-02 4.67436900e-03 1.30064964e-01\n 5.07809758e-01 -2.54458394e-02 1.55131519e-01 8.71650502e-02\n 1.47358067e-02 -2.76363571e-03 -5.14703244e-03 -2.96741109e-02\n 3.57004672e-01 7.32914209e-02 -6.28400967e-02 -3.06898069e+00\n -2.14743662e+00 -1.22171104e+00]\n [ 1.95819885e-02 1.81284711e-01 2.33241115e-02 -2.03035727e-01\n 2.20405266e-01 -1.04721129e+00 6.74591362e-02 1.38566107e-01\n 1.49897374e-02 -4.96705798e-05 1.82055228e-05 6.47224486e-04\n -4.37421977e-06 -3.38067330e-05 1.38492196e-05 2.23411084e-03\n -2.67683761e-04 -6.37923949e-06]\n [ 7.36936629e-02 2.48834807e-02 1.85718358e-01 2.21962735e-01\n 1.16164291e+00 -1.23780394e+00 1.08707048e-01 1.02020174e-01\n 2.17645913e-02 0.00000000e+00 -0.00000000e+00 0.00000000e+00\n 0.00000000e+00 0.00000000e+00 -3.92025977e-01 0.00000000e+00\n 0.00000000e+00 0.00000000e+00]\n [-6.36524558e-02 1.50031686e-01 4.86947317e-03 -5.95867075e-02\n -2.75522977e-01 1.86399166e-02 -2.16768570e-02 1.05003349e-01\n 1.43936099e-02 -1.40944647e-03 -1.69015536e-03 -1.59422800e-01\n 2.65950471e-01 -3.00941527e-01 -1.41050073e-03 -1.62474103e-02\n -1.45273829e+00 -2.14123034e+00]\n [ 1.01062872e-01 1.93802759e-01 2.09348463e-02 1.96189180e-01\n 9.68915582e-01 6.43016040e-01 1.50039539e-01 1.21404216e-01\n 1.43098226e-02 -2.19759741e-03 1.06553803e-03 1.62240304e-02\n 4.04767185e-01 -2.30056539e-01 3.76758305e-03 -1.32475555e+00\n -6.85301840e-01 1.08905539e-01]\n [ 4.56290916e-02 -8.61619785e-02 4.80022561e-03 3.01449686e-01\n 6.84151769e-01 -3.82620096e-02 9.93165374e-02 -7.64304101e-02\n 1.43609820e-02 -1.52179319e-03 -1.23772200e-03 -7.01657981e-02\n 6.52543381e-02 2.51128644e-01 -1.30826651e-04 -1.34153187e+00\n 2.55206138e-01 -4.89278764e-01]\n [ 3.84396687e-02 -2.19447225e-12 1.97400138e-01 0.00000000e+00\n -0.00000000e+00 0.00000000e+00 7.65468255e-02 -6.13878220e-02\n 2.99999993e-02 0.00000000e+00 -0.00000000e+00 0.00000000e+00\n 0.00000000e+00 0.00000000e+00 0.00000000e+00 0.00000000e+00\n 0.00000000e+00 0.00000000e+00]\n [-4.40332554e-02 -1.76085472e-01 4.24387027e-03 9.77446288e-02\n 5.28218178e-03 -8.84639565e-04 -5.98331727e-03 -1.24993466e-01\n 1.49899675e-02 2.14067222e-06 -8.59909960e-07 4.78486414e-04\n 6.13060877e-07 1.39382723e-06 1.37280722e-06 -9.29213420e-05\n 4.08706255e-05 6.01873482e-11]\n [-3.42672169e-02 1.22663230e-01 3.11460090e-03 3.77718627e-01\n 1.76340520e-01 -9.07767378e-03 3.42606157e-02 5.72083965e-02\n 1.44523801e-02 -4.46005259e-03 -2.36348342e-03 8.96433145e-02\n 9.80184674e-01 -1.04655676e-01 -4.89059428e-04 -1.87777054e+00\n 1.14478981e+00 6.13457024e-01]\n [-1.45320565e-01 -1.19275376e-01 5.87122375e-03 -2.64351629e-03\n 1.76667217e-02 1.85797438e-02 -1.10402256e-01 -6.91061243e-02\n 1.46049839e-02 -5.88624855e-04 2.50984915e-03 -1.13844872e-01\n 5.90767115e-02 1.30152330e-01 -1.10503193e-02 1.35760546e+00\n 6.99893415e-01 -3.20194721e-01]\n [ 2.64110826e-02 -1.40573010e-02 3.47571820e-03 1.12298794e-01\n 2.84229726e-01 3.43756075e-03 3.24485511e-01 5.21627180e-02\n 1.47092892e-02 -1.04527818e-02 -8.94593634e-03 -1.22773921e+00\n 1.02157140e+00 7.00545833e-02 1.30078709e-03 -2.52658725e+00\n -2.52513981e+00 -2.95610142e+00]\n [ 2.01237556e-02 -9.49772447e-02 6.18526060e-03 1.25362296e-02\n -3.95233989e-01 2.06467509e-02 1.45554453e-01 -6.02400005e-02\n 1.47765307e-02 1.06054835e-03 -1.59580987e-02 -3.41429144e-01\n 1.08562565e+00 7.55612999e-02 -1.30280352e-03 -3.84829354e+00\n 1.70645729e-01 -2.34883451e+00]\n [ 5.86425439e-02 -6.40067831e-02 5.61795663e-03 -4.07354608e-02\n -1.17314365e-02 1.86153664e-03 3.07659566e-01 -2.31234934e-02\n 1.48132183e-02 1.36888018e-02 1.01757189e-02 -5.85344851e-01\n 9.91194248e-01 2.11156249e-01 -1.94325950e-03 7.29977012e-01\n 4.02509928e+00 -2.28512311e+00]\n [ 5.12128323e-02 1.71536729e-01 2.92140767e-02 2.68197894e-01\n -5.10080457e-01 7.78380930e-01 2.73687065e-01 1.16754547e-01\n 1.48246540e-02 -8.89697403e-04 1.78284496e-02 -1.73661992e-01\n 9.74645972e-01 -2.78953817e-02 7.92939041e-04 -3.88028908e+00\n 1.51277626e+00 -7.76317477e-01]\n [ 1.47135347e-01 1.88720956e-01 5.90325100e-03 -6.31752312e-02\n 6.11433029e-01 4.65540867e-03 1.85849741e-01 -1.55749395e-02\n 1.43878646e-02 2.12941226e-03 -1.90958660e-03 1.76388249e-01\n 2.58765876e-01 -1.80227146e-01 -3.23454587e-04 1.52748418e+00\n 6.35311902e-01 4.64668393e-01]\n [-9.43704545e-02 8.18899460e-03 1.31390437e-01 -6.81941748e-01\n 1.69501245e-01 -5.10617614e-01 -1.38588741e-01 -6.40289783e-02\n 1.49889067e-02 -4.71492831e-06 3.11330564e-06 5.76548453e-04\n 2.29034631e-05 4.26304759e-05 2.10494491e-05 -3.54578998e-03\n 2.46091117e-03 3.57675162e-04]]"}, "_episode_num": 93648, "use_sde": false, "sde_sample_freq": -1, "_current_progress_remaining": 0.3990688, "_stats_window_size": 100, "ep_info_buffer": {":type:": "<class 'collections.deque'>", ":serialized:": "gAWV4AsAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKUKH2UKIwBcpRHwCoAAAAAAACMAWyUSw6MAXSUR0DBNwlu76HkdX2UKGgGR8A6AAAAAAAAaAdLG2gIR0DBNv77ZWaMdX2UKGgGR8AxAAAAAAAAaAdLEmgIR0DBNxuJiy6ddX2UKGgGR8AwAAAAAAAAaAdLEWgIR0DBN3wMKCxvdX2UKGgGR8AiAAAAAAAAaAdLCmgIR0DBN1fjQzDXdX2UKGgGR8A9AAAAAAAAaAdLHmgIR0DBN7wHmig1dX2UKGgGR8A1AAAAAAAAaAdLFmgIR0DBNz6q+8GtdX2UKGgGR8A1AAAAAAAAaAdLFmgIR0DBN+i6vq1PdX2UKGgGR8BBAAAAAAAAaAdLI2gIR0DBN9NEd/8VdX2UKGgGR8A2AAAAAAAAaAdLF2gIR0DBNyFoBaLXdX2UKGgGR8A4AAAAAAAAaAdLGWgIR0DBN/3L7oB8dX2UKGgGR8AmAAAAAAAAaAdLDGgIR0DBN0gAEMb4dX2UKGgGR8BCgAAAAAAAaAdLJmgIR0DBN0Ob1AZ9dX2UKGgGR8AiAAAAAAAAaAdLCmgIR0DBN31x82JjdX2UKGgGR8AqAAAAAAAAaAdLDmgIR0DBN2+AI6bOdX2UKGgGR8AuAAAAAAAAaAdLEGgIR0DBN9qhakhzdX2UKGgGR8A8AAAAAAAAaAdLHWgIR0DBN52lyimEdX2UKGgGR8AwAAAAAAAAaAdLEWgIR0DBN70v/R3NdX2UKGgGR8BAAAAAAAAAaAdLIWgIR0DBN1N1uBMBdX2UKGgGR8A2AAAAAAAAaAdLF2gIR0DBN5Ate2NOdX2UKGgGR8AzAAAAAAAAaAdLFGgIR0DBODQJ1JUYdX2UKGgGR8AsAAAAAAAAaAdLD2gIR0DBN533rUsndX2UKGgGR8AqAAAAAAAAaAdLDmgIR0DBN5Lo8p1BdX2UKGgGR8A8AAAAAAAAaAdLHWgIR0DBN40NDtw8dX2UKGgGR8AwAAAAAAAAaAdLEWgIR0DBOF3rpqyodX2UKGgGR8AzAAAAAAAAaAdLFGgIR0DBOF0qlP8AdX2UKGgGR8A2AAAAAAAAaAdLF2gIR0DBOFd81Gb1dX2UKGgGR8BIgAAAAAAAaAdLMmgIR0DBOFMpd8iOdX2UKGgGR8A1AAAAAAAAaAdLFmgIR0DBN/q/O+qSdX2UKGgGR8AxAAAAAAAAaAdLEmgIR0DBOCQD9wWFdX2UKGgGR8A0AAAAAAAAaAdLFWgIR0DBOBXSH/LldX2UKGgGR8AuAAAAAAAAaAdLEGgIR0DBN+uJzkp7dX2UKGgGR8A0AAAAAAAAaAdLFWgIR0DBN8msA/9pdX2UKGgGR8AuAAAAAAAAaAdLEGgIR0DBOLq2fChwdX2UKGgGR8AzAAAAAAAAaAdLFGgIR0DBOKV6kZaWdX2UKGgGR8AwAAAAAAAAaAdLEWgIR0DBN+9ke6qbdX2UKGgGR8AwAAAAAAAAaAdLEWgIR0DBOL499tuUdX2UKGgGR8A/AAAAAAAAaAdLIGgIR0DBOCeGKyfMdX2UKGgGR8AqAAAAAAAAaAdLDmgIR0DBOKSgqVhTdX2UKGgGR8A9AAAAAAAAaAdLHmgIR0DBOIf40uUVdX2UKGgGR8A3AAAAAAAAaAdLGGgIR0DBOCtlwtJ4dX2UKGgGR8AsAAAAAAAAaAdLD2gIR0DBOHAFkhA4dX2UKGgGR8A1AAAAAAAAaAdLFmgIR0DBONjQPZqVdX2UKGgGR8AkAAAAAAAAaAdLC2gIR0DBOOeuaF23dX2UKGgGR8BAgAAAAAAAaAdLImgIR0DBOFeFBY3edX2UKGgGR8AoAAAAAAAAaAdLDWgIR0DBOD8UTL4fdX2UKGgGR8AkAAAAAAAAaAdLC2gIR0DBOOdTDO1OdX2UKGgGR8AwAAAAAAAAaAdLEWgIR0DBOFR55Z8sdX2UKGgGR8BJAAAAAAAAaAdLMmgIR0DBOEQV0tAcdX2UKGgGR8A5AAAAAAAAaAdLGmgIR0DBOL3fj0cwdX2UKGgGR8A8AAAAAAAAaAdLHWgIR0DBOKQ+EAYIdX2UKGgGR8A3AAAAAAAAaAdLGGgIR0DBOUbnDBM0dX2UKGgGR8AkAAAAAAAAaAdLC2gIR0DBOHtTLns+dX2UKGgGR8A5AAAAAAAAaAdLGmgIR0DBOGSM3qA0dX2UKGgGR8A2AAAAAAAAaAdLF2gIR0DBOULWiDdydX2UKGgGR8A5AAAAAAAAaAdLGmgIR0DBOLyVQhwEdX2UKGgGR8AwAAAAAAAAaAdLEWgIR0DBOLhA4XGfdX2UKGgGR8A2AAAAAAAAaAdLF2gIR0DBOK3HPu5SdX2UKGgGR8A0AAAAAAAAaAdLFWgIR0DBOOhHy3CsdX2UKGgGR8AzAAAAAAAAaAdLFGgIR0DBOVpJAdGRdX2UKGgGR8AuAAAAAAAAaAdLEGgIR0DBOK7KLbYcdX2UKGgGR8AxAAAAAAAAaAdLEmgIR0DBOKyRB/qgdX2UKGgGR8AYAAAAAAAAaAdLB2gIR0DBONin+AEudX2UKGgGR8AmAAAAAAAAaAdLDGgIR0DBOMKHwgDBdX2UKGgGR8AwAAAAAAAAaAdLEWgIR0DBOSAGQjlgdX2UKGgGR8AuAAAAAAAAaAdLEGgIR0DBOaV1+y7gdX2UKGgGR8A+AAAAAAAAaAdLH2gIR0DBOYvA9FF2dX2UKGgGR8A7AAAAAAAAaAdLHGgIR0DBOYdEPUaydX2UKGgGR8A4AAAAAAAAaAdLGWgIR0DBOPh5VwPzdX2UKGgGR8BHAAAAAAAAaAdLL2gIR0DBOZ0nogV5dX2UKGgGR8A8AAAAAAAAaAdLHWgIR0DBOU/I2fkFdX2UKGgGR8A5AAAAAAAAaAdLGmgIR0DBOd9MVUModX2UKGgGR8A2AAAAAAAAaAdLF2gIR0DBOUg2Ifr9dX2UKGgGR8AzAAAAAAAAaAdLFGgIR0DBOSf8MuvmdX2UKGgGR8AwAAAAAAAAaAdLEWgIR0DBORIo3JgcdX2UKGgGR8AoAAAAAAAAaAdLDWgIR0DBOfa5d4VzdX2UKGgGR8A2AAAAAAAAaAdLF2gIR0DBOebhUBGQdX2UKGgGR8A2AAAAAAAAaAdLF2gIR0DBOamXb/OudX2UKGgGR8A/AAAAAAAAaAdLIGgIR0DBOXeearmydX2UKGgGR8AzAAAAAAAAaAdLFGgIR0DBOf9JJ5E/dX2UKGgGR8A5AAAAAAAAaAdLGmgIR0DBOVw2MsH0dX2UKGgGR8A6AAAAAAAAaAdLG2gIR0DBOXfrpqyodX2UKGgGR8AiAAAAAAAAaAdLCmgIR0DBOh6JbdJrdX2UKGgGR8AsAAAAAAAAaAdLD2gIR0DBOjRfICEIdX2UKGgGR8AuAAAAAAAAaAdLEGgIR0DBOaOb3Gn5dX2UKGgGR8AgAAAAAAAAaAdLCWgIR0DBOaqBVdX1dX2UKGgGR8AyAAAAAAAAaAdLE2gIR0DBOZTThHbzdX2UKGgGR8AyAAAAAAAAaAdLE2gIR0DBOX78UEgXdX2UKGgGR8A5AAAAAAAAaAdLGmgIR0DBOi/Qtz0ZdX2UKGgGR8A5AAAAAAAAaAdLGmgIR0DBOeJw4sErdX2UKGgGR8AqAAAAAAAAaAdLDmgIR0DBOcSu2Zy/dX2UKGgGR8AsAAAAAAAAaAdLD2gIR0DBOa6m/FisdX2UKGgGR8A4AAAAAAAAaAdLGWgIR0DBOn/NcGC7dX2UKGgGR8AuAAAAAAAAaAdLEGgIR0DBOlc384xUdX2UKGgGR8BCgAAAAAAAaAdLJmgIR0DBOmiVD8cddX2UKGgGR8BJAAAAAAAAaAdLMmgIR0DBOgnLidaudX2UKGgGR8AuAAAAAAAAaAdLEGgIR0DBOo+0zCUHdX2UKGgGR8BAAAAAAAAAaAdLIWgIR0DBObZ8c+7ldX2UKGgGR8A1AAAAAAAAaAdLFmgIR0DBOpjteD3/dX2UKGgGR8A6AAAAAAAAaAdLG2gIR0DBOj+ivgWKdWUu"}, "ep_success_buffer": {":type:": "<class 'collections.deque'>", ":serialized:": "gAWVhgAAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKUKIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiImIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiYiIiIhlLg=="}, "_n_updates": 187785, "buffer_size": 1000000, "batch_size": 2048, "learning_starts": 100, "tau": 0.005, "gamma": 0.99, "gradient_steps": 1, "optimize_memory_usage": false, "replay_buffer_class": {":type:": "<class 'abc.ABCMeta'>", ":serialized:": "gAWVOQAAAAAAAACMIHN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbi5idWZmZXJzlIwQRGljdFJlcGxheUJ1ZmZlcpSTlC4=", "__module__": "stable_baselines3.common.buffers", "__annotations__": "{'observation_space': <class 'gymnasium.spaces.dict.Dict'>, 'obs_shape': typing.Dict[str, typing.Tuple[int, ...]], 'observations': typing.Dict[str, numpy.ndarray], 'next_observations': typing.Dict[str, numpy.ndarray]}", "__doc__": "\n Dict Replay buffer used in off-policy algorithms like SAC/TD3.\n Extends the ReplayBuffer to use dictionary observations\n\n :param buffer_size: Max number of element in the buffer\n :param observation_space: Observation space\n :param action_space: Action space\n :param device: PyTorch device\n :param n_envs: Number of parallel environments\n :param optimize_memory_usage: Enable a memory efficient variant\n Disabled for now (see https://github.com/DLR-RM/stable-baselines3/pull/243#discussion_r531535702)\n :param handle_timeout_termination: Handle timeout termination (due to timelimit)\n separately and treat the task as infinite horizon task.\n https://github.com/DLR-RM/stable-baselines3/issues/284\n ", "__init__": "<function DictReplayBuffer.__init__ at 0x7ff631b03e20>", "add": "<function DictReplayBuffer.add at 0x7ff631b03eb0>", "sample": "<function DictReplayBuffer.sample at 0x7ff631b03f40>", "_get_samples": "<function DictReplayBuffer._get_samples at 0x7ff631b28040>", "__abstractmethods__": "frozenset()", "_abc_impl": "<_abc._abc_data object at 0x7ff631b1d3c0>"}, "replay_buffer_kwargs": {}, "train_freq": {":type:": "<class 'stable_baselines3.common.type_aliases.TrainFreq'>", ":serialized:": "gAWVYQAAAAAAAACMJXN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbi50eXBlX2FsaWFzZXOUjAlUcmFpbkZyZXGUk5RLAWgAjBJUcmFpbkZyZXF1ZW5jeVVuaXSUk5SMBHN0ZXCUhZRSlIaUgZQu"}, "use_sde_at_warmup": false, "target_entropy": -3.0, "ent_coef": "auto", "target_update_interval": 1, "observation_space": {":type:": "<class 'gymnasium.spaces.dict.Dict'>", ":serialized:": "gAWVKAQAAAAAAACMFWd5bW5hc2l1bS5zcGFjZXMuZGljdJSMBERpY3SUk5QpgZR9lCiMBnNwYWNlc5SMC2NvbGxlY3Rpb25zlIwLT3JkZXJlZERpY3SUk5QpUpQojA1hY2hpZXZlZF9nb2FslIwUZ3ltbmFzaXVtLnNwYWNlcy5ib3iUjANCb3iUk5QpgZR9lCiMBWR0eXBllIwFbnVtcHmUjAVkdHlwZZSTlIwCZjSUiYiHlFKUKEsDjAE8lE5OTkr/////Sv////9LAHSUYowNYm91bmRlZF9iZWxvd5SMEm51bXB5LmNvcmUubnVtZXJpY5SMC19mcm9tYnVmZmVylJOUKJYDAAAAAAAAAAEBAZRoE4wCYjGUiYiHlFKUKEsDjAF8lE5OTkr/////Sv////9LAHSUYksDhZSMAUOUdJRSlIwNYm91bmRlZF9hYm92ZZRoHCiWAwAAAAAAAAABAQGUaCBLA4WUaCR0lFKUjAZfc2hhcGWUSwOFlIwDbG93lGgcKJYMAAAAAAAAAAAAIMEAACDBAAAgwZRoFksDhZRoJHSUUpSMBGhpZ2iUaBwolgwAAAAAAAAAAAAgQQAAIEEAACBBlGgWSwOFlGgkdJRSlIwIbG93X3JlcHKUjAUtMTAuMJSMCWhpZ2hfcmVwcpSMBDEwLjCUjApfbnBfcmFuZG9tlE51YowMZGVzaXJlZF9nb2FslGgNKYGUfZQoaBBoFmgZaBwolgMAAAAAAAAAAQEBlGggSwOFlGgkdJRSlGgnaBwolgMAAAAAAAAAAQEBlGggSwOFlGgkdJRSlGgsSwOFlGguaBwolgwAAAAAAAAAAAAgwQAAIMEAACDBlGgWSwOFlGgkdJRSlGgzaBwolgwAAAAAAAAAAAAgQQAAIEEAACBBlGgWSwOFlGgkdJRSlGg4jAUtMTAuMJRoOowEMTAuMJRoPE51YowLb2JzZXJ2YXRpb26UaA0pgZR9lChoEGgWaBloHCiWEgAAAAAAAAABAQEBAQEBAQEBAQEBAQEBAQGUaCBLEoWUaCR0lFKUaCdoHCiWEgAAAAAAAAABAQEBAQEBAQEBAQEBAQEBAQGUaCBLEoWUaCR0lFKUaCxLEoWUaC5oHCiWSAAAAAAAAAAAACDBAAAgwQAAIMEAACDBAAAgwQAAIMEAACDBAAAgwQAAIMEAACDBAAAgwQAAIMEAACDBAAAgwQAAIMEAACDBAAAgwQAAIMGUaBZLEoWUaCR0lFKUaDNoHCiWSAAAAAAAAAAAACBBAAAgQQAAIEEAACBBAAAgQQAAIEEAACBBAAAgQQAAIEEAACBBAAAgQQAAIEEAACBBAAAgQQAAIEEAACBBAAAgQQAAIEGUaBZLEoWUaCR0lFKUaDiMBS0xMC4wlGg6jAQxMC4wlGg8TnVidWgsTmgQTmg8TnViLg==", "spaces": "OrderedDict([('achieved_goal', Box(-10.0, 10.0, (3,), float32)), ('desired_goal', Box(-10.0, 10.0, (3,), float32)), ('observation', Box(-10.0, 10.0, (18,), float32))])", "_shape": null, "dtype": null, "_np_random": null}, "action_space": {":type:": "<class 'gymnasium.spaces.box.Box'>", ":serialized:": "gAWVYAIAAAAAAACMFGd5bW5hc2l1bS5zcGFjZXMuYm94lIwDQm94lJOUKYGUfZQojAVkdHlwZZSMBW51bXB5lIwFZHR5cGWUk5SMAmY0lImIh5RSlChLA4wBPJROTk5K/////0r/////SwB0lGKMDWJvdW5kZWRfYmVsb3eUjBJudW1weS5jb3JlLm51bWVyaWOUjAtfZnJvbWJ1ZmZlcpSTlCiWAwAAAAAAAAABAQGUaAiMAmIxlImIh5RSlChLA4wBfJROTk5K/////0r/////SwB0lGJLA4WUjAFDlHSUUpSMDWJvdW5kZWRfYWJvdmWUaBEolgMAAAAAAAAAAQEBlGgVSwOFlGgZdJRSlIwGX3NoYXBllEsDhZSMA2xvd5RoESiWDAAAAAAAAAAAAIC/AACAvwAAgL+UaAtLA4WUaBl0lFKUjARoaWdolGgRKJYMAAAAAAAAAAAAgD8AAIA/AACAP5RoC0sDhZRoGXSUUpSMCGxvd19yZXBylIwELTEuMJSMCWhpZ2hfcmVwcpSMAzEuMJSMCl9ucF9yYW5kb22UjBRudW1weS5yYW5kb20uX3BpY2tsZZSMEF9fZ2VuZXJhdG9yX2N0b3KUk5SMBVBDRzY0lGgyjBRfX2JpdF9nZW5lcmF0b3JfY3RvcpSTlIaUUpR9lCiMDWJpdF9nZW5lcmF0b3KUjAVQQ0c2NJSMBXN0YXRllH2UKGg9ihBl3wcOaCf4tNHk4wFpO1pOjANpbmOUihBHNkRLhAhbywqUoI0S9+11dYwKaGFzX3VpbnQzMpRLAIwIdWludGVnZXKUSwB1YnViLg==", "dtype": "float32", "bounded_below": "[ True True True]", "bounded_above": "[ True True True]", "_shape": [3], "low": "[-1. -1. -1.]", "high": "[1. 1. 1.]", "low_repr": "-1.0", "high_repr": "1.0", "_np_random": "Generator(PCG64)"}, "n_envs": 16, "lr_schedule": {":type:": "<class 'function'>", ":serialized:": "gAWVsQMAAAAAAACMF2Nsb3VkcGlja2xlLmNsb3VkcGlja2xllIwOX21ha2VfZnVuY3Rpb26Uk5QoaACMDV9idWlsdGluX3R5cGWUk5SMCENvZGVUeXBllIWUUpQoSwFLAEsASwFLA0sTQwx0AIgAfACDAYMBUwCUToWUjAVmbG9hdJSFlIwScHJvZ3Jlc3NfcmVtYWluaW5nlIWUjGAvaG9tZS90b21lay9weXRvcmNoX2xlYXJuaW5nL3ZlbnYvbGliL3B5dGhvbjMuMTAvc2l0ZS1wYWNrYWdlcy9zdGFibGVfYmFzZWxpbmVzMy9jb21tb24vdXRpbHMucHmUjAg8bGFtYmRhPpRLYUMCDACUjA52YWx1ZV9zY2hlZHVsZZSFlCl0lFKUfZQojAtfX3BhY2thZ2VfX5SMGHN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbpSMCF9fbmFtZV9flIwec3RhYmxlX2Jhc2VsaW5lczMuY29tbW9uLnV0aWxzlIwIX19maWxlX1+UjGAvaG9tZS90b21lay9weXRvcmNoX2xlYXJuaW5nL3ZlbnYvbGliL3B5dGhvbjMuMTAvc2l0ZS1wYWNrYWdlcy9zdGFibGVfYmFzZWxpbmVzMy9jb21tb24vdXRpbHMucHmUdU5OaACMEF9tYWtlX2VtcHR5X2NlbGyUk5QpUpSFlHSUUpRoAIwSX2Z1bmN0aW9uX3NldHN0YXRllJOUaCF9lH2UKGgYaA+MDF9fcXVhbG5hbWVfX5SMIWdldF9zY2hlZHVsZV9mbi48bG9jYWxzPi48bGFtYmRhPpSMD19fYW5ub3RhdGlvbnNfX5R9lIwOX19rd2RlZmF1bHRzX1+UTowMX19kZWZhdWx0c19flE6MCl9fbW9kdWxlX1+UaBmMB19fZG9jX1+UTowLX19jbG9zdXJlX1+UaACMCl9tYWtlX2NlbGyUk5RoAihoByhLAUsASwBLAUsBSxNDBIgAUwCUaAkpjAFflIWUaA6MBGZ1bmOUS4VDAgQBlIwDdmFslIWUKXSUUpRoFU5OaB0pUpSFlHSUUpRoI2g9fZR9lChoGGg0aCaMGWNvbnN0YW50X2ZuLjxsb2NhbHM+LmZ1bmOUaCh9lGgqTmgrTmgsaBloLU5oLmgwRz8zqSowVTJhhZRSlIWUjBdfY2xvdWRwaWNrbGVfc3VibW9kdWxlc5RdlIwLX19nbG9iYWxzX1+UfZR1hpSGUjCFlFKUhZRoRV2UaEd9lHWGlIZSMC4="}, "batch_norm_stats": [], "batch_norm_stats_target": [], "system_info": {"OS": "Linux-5.15.146.1-microsoft-standard-WSL2-x86_64-with-glibc2.35 # 1 SMP Thu Jan 11 04:09:03 UTC 2024", "Python": "3.10.12", "Stable-Baselines3": "2.3.2", "PyTorch": "2.3.0+cu121", "GPU Enabled": "True", "Numpy": "1.26.4", "Cloudpickle": "3.0.0", "Gymnasium": "0.29.1", "OpenAI Gym": "0.26.2"}}
|
replay.mp4
ADDED
Binary file (861 kB). View file
|
|
results.json
ADDED
@@ -0,0 +1 @@
|
|
|
|
|
1 |
+
{"mean_reward": -22.4, "std_reward": 9.562426470305537, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2024-06-12T18:08:37.128392"}
|
sac-PandaSlide-v3.zip
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:81b712a487812d3c1c2e09d0ef230eee666f47ea0c2025b4afcde11e4cf08112
|
3 |
+
size 23853286
|
sac-PandaSlide-v3/_stable_baselines3_version
ADDED
@@ -0,0 +1 @@
|
|
|
|
|
1 |
+
2.3.2
|
sac-PandaSlide-v3/actor.optimizer.pth
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:c0e4ad772d4e15bb12e796399c074a2a7202fddea0048c2a21d69d3cb21f1f1a
|
3 |
+
size 4338044
|
sac-PandaSlide-v3/critic.optimizer.pth
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:fed4119397a29852a8f966c838462e117296052a68ef6d984608e44cc6143d0d
|
3 |
+
size 8656134
|
sac-PandaSlide-v3/data
ADDED
@@ -0,0 +1,126 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
{
|
2 |
+
"policy_class": {
|
3 |
+
":type:": "<class 'abc.ABCMeta'>",
|
4 |
+
":serialized:": "gAWVNwAAAAAAAACMHnN0YWJsZV9iYXNlbGluZXMzLnNhYy5wb2xpY2llc5SMEE11bHRpSW5wdXRQb2xpY3mUk5Qu",
|
5 |
+
"__module__": "stable_baselines3.sac.policies",
|
6 |
+
"__doc__": "\n Policy class (with both actor and critic) for SAC.\n\n :param observation_space: Observation space\n :param action_space: Action space\n :param lr_schedule: Learning rate schedule (could be constant)\n :param net_arch: The specification of the policy and value networks.\n :param activation_fn: Activation function\n :param use_sde: Whether to use State Dependent Exploration or not\n :param log_std_init: Initial value for the log standard deviation\n :param use_expln: Use ``expln()`` function instead of ``exp()`` when using gSDE to ensure\n a positive standard deviation (cf paper). It allows to keep variance\n above zero and prevent it from growing too fast. In practice, ``exp()`` is usually enough.\n :param clip_mean: Clip the mean output when using gSDE to avoid numerical instability.\n :param features_extractor_class: Features extractor to use.\n :param normalize_images: Whether to normalize images or not,\n dividing by 255.0 (True by default)\n :param optimizer_class: The optimizer to use,\n ``th.optim.Adam`` by default\n :param optimizer_kwargs: Additional keyword arguments,\n excluding the learning rate, to pass to the optimizer\n :param n_critics: Number of critic networks to create.\n :param share_features_extractor: Whether to share or not the features extractor\n between the actor and the critic (this saves computation time)\n ",
|
7 |
+
"__init__": "<function MultiInputPolicy.__init__ at 0x7ff631973d00>",
|
8 |
+
"__abstractmethods__": "frozenset()",
|
9 |
+
"_abc_impl": "<_abc._abc_data object at 0x7ff631986c00>"
|
10 |
+
},
|
11 |
+
"verbose": 1,
|
12 |
+
"policy_kwargs": {
|
13 |
+
"net_arch": {
|
14 |
+
"pi": [
|
15 |
+
512,
|
16 |
+
512,
|
17 |
+
512
|
18 |
+
],
|
19 |
+
"qf": [
|
20 |
+
512,
|
21 |
+
512,
|
22 |
+
512
|
23 |
+
]
|
24 |
+
},
|
25 |
+
"use_sde": false
|
26 |
+
},
|
27 |
+
"num_timesteps": 3004656,
|
28 |
+
"_total_timesteps": 5000000,
|
29 |
+
"_num_timesteps_at_start": 0,
|
30 |
+
"seed": null,
|
31 |
+
"action_noise": null,
|
32 |
+
"start_time": 1718199802391038128,
|
33 |
+
"learning_rate": 0.0003,
|
34 |
+
"tensorboard_log": "./sacPandaSlide-v3/",
|
35 |
+
"_last_obs": {
|
36 |
+
":type:": "<class 'collections.OrderedDict'>",
|
37 |
+
":serialized:": "gAWV+wYAAAAAAACMC2NvbGxlY3Rpb25zlIwLT3JkZXJlZERpY3SUk5QpUpQojA1hY2hpZXZlZF9nb2FslIwSbnVtcHkuY29yZS5udW1lcmljlIwLX2Zyb21idWZmZXKUk5QolsAAAAAAAAAAczcAP4Dupz5bOzk+ao7NPDPEFD8y7Dw+6eV+Pg1jzT46so8+G87qvpvo1D4YQjQ+K0nyPuRC/j5VCjM+sbdKPqSN9L6wyDM+gGKYPRagzr6di8s+5TO/vth/N78O7Tw+3McevrHMOD7FHDU+iahwvwcW4r6WVDc+eL21PwNbHz6w2Dg+GdPlPjO7y77j0jk+jw2qPyJLXL5lWzo+uXSSP6qJ8j7yhTo+y+MqP2M6Nr63LDQ+QuiLv+BI1b4b6Tw+lIwFbnVtcHmUjAVkdHlwZZSTlIwCZjSUiYiHlFKUKEsDjAE8lE5OTkr/////Sv////9LAHSUYksQSwOGlIwBQ5R0lFKUjAxkZXNpcmVkX2dvYWyUaAcolsAAAAAAAAAAODpHPy98zD8a0yYyFfpavgbmUb8a0yYyilWLvpyO674a0yYy4rimvpQtub8a0yYy/51/v+Kj+L4a0yYyCXmnvnM45T4a0yYyaprnPWnJcb8a0yYy+DuuPkG9Hb8a0yYyWgaYvsbBqT4a0yYyob6av3Gy1z8a0yYyhD1zP+/vOT8a0yYyY0QnPykp074a0yYyjkYqPkwsST4a0yYyoSu+P2mijz8a0yYy3FTfv8A3br8a0yYy8QUCP3fXyr8a0yYylGgOSxBLA4aUaBJ0lFKUjAtvYnNlcnZhdGlvbpRoByiWgAQAAAAAAABWdHA/ZVQEP8ixHr8lfRY/0ZKfP6qD8j1zNwA/gO6nPls7OT4pMzo7cid/vdL7CD4/ruU+Q153Pu41QL7L3cy/Vr6Rv+WGk7+rOqs+uLqKP+M43L7ipOC+aQkKP4lADMBqjs08M8QUPzLsPD50OU48AJBEu9OfUD45pgq/cG7qvTXhIT5rm3Y8RmVdvFnrpD4juTA/xezAPWJ3nD+h/l4/k8o2QMzTJ8Dp5X4+DWPNPjqyjz6SJVE8REJSu1gYTz5tpQq/YBjqvTDW/78tWmM89RhbvFvspD4lHVq+jtViP6wvHr89Wls79Xkuv8BPXz4bzuq+m+jUPhhCND4Zcvw7ZTm5vEaTKb7/wEg+9KvJvwjbGT41pa47T0xGvzDMEMAdzl4/L96UP6Wr6L7lqEo/3G0YQN6V0D8rSfI+5EL+PlUKMz54tJ87WtcTPIdtdT7kwBQ/iJ2dv8sGNz6g2C6/JbS+vmU66D5xeAE/S8Ibv9pdHr+x2o4/myLXP4gttz2xt0o+pI30vrDIMz7bOe87bq2OvPNAJT1coLi+PnONP48QIT77GjG/W3z2Pdl0ib4nu+o+5YeBvZ+uqz+jej8+J3dau3ApND6AYpg9FqDOvp2Lyz6SJVE8REJSu1gYTz5tpQq/YBjqvTzNIT4tWmM89RhbvFvspD5u9qu9tLWWv+HQH78v+vk+JWoePDAdMj7lM7++2H83vw7tPD7QRVE84edSu8M5UD5RpQq/1BTqvTfPIT4ojWI8Ir9avFvspD7bqqG8g4A2P/rBIr8V8aw/vIncPt4lHz7cxx6+scw4PsUcNT7eB1W7OIv4vMNy0T6OVAtAUFofv+wLHz5wUXm/ZCQWP/oCiD/ECEC/oGVRv52TG7/kIjc+nHgkPRcsXz6JqHC/BxbivpZUNz7wgy48Sr7RPASNd70rZsG+tIUEPzgZxz3sXzo/wey0Pq7Xg73EOcI+2dgbvifRIb+9eAg/mkEyP5gePD54vbU/A1sfPrDYOD4J68q8muPYvTdbKMBErRJAMUZnPjchKT55WKi/nhOrv16/T8C0Da0+xgkqvy/CGr/lCmc+y+l5vwj1Yz4Z0+U+M7vLvuPSOT6RxYc8UN8+vqrmFb8LDB5Aw1OBPll2Gj53qwDAgr2bPdfVIMCkYRc/Ib7vvoM8HL/Brns9x+8DvZ54OD6PDao/IktcvmVbOj43oX0996joPQf9kr/bSA1AkTZpP7HaFj66tcs+/iAGQN3pG8A03wo/l9WCP/aIvb6ZvIE/cCShv0rB9z+5dJI/qonyPvKFOj5OzRw8o0lOPus3S77dWApAYr9/vs1EJj45vwHARYFHP75sHb9jMJY/icCQP0F+G79wuvy7GDnAP0/wPj7L4yo/Yzo2vrcsND73Nqc8BtzNvGb2Gz88WTQ+ekp9v8H6Hz5gQFE/N5mjPvcLYj9jf9S+QE07vPxnKz/fFfW/G+vTPql3er9C6Iu/4EjVvhvpPD6P3lA8qupPuxN1UD5DoQq/2avpvZjrIT7mykQ8X/hFvPIkpT6UaA5LEEsShpRoEnSUUpR1Lg==",
|
38 |
+
"achieved_goal": "[[ 0.5008461 0.3279915 0.18089049]\n [ 0.02509232 0.58111876 0.18449476]\n [ 0.24892391 0.40114632 0.28065664]\n [-0.4586037 0.41583714 0.17603338]\n [ 0.47321448 0.49660408 0.1748441 ]\n [ 0.19796635 -0.47764313 0.17557025]\n [ 0.07440662 -0.4035651 0.39754954]\n [-0.3734428 -0.7167945 0.18449804]\n [-0.15505928 0.18046834 0.17686756]\n [-0.94007164 -0.4415743 0.1790336 ]\n [ 1.4198446 0.15562062 0.1805141 ]\n [ 0.44887617 -0.3979126 0.18146853]\n [ 1.3285388 -0.21513036 0.18198927]\n [ 1.1441871 0.47370654 0.18215159]\n [ 0.66753834 -0.1779571 0.17595182]\n [-1.0930254 -0.41657162 0.18448298]]",
|
39 |
+
"desired_goal": "[[ 7.7823210e-01 1.5975398e+00 9.7104706e-09]\n [-2.1384461e-01 -8.1991613e-01 9.7104706e-09]\n [-2.7213699e-01 -4.6007240e-01 9.7104706e-09]\n [-3.2562929e-01 -1.4467034e+00 9.7104706e-09]\n [-9.9850458e-01 -4.8562533e-01 9.7104706e-09]\n [-3.2709530e-01 4.4769630e-01 9.7104706e-09]\n [ 1.1308749e-01 -9.4447953e-01 9.7104706e-09]\n [ 3.4030128e-01 -6.1616904e-01 9.7104706e-09]\n [-2.9692346e-01 3.3155650e-01 9.7104706e-09]\n [-1.2089425e+00 1.6851331e+00 9.7104706e-09]\n [ 9.5015740e-01 7.2631735e-01 9.7104706e-09]\n [ 6.5338725e-01 -4.1242340e-01 9.7104706e-09]\n [ 1.6628477e-01 1.9645804e-01 9.7104706e-09]\n [ 1.4857064e+00 1.1221439e+00 9.7104706e-09]\n [-1.7447772e+00 -9.3053818e-01 9.7104706e-09]\n [ 5.0790316e-01 -1.5847005e+00 9.7104706e-09]]",
|
40 |
+
"observation": "[[ 9.39275146e-01 5.16912758e-01 -6.19900227e-01 5.87847054e-01\n 1.24666798e+00 1.18415192e-01 5.00846088e-01 3.27991486e-01\n 1.80890486e-01 2.84118415e-03 -6.22934774e-02 1.33773118e-01\n 4.48595017e-01 2.41570517e-01 -1.87705725e-01 -1.60051858e+00\n -1.13862109e+00 -1.15255415e+00]\n [ 3.34431976e-01 1.08382320e+00 -4.30121511e-01 -4.38757956e-01\n 5.39206088e-01 -2.19143891e+00 2.50923224e-02 5.81118762e-01\n 1.84494764e-01 1.25869401e-02 -2.99930573e-03 2.03734681e-01\n -5.41598856e-01 -1.14468455e-01 1.58085659e-01 1.50517030e-02\n -1.35129150e-02 3.22108060e-01]\n [ 6.90324962e-01 9.42016020e-02 1.22239327e+00 8.71072829e-01\n 2.85611415e+00 -2.62230206e+00 2.48923913e-01 4.01146322e-01\n 2.80656636e-01 1.27653051e-02 -3.20829544e-03 2.02241302e-01\n -5.41586697e-01 -1.14304304e-01 -1.99872398e+00 1.38764801e-02\n -1.33726494e-02 3.22115749e-01]\n [-2.13001803e-01 8.86071086e-01 -6.17914915e-01 3.34705343e-03\n -6.81548417e-01 2.18077660e-01 -4.58603710e-01 4.15837139e-01\n 1.76033378e-01 7.70403119e-03 -2.26103757e-02 -1.65600866e-01\n 1.96048722e-01 -1.57556009e+00 1.50249600e-01 5.32975281e-03\n -7.74601877e-01 -2.26246262e+00]\n [ 8.70332539e-01 1.16303051e+00 -4.54434544e-01 7.91639626e-01\n 2.38170528e+00 1.62957358e+00 4.73214477e-01 4.96604085e-01\n 1.74844101e-01 4.87380847e-03 9.02351178e-03 2.39675626e-01\n 5.81068277e-01 -1.23136997e+00 1.78736851e-01 -6.82992935e-01\n -3.72468144e-01 4.53570515e-01]\n [ 5.05744040e-01 -6.08433425e-01 -6.18619561e-01 1.11604893e+00\n 1.68074358e+00 8.94423127e-02 1.97966352e-01 -4.77643132e-01\n 1.75570250e-01 7.30059808e-03 -1.74166821e-02 4.03451435e-02\n -3.60598445e-01 1.10507941e+00 1.57289729e-01 -6.91817939e-01\n 1.20354377e-01 -2.68469602e-01]\n [ 4.58459109e-01 -6.32474795e-02 1.34126651e+00 1.86991259e-01\n -3.33351805e-03 1.75939322e-01 7.44066238e-02 -4.03565109e-01\n 3.97549540e-01 1.27653051e-02 -3.20829544e-03 2.02241302e-01\n -5.41586697e-01 -1.14304304e-01 1.58009470e-01 1.38764801e-02\n -1.33726494e-02 3.22115749e-01]\n [-8.39661211e-02 -1.17742014e+00 -6.24280989e-01 4.88236874e-01\n 9.66886152e-03 1.73939466e-01 -3.73442799e-01 -7.16794491e-01\n 1.84498042e-01 1.27729923e-02 -3.21816676e-03 2.03345343e-01\n -5.41585028e-01 -1.14297539e-01 1.58017024e-01 1.38275996e-02\n -1.33512337e-02 3.22115749e-01]\n [-1.97347905e-02 7.12898433e-01 -6.35772347e-01 1.35110724e+00\n 4.30738330e-01 1.55417889e-01 -1.55059278e-01 1.80468336e-01\n 1.76867560e-01 -3.25059099e-03 -3.03398222e-02 4.09078687e-01\n 2.17703581e+00 -6.22471809e-01 1.55318916e-01 -9.73898888e-01\n 5.86492777e-01 1.06259084e+00]\n [-7.50133753e-01 -8.17956924e-01 -6.07721150e-01 1.78844035e-01\n 4.01540846e-02 2.17941627e-01 -9.40071642e-01 -4.41574305e-01\n 1.79033607e-01 1.06515735e-02 2.56034322e-02 -6.04372174e-02\n -3.77732605e-01 5.17665148e-01 9.72160697e-02 7.28026152e-01\n 3.53368789e-01 -6.43762201e-02]\n [ 3.79346967e-01 -1.52194396e-01 -6.32097661e-01 5.33092320e-01\n 6.96313500e-01 1.83710456e-01 1.41984463e+00 1.55620620e-01\n 1.80514097e-01 -2.47702766e-02 -1.05902866e-01 -2.63056731e+00\n 2.29182529e+00 2.25853696e-01 1.65165767e-01 -1.31519997e+00\n -1.33653617e+00 -3.24605513e+00]\n [ 3.37995172e-01 -6.64211631e-01 -6.04525506e-01 2.25627497e-01\n -9.76223648e-01 2.22614408e-01 4.48876172e-01 -3.97912592e-01\n 1.81468531e-01 1.65736992e-02 -1.86398745e-01 -5.85550904e-01\n 2.46948504e+00 2.52592176e-01 1.50842085e-01 -2.01046538e+00\n 7.60450512e-02 -2.51305175e+00]\n [ 5.91333628e-01 -4.68247443e-01 -6.10298336e-01 6.14459552e-02\n -3.22110914e-02 1.80147618e-01 1.32853878e+00 -2.15130359e-01\n 1.81989267e-01 6.19213246e-02 1.13603525e-01 -1.14834678e+00\n 2.20757174e+00 9.10988867e-01 1.47318617e-01 3.97870839e-01\n 2.09576368e+00 -2.43614888e+00]\n [ 5.42468309e-01 1.02214324e+00 -3.70185554e-01 1.01356804e+00\n -1.25892448e+00 1.93558621e+00 1.14418709e+00 4.73706543e-01\n 1.82151586e-01 9.57043283e-03 2.01452777e-01 -1.98455498e-01\n 2.16167378e+00 -2.49753505e-01 1.62371829e-01 -2.02729630e+00\n 7.79316247e-01 -6.14940524e-01]\n [ 1.17335165e+00 1.13087571e+00 -6.07395232e-01 -7.71265477e-03\n 1.50174236e+00 1.86463580e-01 6.67538345e-01 -1.77957103e-01\n 1.75951824e-01 2.04119515e-02 -2.51293294e-02 6.09228492e-01\n 1.76121652e-01 -9.89417672e-01 1.56229988e-01 8.17388535e-01\n 3.19528311e-01 8.82995069e-01]\n [-4.15034384e-01 -1.14319921e-02 6.69555426e-01 -1.91472995e+00\n 4.13903087e-01 -9.78388369e-01 -1.09302545e+00 -4.16571617e-01\n 1.84482977e-01 1.27483746e-02 -3.17255640e-03 2.03571603e-01\n -5.41523159e-01 -1.14097305e-01 1.58125281e-01 1.20112654e-02\n -1.20831421e-02 3.22547495e-01]]"
|
41 |
+
},
|
42 |
+
"_last_episode_starts": {
|
43 |
+
":type:": "<class 'numpy.ndarray'>",
|
44 |
+
":serialized:": "gAWVgwAAAAAAAACMEm51bXB5LmNvcmUubnVtZXJpY5SMC19mcm9tYnVmZmVylJOUKJYQAAAAAAAAAAEBAQEBAQEBAQEBAQEBAQGUjAVudW1weZSMBWR0eXBllJOUjAJiMZSJiIeUUpQoSwOMAXyUTk5OSv////9K/////0sAdJRiSxCFlIwBQ5R0lFKULg=="
|
45 |
+
},
|
46 |
+
"_last_original_obs": {
|
47 |
+
":type:": "<class 'collections.OrderedDict'>",
|
48 |
+
":serialized:": "gAWV+wYAAAAAAACMC2NvbGxlY3Rpb25zlIwLT3JkZXJlZERpY3SUk5QpUpQojA1hY2hpZXZlZF9nb2FslIwSbnVtcHkuY29yZS5udW1lcmljlIwLX2Zyb21idWZmZXKUk5QolsAAAAAAAAAAzNoePpeDsj10bnE8BCiKPUbkDT6El3U8zaHePfTv0D2oS7I8qpOxvP8L1z0t02s896MZPsai+D2/c2o8eGbLPYyHnL1TSms8lcScPcxxe72PwvU8tA/Eu5P8/717mHU83FQMPVpTaj2tyWw8lBrivYOHjb2+SW889yKmPpOoVT07/3A8OgwVPji+dr1DGXI8joWdPnttvbwks3I8tiCMPgId7z0b43I8ZU8+Pgguf7wUu2s8NeoNvqAhg70IlHU8lIwFbnVtcHmUjAVkdHlwZZSTlIwCZjSUiYiHlFKUKEsDjAE8lE5OTkr/////Sv////9LAHSUYksQSwOGlIwBQ5R0lFKUjAxkZXNpcmVkX2dvYWyUaAcolsAAAAAAAAAADA/yPpzSED6PwvU8UaDFPjgemL2PwvU89gPDPpy4LL2PwvU8pJ7APglUBb6PwvU8vXuiPmcPNr2PwvU81Y3APl8MHz2PwvU8yETUPnHhrr2PwvU863HePmHFZb2PwvU8xefBPh406TyPwvU88g6ZPnjTGD6PwvU8RcL5Pmhwgj2PwvU8oXfsPnVOG72PwvU8uKbWPttzhjyPwvU8U98IP6PFyj2PwvU8Sw+BPkBVrL2PwvU8kvPlPt7vEb6PwvU8lGgOSxBLA4aUaBJ0lFKUjAtvYnNlcnZhdGlvbpRoByiWgAQAAAAAAADUceQ9ose7PXMrmTvALwU+0v8BP8tz0LzM2h4+l4OyPXRucTwdHjW7cKioux8X87xRybY+0BmWPU+ygL0uakTAmm8JwAdhnL9oaqA8s6I5PjUSvzyZ6E++67FhPgULhr8EKIo9RuQNPoSXdTxhVVC4ELiYN4CqKTpDxpK2tMsNuPJZaDcpahI76FeMuUAN1ra07JY9cdjLPPQsPj4zSmM+t7CUP1xwnr/Nod499O/QPahLsjwAAAAAAAAAgAAAAAAAAAAAAAAAAKG3yL4AAAAAAAAAAAAAAAA4XIK96KEZPhqQnzsxEXS9WRGNvr2ymDyqk7G8/wvXPS3Tazwtvbi6NIjdurs/I76pKog+AhWavo3guLpKGYW8VPO5v+sJCcAN+s49O3RGPo5/qzzR5Ug+2gp4P7OcJD/3oxk+xqL4Pb9zajyRBRC7hqmLOkLohDylPc8+8ZNrvo7pdjuXkam/8W8vv94J3z2S5To9sXWwvTZLnTudV5o+kiQvP6C4HL14Zss9jIecvVNKazzodse6Dzuiuhazj70RpIU975OAPoMuCblRt6u/YaqCPr+C+r7qch09GWwarEMjSj4AAAAAAAAAgAAAAACVxJw9zHF7vY/C9TwAAAAAAAAAgAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA3XDS9wE80vioQiztWLsg9JxatOyjnZ7q0D8S7k/z/vXuYdTx4qA82kNRmtVzd+jk5kSQ1ihO7NUxBuDXE3sK4gGwrOHdahC7IWwy93Db7PVUeTDtWZME+nJI0Poa6FLzcVAw9WlNqPa3JbDyiJZK7rOQau+qWtz1i7Xo/t1XWvTk0ALrJWvC/eYiSP4ULHT/qzhS+pkb0vWVjwDvYPi27zbmQPIw0mDyUGuK9g4eNvb5JbzzyTRq6SHwkO4An6b1s+nE9p0YFPmYMNbwExq0/NywzP5Dwo74OXNg8mFBmvODIYzvp/OU9j4aRPrNIYTv3IqY+k6hVPTv/cDwlQiu8+pESvI8mnb/awoI/x3iPPSx/qjqbsyHA5JshwMQwPcCT2qQ8boPCvbqtyjvCZE08HFzKvmAjqTw6DBU+OL52vUMZcjwZAos6j7qCvM3Prr7I9Yo/4r+aPdXCqrpxSnbAwb0uPk5TFsAqM3A9/RWDvdYWuDs62ia9NjVAvM7+8zqOhZ0+e229vCSzcjz/RmA8D7gmPCnZFb/ovn0/WDlYPvi0/rrG3zo/nc2AQHU/EsCMxFE9U6cvPlxS7zw8UYk+opQCv/lDRz+2IIw+Ah3vPRvjcjyVOmm6+AySPHPUMb5mgnk/24TkvD3dTzqoVnjAp6LBP768Rr+mqhY+EUBBPg9wwTsEYoG94IYcP2aMmDtlTz4+CC5/vBS7azycjQs7GEv6uh+fND72fIQ+d404vleVqbmahMM/zaMiPwTp7T5MRcG9IisGPDeLBj68ky6/vJEtPta3Ar816g2+oCGDvQiUdTztNJ62L+5QNoMjFzrQIMA3IM4yOFmTsDd8YGi7PUchO1mGuzmUaA5LEEsShpRoEnSUUpR1Lg==",
|
49 |
+
"achieved_goal": "[[ 0.15513152 0.08716505 0.01473581]\n [ 0.06745914 0.1385661 0.01498974]\n [ 0.10870705 0.10202017 0.02176459]\n [-0.02167686 0.10500335 0.01439361]\n [ 0.15003954 0.12140422 0.01430982]\n [ 0.09931654 -0.07643041 0.01436098]\n [ 0.07654683 -0.06138782 0.03 ]\n [-0.00598332 -0.12499347 0.01498997]\n [ 0.03426062 0.0572084 0.01445238]\n [-0.11040226 -0.06910612 0.01460498]\n [ 0.3244855 0.05216272 0.01470929]\n [ 0.14555445 -0.06024 0.01477653]\n [ 0.30765957 -0.02312349 0.01481322]\n [ 0.27368706 0.11675455 0.01482465]\n [ 0.18584974 -0.01557494 0.01438786]\n [-0.13858874 -0.06402898 0.01498891]]",
|
50 |
+
"desired_goal": "[[ 0.47277105 0.14142841 0.03 ]\n [ 0.38598874 -0.07427639 0.03 ]\n [ 0.3808896 -0.04216824 0.03 ]\n [ 0.37621033 -0.13020338 0.03 ]\n [ 0.3173503 -0.04444828 0.03 ]\n [ 0.3760821 0.03883016 0.03 ]\n [ 0.41458726 -0.08539093 0.03 ]\n [ 0.43446288 -0.05609644 0.03 ]\n [ 0.3787214 0.02846723 0.03 ]\n [ 0.29894215 0.14924419 0.03 ]\n [ 0.48781028 0.06369096 0.03 ]\n [ 0.4618502 -0.03791662 0.03 ]\n [ 0.4192407 0.01641267 0.03 ]\n [ 0.53465766 0.09900977 0.03 ]\n [ 0.2520698 -0.08414698 0.03 ]\n [ 0.44912392 -0.14251658 0.03 ]]",
|
51 |
+
"observation": "[[ 1.11545235e-01 9.16893631e-02 4.67436900e-03 1.30064964e-01\n 5.07809758e-01 -2.54458394e-02 1.55131519e-01 8.71650502e-02\n 1.47358067e-02 -2.76363571e-03 -5.14703244e-03 -2.96741109e-02\n 3.57004672e-01 7.32914209e-02 -6.28400967e-02 -3.06898069e+00\n -2.14743662e+00 -1.22171104e+00]\n [ 1.95819885e-02 1.81284711e-01 2.33241115e-02 -2.03035727e-01\n 2.20405266e-01 -1.04721129e+00 6.74591362e-02 1.38566107e-01\n 1.49897374e-02 -4.96705798e-05 1.82055228e-05 6.47224486e-04\n -4.37421977e-06 -3.38067330e-05 1.38492196e-05 2.23411084e-03\n -2.67683761e-04 -6.37923949e-06]\n [ 7.36936629e-02 2.48834807e-02 1.85718358e-01 2.21962735e-01\n 1.16164291e+00 -1.23780394e+00 1.08707048e-01 1.02020174e-01\n 2.17645913e-02 0.00000000e+00 -0.00000000e+00 0.00000000e+00\n 0.00000000e+00 0.00000000e+00 -3.92025977e-01 0.00000000e+00\n 0.00000000e+00 0.00000000e+00]\n [-6.36524558e-02 1.50031686e-01 4.86947317e-03 -5.95867075e-02\n -2.75522977e-01 1.86399166e-02 -2.16768570e-02 1.05003349e-01\n 1.43936099e-02 -1.40944647e-03 -1.69015536e-03 -1.59422800e-01\n 2.65950471e-01 -3.00941527e-01 -1.41050073e-03 -1.62474103e-02\n -1.45273829e+00 -2.14123034e+00]\n [ 1.01062872e-01 1.93802759e-01 2.09348463e-02 1.96189180e-01\n 9.68915582e-01 6.43016040e-01 1.50039539e-01 1.21404216e-01\n 1.43098226e-02 -2.19759741e-03 1.06553803e-03 1.62240304e-02\n 4.04767185e-01 -2.30056539e-01 3.76758305e-03 -1.32475555e+00\n -6.85301840e-01 1.08905539e-01]\n [ 4.56290916e-02 -8.61619785e-02 4.80022561e-03 3.01449686e-01\n 6.84151769e-01 -3.82620096e-02 9.93165374e-02 -7.64304101e-02\n 1.43609820e-02 -1.52179319e-03 -1.23772200e-03 -7.01657981e-02\n 6.52543381e-02 2.51128644e-01 -1.30826651e-04 -1.34153187e+00\n 2.55206138e-01 -4.89278764e-01]\n [ 3.84396687e-02 -2.19447225e-12 1.97400138e-01 0.00000000e+00\n -0.00000000e+00 0.00000000e+00 7.65468255e-02 -6.13878220e-02\n 2.99999993e-02 0.00000000e+00 -0.00000000e+00 0.00000000e+00\n 0.00000000e+00 0.00000000e+00 0.00000000e+00 0.00000000e+00\n 0.00000000e+00 0.00000000e+00]\n [-4.40332554e-02 -1.76085472e-01 4.24387027e-03 9.77446288e-02\n 5.28218178e-03 -8.84639565e-04 -5.98331727e-03 -1.24993466e-01\n 1.49899675e-02 2.14067222e-06 -8.59909960e-07 4.78486414e-04\n 6.13060877e-07 1.39382723e-06 1.37280722e-06 -9.29213420e-05\n 4.08706255e-05 6.01873482e-11]\n [-3.42672169e-02 1.22663230e-01 3.11460090e-03 3.77718627e-01\n 1.76340520e-01 -9.07767378e-03 3.42606157e-02 5.72083965e-02\n 1.44523801e-02 -4.46005259e-03 -2.36348342e-03 8.96433145e-02\n 9.80184674e-01 -1.04655676e-01 -4.89059428e-04 -1.87777054e+00\n 1.14478981e+00 6.13457024e-01]\n [-1.45320565e-01 -1.19275376e-01 5.87122375e-03 -2.64351629e-03\n 1.76667217e-02 1.85797438e-02 -1.10402256e-01 -6.91061243e-02\n 1.46049839e-02 -5.88624855e-04 2.50984915e-03 -1.13844872e-01\n 5.90767115e-02 1.30152330e-01 -1.10503193e-02 1.35760546e+00\n 6.99893415e-01 -3.20194721e-01]\n [ 2.64110826e-02 -1.40573010e-02 3.47571820e-03 1.12298794e-01\n 2.84229726e-01 3.43756075e-03 3.24485511e-01 5.21627180e-02\n 1.47092892e-02 -1.04527818e-02 -8.94593634e-03 -1.22773921e+00\n 1.02157140e+00 7.00545833e-02 1.30078709e-03 -2.52658725e+00\n -2.52513981e+00 -2.95610142e+00]\n [ 2.01237556e-02 -9.49772447e-02 6.18526060e-03 1.25362296e-02\n -3.95233989e-01 2.06467509e-02 1.45554453e-01 -6.02400005e-02\n 1.47765307e-02 1.06054835e-03 -1.59580987e-02 -3.41429144e-01\n 1.08562565e+00 7.55612999e-02 -1.30280352e-03 -3.84829354e+00\n 1.70645729e-01 -2.34883451e+00]\n [ 5.86425439e-02 -6.40067831e-02 5.61795663e-03 -4.07354608e-02\n -1.17314365e-02 1.86153664e-03 3.07659566e-01 -2.31234934e-02\n 1.48132183e-02 1.36888018e-02 1.01757189e-02 -5.85344851e-01\n 9.91194248e-01 2.11156249e-01 -1.94325950e-03 7.29977012e-01\n 4.02509928e+00 -2.28512311e+00]\n [ 5.12128323e-02 1.71536729e-01 2.92140767e-02 2.68197894e-01\n -5.10080457e-01 7.78380930e-01 2.73687065e-01 1.16754547e-01\n 1.48246540e-02 -8.89697403e-04 1.78284496e-02 -1.73661992e-01\n 9.74645972e-01 -2.78953817e-02 7.92939041e-04 -3.88028908e+00\n 1.51277626e+00 -7.76317477e-01]\n [ 1.47135347e-01 1.88720956e-01 5.90325100e-03 -6.31752312e-02\n 6.11433029e-01 4.65540867e-03 1.85849741e-01 -1.55749395e-02\n 1.43878646e-02 2.12941226e-03 -1.90958660e-03 1.76388249e-01\n 2.58765876e-01 -1.80227146e-01 -3.23454587e-04 1.52748418e+00\n 6.35311902e-01 4.64668393e-01]\n [-9.43704545e-02 8.18899460e-03 1.31390437e-01 -6.81941748e-01\n 1.69501245e-01 -5.10617614e-01 -1.38588741e-01 -6.40289783e-02\n 1.49889067e-02 -4.71492831e-06 3.11330564e-06 5.76548453e-04\n 2.29034631e-05 4.26304759e-05 2.10494491e-05 -3.54578998e-03\n 2.46091117e-03 3.57675162e-04]]"
|
52 |
+
},
|
53 |
+
"_episode_num": 93648,
|
54 |
+
"use_sde": false,
|
55 |
+
"sde_sample_freq": -1,
|
56 |
+
"_current_progress_remaining": 0.3990688,
|
57 |
+
"_stats_window_size": 100,
|
58 |
+
"ep_info_buffer": {
|
59 |
+
":type:": "<class 'collections.deque'>",
|
60 |
+
":serialized:": "gAWV4AsAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKUKH2UKIwBcpRHwCoAAAAAAACMAWyUSw6MAXSUR0DBNwlu76HkdX2UKGgGR8A6AAAAAAAAaAdLG2gIR0DBNv77ZWaMdX2UKGgGR8AxAAAAAAAAaAdLEmgIR0DBNxuJiy6ddX2UKGgGR8AwAAAAAAAAaAdLEWgIR0DBN3wMKCxvdX2UKGgGR8AiAAAAAAAAaAdLCmgIR0DBN1fjQzDXdX2UKGgGR8A9AAAAAAAAaAdLHmgIR0DBN7wHmig1dX2UKGgGR8A1AAAAAAAAaAdLFmgIR0DBNz6q+8GtdX2UKGgGR8A1AAAAAAAAaAdLFmgIR0DBN+i6vq1PdX2UKGgGR8BBAAAAAAAAaAdLI2gIR0DBN9NEd/8VdX2UKGgGR8A2AAAAAAAAaAdLF2gIR0DBNyFoBaLXdX2UKGgGR8A4AAAAAAAAaAdLGWgIR0DBN/3L7oB8dX2UKGgGR8AmAAAAAAAAaAdLDGgIR0DBN0gAEMb4dX2UKGgGR8BCgAAAAAAAaAdLJmgIR0DBN0Ob1AZ9dX2UKGgGR8AiAAAAAAAAaAdLCmgIR0DBN31x82JjdX2UKGgGR8AqAAAAAAAAaAdLDmgIR0DBN2+AI6bOdX2UKGgGR8AuAAAAAAAAaAdLEGgIR0DBN9qhakhzdX2UKGgGR8A8AAAAAAAAaAdLHWgIR0DBN52lyimEdX2UKGgGR8AwAAAAAAAAaAdLEWgIR0DBN70v/R3NdX2UKGgGR8BAAAAAAAAAaAdLIWgIR0DBN1N1uBMBdX2UKGgGR8A2AAAAAAAAaAdLF2gIR0DBN5Ate2NOdX2UKGgGR8AzAAAAAAAAaAdLFGgIR0DBODQJ1JUYdX2UKGgGR8AsAAAAAAAAaAdLD2gIR0DBN533rUsndX2UKGgGR8AqAAAAAAAAaAdLDmgIR0DBN5Lo8p1BdX2UKGgGR8A8AAAAAAAAaAdLHWgIR0DBN40NDtw8dX2UKGgGR8AwAAAAAAAAaAdLEWgIR0DBOF3rpqyodX2UKGgGR8AzAAAAAAAAaAdLFGgIR0DBOF0qlP8AdX2UKGgGR8A2AAAAAAAAaAdLF2gIR0DBOFd81Gb1dX2UKGgGR8BIgAAAAAAAaAdLMmgIR0DBOFMpd8iOdX2UKGgGR8A1AAAAAAAAaAdLFmgIR0DBN/q/O+qSdX2UKGgGR8AxAAAAAAAAaAdLEmgIR0DBOCQD9wWFdX2UKGgGR8A0AAAAAAAAaAdLFWgIR0DBOBXSH/LldX2UKGgGR8AuAAAAAAAAaAdLEGgIR0DBN+uJzkp7dX2UKGgGR8A0AAAAAAAAaAdLFWgIR0DBN8msA/9pdX2UKGgGR8AuAAAAAAAAaAdLEGgIR0DBOLq2fChwdX2UKGgGR8AzAAAAAAAAaAdLFGgIR0DBOKV6kZaWdX2UKGgGR8AwAAAAAAAAaAdLEWgIR0DBN+9ke6qbdX2UKGgGR8AwAAAAAAAAaAdLEWgIR0DBOL499tuUdX2UKGgGR8A/AAAAAAAAaAdLIGgIR0DBOCeGKyfMdX2UKGgGR8AqAAAAAAAAaAdLDmgIR0DBOKSgqVhTdX2UKGgGR8A9AAAAAAAAaAdLHmgIR0DBOIf40uUVdX2UKGgGR8A3AAAAAAAAaAdLGGgIR0DBOCtlwtJ4dX2UKGgGR8AsAAAAAAAAaAdLD2gIR0DBOHAFkhA4dX2UKGgGR8A1AAAAAAAAaAdLFmgIR0DBONjQPZqVdX2UKGgGR8AkAAAAAAAAaAdLC2gIR0DBOOeuaF23dX2UKGgGR8BAgAAAAAAAaAdLImgIR0DBOFeFBY3edX2UKGgGR8AoAAAAAAAAaAdLDWgIR0DBOD8UTL4fdX2UKGgGR8AkAAAAAAAAaAdLC2gIR0DBOOdTDO1OdX2UKGgGR8AwAAAAAAAAaAdLEWgIR0DBOFR55Z8sdX2UKGgGR8BJAAAAAAAAaAdLMmgIR0DBOEQV0tAcdX2UKGgGR8A5AAAAAAAAaAdLGmgIR0DBOL3fj0cwdX2UKGgGR8A8AAAAAAAAaAdLHWgIR0DBOKQ+EAYIdX2UKGgGR8A3AAAAAAAAaAdLGGgIR0DBOUbnDBM0dX2UKGgGR8AkAAAAAAAAaAdLC2gIR0DBOHtTLns+dX2UKGgGR8A5AAAAAAAAaAdLGmgIR0DBOGSM3qA0dX2UKGgGR8A2AAAAAAAAaAdLF2gIR0DBOULWiDdydX2UKGgGR8A5AAAAAAAAaAdLGmgIR0DBOLyVQhwEdX2UKGgGR8AwAAAAAAAAaAdLEWgIR0DBOLhA4XGfdX2UKGgGR8A2AAAAAAAAaAdLF2gIR0DBOK3HPu5SdX2UKGgGR8A0AAAAAAAAaAdLFWgIR0DBOOhHy3CsdX2UKGgGR8AzAAAAAAAAaAdLFGgIR0DBOVpJAdGRdX2UKGgGR8AuAAAAAAAAaAdLEGgIR0DBOK7KLbYcdX2UKGgGR8AxAAAAAAAAaAdLEmgIR0DBOKyRB/qgdX2UKGgGR8AYAAAAAAAAaAdLB2gIR0DBONin+AEudX2UKGgGR8AmAAAAAAAAaAdLDGgIR0DBOMKHwgDBdX2UKGgGR8AwAAAAAAAAaAdLEWgIR0DBOSAGQjlgdX2UKGgGR8AuAAAAAAAAaAdLEGgIR0DBOaV1+y7gdX2UKGgGR8A+AAAAAAAAaAdLH2gIR0DBOYvA9FF2dX2UKGgGR8A7AAAAAAAAaAdLHGgIR0DBOYdEPUaydX2UKGgGR8A4AAAAAAAAaAdLGWgIR0DBOPh5VwPzdX2UKGgGR8BHAAAAAAAAaAdLL2gIR0DBOZ0nogV5dX2UKGgGR8A8AAAAAAAAaAdLHWgIR0DBOU/I2fkFdX2UKGgGR8A5AAAAAAAAaAdLGmgIR0DBOd9MVUModX2UKGgGR8A2AAAAAAAAaAdLF2gIR0DBOUg2Ifr9dX2UKGgGR8AzAAAAAAAAaAdLFGgIR0DBOSf8MuvmdX2UKGgGR8AwAAAAAAAAaAdLEWgIR0DBORIo3JgcdX2UKGgGR8AoAAAAAAAAaAdLDWgIR0DBOfa5d4VzdX2UKGgGR8A2AAAAAAAAaAdLF2gIR0DBOebhUBGQdX2UKGgGR8A2AAAAAAAAaAdLF2gIR0DBOamXb/OudX2UKGgGR8A/AAAAAAAAaAdLIGgIR0DBOXeearmydX2UKGgGR8AzAAAAAAAAaAdLFGgIR0DBOf9JJ5E/dX2UKGgGR8A5AAAAAAAAaAdLGmgIR0DBOVw2MsH0dX2UKGgGR8A6AAAAAAAAaAdLG2gIR0DBOXfrpqyodX2UKGgGR8AiAAAAAAAAaAdLCmgIR0DBOh6JbdJrdX2UKGgGR8AsAAAAAAAAaAdLD2gIR0DBOjRfICEIdX2UKGgGR8AuAAAAAAAAaAdLEGgIR0DBOaOb3Gn5dX2UKGgGR8AgAAAAAAAAaAdLCWgIR0DBOaqBVdX1dX2UKGgGR8AyAAAAAAAAaAdLE2gIR0DBOZTThHbzdX2UKGgGR8AyAAAAAAAAaAdLE2gIR0DBOX78UEgXdX2UKGgGR8A5AAAAAAAAaAdLGmgIR0DBOi/Qtz0ZdX2UKGgGR8A5AAAAAAAAaAdLGmgIR0DBOeJw4sErdX2UKGgGR8AqAAAAAAAAaAdLDmgIR0DBOcSu2Zy/dX2UKGgGR8AsAAAAAAAAaAdLD2gIR0DBOa6m/FisdX2UKGgGR8A4AAAAAAAAaAdLGWgIR0DBOn/NcGC7dX2UKGgGR8AuAAAAAAAAaAdLEGgIR0DBOlc384xUdX2UKGgGR8BCgAAAAAAAaAdLJmgIR0DBOmiVD8cddX2UKGgGR8BJAAAAAAAAaAdLMmgIR0DBOgnLidaudX2UKGgGR8AuAAAAAAAAaAdLEGgIR0DBOo+0zCUHdX2UKGgGR8BAAAAAAAAAaAdLIWgIR0DBObZ8c+7ldX2UKGgGR8A1AAAAAAAAaAdLFmgIR0DBOpjteD3/dX2UKGgGR8A6AAAAAAAAaAdLG2gIR0DBOj+ivgWKdWUu"
|
61 |
+
},
|
62 |
+
"ep_success_buffer": {
|
63 |
+
":type:": "<class 'collections.deque'>",
|
64 |
+
":serialized:": "gAWVhgAAAAAAAACMC2NvbGxlY3Rpb25zlIwFZGVxdWWUk5QpS2SGlFKUKIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiImIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiIiYiIiIhlLg=="
|
65 |
+
},
|
66 |
+
"_n_updates": 187785,
|
67 |
+
"buffer_size": 1000000,
|
68 |
+
"batch_size": 2048,
|
69 |
+
"learning_starts": 100,
|
70 |
+
"tau": 0.005,
|
71 |
+
"gamma": 0.99,
|
72 |
+
"gradient_steps": 1,
|
73 |
+
"optimize_memory_usage": false,
|
74 |
+
"replay_buffer_class": {
|
75 |
+
":type:": "<class 'abc.ABCMeta'>",
|
76 |
+
":serialized:": "gAWVOQAAAAAAAACMIHN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbi5idWZmZXJzlIwQRGljdFJlcGxheUJ1ZmZlcpSTlC4=",
|
77 |
+
"__module__": "stable_baselines3.common.buffers",
|
78 |
+
"__annotations__": "{'observation_space': <class 'gymnasium.spaces.dict.Dict'>, 'obs_shape': typing.Dict[str, typing.Tuple[int, ...]], 'observations': typing.Dict[str, numpy.ndarray], 'next_observations': typing.Dict[str, numpy.ndarray]}",
|
79 |
+
"__doc__": "\n Dict Replay buffer used in off-policy algorithms like SAC/TD3.\n Extends the ReplayBuffer to use dictionary observations\n\n :param buffer_size: Max number of element in the buffer\n :param observation_space: Observation space\n :param action_space: Action space\n :param device: PyTorch device\n :param n_envs: Number of parallel environments\n :param optimize_memory_usage: Enable a memory efficient variant\n Disabled for now (see https://github.com/DLR-RM/stable-baselines3/pull/243#discussion_r531535702)\n :param handle_timeout_termination: Handle timeout termination (due to timelimit)\n separately and treat the task as infinite horizon task.\n https://github.com/DLR-RM/stable-baselines3/issues/284\n ",
|
80 |
+
"__init__": "<function DictReplayBuffer.__init__ at 0x7ff631b03e20>",
|
81 |
+
"add": "<function DictReplayBuffer.add at 0x7ff631b03eb0>",
|
82 |
+
"sample": "<function DictReplayBuffer.sample at 0x7ff631b03f40>",
|
83 |
+
"_get_samples": "<function DictReplayBuffer._get_samples at 0x7ff631b28040>",
|
84 |
+
"__abstractmethods__": "frozenset()",
|
85 |
+
"_abc_impl": "<_abc._abc_data object at 0x7ff631b1d3c0>"
|
86 |
+
},
|
87 |
+
"replay_buffer_kwargs": {},
|
88 |
+
"train_freq": {
|
89 |
+
":type:": "<class 'stable_baselines3.common.type_aliases.TrainFreq'>",
|
90 |
+
":serialized:": "gAWVYQAAAAAAAACMJXN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbi50eXBlX2FsaWFzZXOUjAlUcmFpbkZyZXGUk5RLAWgAjBJUcmFpbkZyZXF1ZW5jeVVuaXSUk5SMBHN0ZXCUhZRSlIaUgZQu"
|
91 |
+
},
|
92 |
+
"use_sde_at_warmup": false,
|
93 |
+
"target_entropy": -3.0,
|
94 |
+
"ent_coef": "auto",
|
95 |
+
"target_update_interval": 1,
|
96 |
+
"observation_space": {
|
97 |
+
":type:": "<class 'gymnasium.spaces.dict.Dict'>",
|
98 |
+
":serialized:": "gAWVKAQAAAAAAACMFWd5bW5hc2l1bS5zcGFjZXMuZGljdJSMBERpY3SUk5QpgZR9lCiMBnNwYWNlc5SMC2NvbGxlY3Rpb25zlIwLT3JkZXJlZERpY3SUk5QpUpQojA1hY2hpZXZlZF9nb2FslIwUZ3ltbmFzaXVtLnNwYWNlcy5ib3iUjANCb3iUk5QpgZR9lCiMBWR0eXBllIwFbnVtcHmUjAVkdHlwZZSTlIwCZjSUiYiHlFKUKEsDjAE8lE5OTkr/////Sv////9LAHSUYowNYm91bmRlZF9iZWxvd5SMEm51bXB5LmNvcmUubnVtZXJpY5SMC19mcm9tYnVmZmVylJOUKJYDAAAAAAAAAAEBAZRoE4wCYjGUiYiHlFKUKEsDjAF8lE5OTkr/////Sv////9LAHSUYksDhZSMAUOUdJRSlIwNYm91bmRlZF9hYm92ZZRoHCiWAwAAAAAAAAABAQGUaCBLA4WUaCR0lFKUjAZfc2hhcGWUSwOFlIwDbG93lGgcKJYMAAAAAAAAAAAAIMEAACDBAAAgwZRoFksDhZRoJHSUUpSMBGhpZ2iUaBwolgwAAAAAAAAAAAAgQQAAIEEAACBBlGgWSwOFlGgkdJRSlIwIbG93X3JlcHKUjAUtMTAuMJSMCWhpZ2hfcmVwcpSMBDEwLjCUjApfbnBfcmFuZG9tlE51YowMZGVzaXJlZF9nb2FslGgNKYGUfZQoaBBoFmgZaBwolgMAAAAAAAAAAQEBlGggSwOFlGgkdJRSlGgnaBwolgMAAAAAAAAAAQEBlGggSwOFlGgkdJRSlGgsSwOFlGguaBwolgwAAAAAAAAAAAAgwQAAIMEAACDBlGgWSwOFlGgkdJRSlGgzaBwolgwAAAAAAAAAAAAgQQAAIEEAACBBlGgWSwOFlGgkdJRSlGg4jAUtMTAuMJRoOowEMTAuMJRoPE51YowLb2JzZXJ2YXRpb26UaA0pgZR9lChoEGgWaBloHCiWEgAAAAAAAAABAQEBAQEBAQEBAQEBAQEBAQGUaCBLEoWUaCR0lFKUaCdoHCiWEgAAAAAAAAABAQEBAQEBAQEBAQEBAQEBAQGUaCBLEoWUaCR0lFKUaCxLEoWUaC5oHCiWSAAAAAAAAAAAACDBAAAgwQAAIMEAACDBAAAgwQAAIMEAACDBAAAgwQAAIMEAACDBAAAgwQAAIMEAACDBAAAgwQAAIMEAACDBAAAgwQAAIMGUaBZLEoWUaCR0lFKUaDNoHCiWSAAAAAAAAAAAACBBAAAgQQAAIEEAACBBAAAgQQAAIEEAACBBAAAgQQAAIEEAACBBAAAgQQAAIEEAACBBAAAgQQAAIEEAACBBAAAgQQAAIEGUaBZLEoWUaCR0lFKUaDiMBS0xMC4wlGg6jAQxMC4wlGg8TnVidWgsTmgQTmg8TnViLg==",
|
99 |
+
"spaces": "OrderedDict([('achieved_goal', Box(-10.0, 10.0, (3,), float32)), ('desired_goal', Box(-10.0, 10.0, (3,), float32)), ('observation', Box(-10.0, 10.0, (18,), float32))])",
|
100 |
+
"_shape": null,
|
101 |
+
"dtype": null,
|
102 |
+
"_np_random": null
|
103 |
+
},
|
104 |
+
"action_space": {
|
105 |
+
":type:": "<class 'gymnasium.spaces.box.Box'>",
|
106 |
+
":serialized:": "gAWVYAIAAAAAAACMFGd5bW5hc2l1bS5zcGFjZXMuYm94lIwDQm94lJOUKYGUfZQojAVkdHlwZZSMBW51bXB5lIwFZHR5cGWUk5SMAmY0lImIh5RSlChLA4wBPJROTk5K/////0r/////SwB0lGKMDWJvdW5kZWRfYmVsb3eUjBJudW1weS5jb3JlLm51bWVyaWOUjAtfZnJvbWJ1ZmZlcpSTlCiWAwAAAAAAAAABAQGUaAiMAmIxlImIh5RSlChLA4wBfJROTk5K/////0r/////SwB0lGJLA4WUjAFDlHSUUpSMDWJvdW5kZWRfYWJvdmWUaBEolgMAAAAAAAAAAQEBlGgVSwOFlGgZdJRSlIwGX3NoYXBllEsDhZSMA2xvd5RoESiWDAAAAAAAAAAAAIC/AACAvwAAgL+UaAtLA4WUaBl0lFKUjARoaWdolGgRKJYMAAAAAAAAAAAAgD8AAIA/AACAP5RoC0sDhZRoGXSUUpSMCGxvd19yZXBylIwELTEuMJSMCWhpZ2hfcmVwcpSMAzEuMJSMCl9ucF9yYW5kb22UjBRudW1weS5yYW5kb20uX3BpY2tsZZSMEF9fZ2VuZXJhdG9yX2N0b3KUk5SMBVBDRzY0lGgyjBRfX2JpdF9nZW5lcmF0b3JfY3RvcpSTlIaUUpR9lCiMDWJpdF9nZW5lcmF0b3KUjAVQQ0c2NJSMBXN0YXRllH2UKGg9ihBl3wcOaCf4tNHk4wFpO1pOjANpbmOUihBHNkRLhAhbywqUoI0S9+11dYwKaGFzX3VpbnQzMpRLAIwIdWludGVnZXKUSwB1YnViLg==",
|
107 |
+
"dtype": "float32",
|
108 |
+
"bounded_below": "[ True True True]",
|
109 |
+
"bounded_above": "[ True True True]",
|
110 |
+
"_shape": [
|
111 |
+
3
|
112 |
+
],
|
113 |
+
"low": "[-1. -1. -1.]",
|
114 |
+
"high": "[1. 1. 1.]",
|
115 |
+
"low_repr": "-1.0",
|
116 |
+
"high_repr": "1.0",
|
117 |
+
"_np_random": "Generator(PCG64)"
|
118 |
+
},
|
119 |
+
"n_envs": 16,
|
120 |
+
"lr_schedule": {
|
121 |
+
":type:": "<class 'function'>",
|
122 |
+
":serialized:": "gAWVsQMAAAAAAACMF2Nsb3VkcGlja2xlLmNsb3VkcGlja2xllIwOX21ha2VfZnVuY3Rpb26Uk5QoaACMDV9idWlsdGluX3R5cGWUk5SMCENvZGVUeXBllIWUUpQoSwFLAEsASwFLA0sTQwx0AIgAfACDAYMBUwCUToWUjAVmbG9hdJSFlIwScHJvZ3Jlc3NfcmVtYWluaW5nlIWUjGAvaG9tZS90b21lay9weXRvcmNoX2xlYXJuaW5nL3ZlbnYvbGliL3B5dGhvbjMuMTAvc2l0ZS1wYWNrYWdlcy9zdGFibGVfYmFzZWxpbmVzMy9jb21tb24vdXRpbHMucHmUjAg8bGFtYmRhPpRLYUMCDACUjA52YWx1ZV9zY2hlZHVsZZSFlCl0lFKUfZQojAtfX3BhY2thZ2VfX5SMGHN0YWJsZV9iYXNlbGluZXMzLmNvbW1vbpSMCF9fbmFtZV9flIwec3RhYmxlX2Jhc2VsaW5lczMuY29tbW9uLnV0aWxzlIwIX19maWxlX1+UjGAvaG9tZS90b21lay9weXRvcmNoX2xlYXJuaW5nL3ZlbnYvbGliL3B5dGhvbjMuMTAvc2l0ZS1wYWNrYWdlcy9zdGFibGVfYmFzZWxpbmVzMy9jb21tb24vdXRpbHMucHmUdU5OaACMEF9tYWtlX2VtcHR5X2NlbGyUk5QpUpSFlHSUUpRoAIwSX2Z1bmN0aW9uX3NldHN0YXRllJOUaCF9lH2UKGgYaA+MDF9fcXVhbG5hbWVfX5SMIWdldF9zY2hlZHVsZV9mbi48bG9jYWxzPi48bGFtYmRhPpSMD19fYW5ub3RhdGlvbnNfX5R9lIwOX19rd2RlZmF1bHRzX1+UTowMX19kZWZhdWx0c19flE6MCl9fbW9kdWxlX1+UaBmMB19fZG9jX1+UTowLX19jbG9zdXJlX1+UaACMCl9tYWtlX2NlbGyUk5RoAihoByhLAUsASwBLAUsBSxNDBIgAUwCUaAkpjAFflIWUaA6MBGZ1bmOUS4VDAgQBlIwDdmFslIWUKXSUUpRoFU5OaB0pUpSFlHSUUpRoI2g9fZR9lChoGGg0aCaMGWNvbnN0YW50X2ZuLjxsb2NhbHM+LmZ1bmOUaCh9lGgqTmgrTmgsaBloLU5oLmgwRz8zqSowVTJhhZRSlIWUjBdfY2xvdWRwaWNrbGVfc3VibW9kdWxlc5RdlIwLX19nbG9iYWxzX1+UfZR1hpSGUjCFlFKUhZRoRV2UaEd9lHWGlIZSMC4="
|
123 |
+
},
|
124 |
+
"batch_norm_stats": [],
|
125 |
+
"batch_norm_stats_target": []
|
126 |
+
}
|
sac-PandaSlide-v3/ent_coef_optimizer.pth
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:4b527abe45812060228910b92f166466321b84c460c6576426bbf32200087152
|
3 |
+
size 1940
|
sac-PandaSlide-v3/policy.pth
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:3a3a80901dbd889028c342bc71c5cad83ec482f0efa1ff9dfb2bef631f790e6c
|
3 |
+
size 10822808
|
sac-PandaSlide-v3/pytorch_variables.pth
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:e1609c474b8039b8210d2c9e005675dd4887aec6efbb81d66cb87a4a532fbfe1
|
3 |
+
size 1180
|
sac-PandaSlide-v3/system_info.txt
ADDED
@@ -0,0 +1,9 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
- OS: Linux-5.15.146.1-microsoft-standard-WSL2-x86_64-with-glibc2.35 # 1 SMP Thu Jan 11 04:09:03 UTC 2024
|
2 |
+
- Python: 3.10.12
|
3 |
+
- Stable-Baselines3: 2.3.2
|
4 |
+
- PyTorch: 2.3.0+cu121
|
5 |
+
- GPU Enabled: True
|
6 |
+
- Numpy: 1.26.4
|
7 |
+
- Cloudpickle: 3.0.0
|
8 |
+
- Gymnasium: 0.29.1
|
9 |
+
- OpenAI Gym: 0.26.2
|
vec_normalize.pkl
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:d1c0d736d30d2ff44de953df32e2dff10ec0a2933f7219a780eafdc4b30111bf
|
3 |
+
size 3276
|