ypl/bart_game_clean
Browse files- README.md +23 -42
- config.json +1 -1
- model.safetensors +1 -1
- training_args.bin +1 -1
README.md
CHANGED
@@ -1,4 +1,6 @@
|
|
1 |
---
|
|
|
|
|
2 |
tags:
|
3 |
- generated_from_trainer
|
4 |
model-index:
|
@@ -11,9 +13,9 @@ should probably proofread and complete it, then remove this comment. -->
|
|
11 |
|
12 |
# bart_test_p2
|
13 |
|
14 |
-
This model
|
15 |
It achieves the following results on the evaluation set:
|
16 |
-
- Loss: 0.
|
17 |
|
18 |
## Model description
|
19 |
|
@@ -42,46 +44,25 @@ The following hyperparameters were used during training:
|
|
42 |
|
43 |
### Training results
|
44 |
|
45 |
-
| Training Loss | Epoch | Step
|
46 |
-
|
47 |
-
|
|
48 |
-
| 0.
|
49 |
-
| 0.
|
50 |
-
| 0.
|
51 |
-
| 0.
|
52 |
-
| 0.
|
53 |
-
| 0.
|
54 |
-
| 0.
|
55 |
-
| 0.
|
56 |
-
| 0.
|
57 |
-
| 0.
|
58 |
-
| 0.
|
59 |
-
| 0.
|
60 |
-
| 0.
|
61 |
-
| 0.
|
62 |
-
| 0.
|
63 |
-
| 0.
|
64 |
-
| 0.0186 | 1.42 | 9000 | 0.0197 |
|
65 |
-
| 0.0199 | 1.5 | 9500 | 0.0197 |
|
66 |
-
| 0.0207 | 1.58 | 10000 | 0.0195 |
|
67 |
-
| 0.0199 | 1.66 | 10500 | 0.0196 |
|
68 |
-
| 0.0188 | 1.74 | 11000 | 0.0195 |
|
69 |
-
| 0.0194 | 1.81 | 11500 | 0.0194 |
|
70 |
-
| 0.0201 | 1.89 | 12000 | 0.0195 |
|
71 |
-
| 0.0181 | 1.97 | 12500 | 0.0194 |
|
72 |
-
| 0.0177 | 2.05 | 13000 | 0.0194 |
|
73 |
-
| 0.0161 | 2.13 | 13500 | 0.0196 |
|
74 |
-
| 0.0172 | 2.21 | 14000 | 0.0195 |
|
75 |
-
| 0.0184 | 2.29 | 14500 | 0.0195 |
|
76 |
-
| 0.0168 | 2.37 | 15000 | 0.0195 |
|
77 |
-
| 0.0176 | 2.44 | 15500 | 0.0194 |
|
78 |
-
| 0.0177 | 2.52 | 16000 | 0.0194 |
|
79 |
-
| 0.0158 | 2.6 | 16500 | 0.0194 |
|
80 |
-
| 0.0177 | 2.68 | 17000 | 0.0193 |
|
81 |
-
| 0.0179 | 2.76 | 17500 | 0.0193 |
|
82 |
-
| 0.0167 | 2.84 | 18000 | 0.0194 |
|
83 |
-
| 0.0177 | 2.92 | 18500 | 0.0193 |
|
84 |
-
| 0.0171 | 3.0 | 19000 | 0.0194 |
|
85 |
|
86 |
|
87 |
### Framework versions
|
|
|
1 |
---
|
2 |
+
license: apache-2.0
|
3 |
+
base_model: facebook/bart-base
|
4 |
tags:
|
5 |
- generated_from_trainer
|
6 |
model-index:
|
|
|
13 |
|
14 |
# bart_test_p2
|
15 |
|
16 |
+
This model is a fine-tuned version of [facebook/bart-base](https://huggingface.co/facebook/bart-base) on the None dataset.
|
17 |
It achieves the following results on the evaluation set:
|
18 |
+
- Loss: 0.0306
|
19 |
|
20 |
## Model description
|
21 |
|
|
|
44 |
|
45 |
### Training results
|
46 |
|
47 |
+
| Training Loss | Epoch | Step | Validation Loss |
|
48 |
+
|:-------------:|:-----:|:----:|:---------------:|
|
49 |
+
| 1.5336 | 0.18 | 500 | 0.1004 |
|
50 |
+
| 0.1089 | 0.35 | 1000 | 0.0755 |
|
51 |
+
| 0.0923 | 0.53 | 1500 | 0.0637 |
|
52 |
+
| 0.0747 | 0.7 | 2000 | 0.0554 |
|
53 |
+
| 0.0719 | 0.88 | 2500 | 0.0498 |
|
54 |
+
| 0.0653 | 1.05 | 3000 | 0.0470 |
|
55 |
+
| 0.0557 | 1.23 | 3500 | 0.0425 |
|
56 |
+
| 0.0505 | 1.4 | 4000 | 0.0403 |
|
57 |
+
| 0.05 | 1.58 | 4500 | 0.0378 |
|
58 |
+
| 0.0477 | 1.75 | 5000 | 0.0362 |
|
59 |
+
| 0.0451 | 1.93 | 5500 | 0.0343 |
|
60 |
+
| 0.0443 | 2.1 | 6000 | 0.0327 |
|
61 |
+
| 0.0372 | 2.28 | 6500 | 0.0326 |
|
62 |
+
| 0.0384 | 2.45 | 7000 | 0.0316 |
|
63 |
+
| 0.0371 | 2.63 | 7500 | 0.0311 |
|
64 |
+
| 0.0363 | 2.8 | 8000 | 0.0306 |
|
65 |
+
| 0.0345 | 2.98 | 8500 | 0.0306 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
66 |
|
67 |
|
68 |
### Framework versions
|
config.json
CHANGED
@@ -1,5 +1,5 @@
|
|
1 |
{
|
2 |
-
"_name_or_path": "
|
3 |
"activation_dropout": 0.1,
|
4 |
"activation_function": "gelu",
|
5 |
"add_bias_logits": false,
|
|
|
1 |
{
|
2 |
+
"_name_or_path": "facebook/bart-base",
|
3 |
"activation_dropout": 0.1,
|
4 |
"activation_function": "gelu",
|
5 |
"add_bias_logits": false,
|
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 557912620
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f2558d47436667ae1b8a797015805d813c8d89be3b9c41971ba494d8ff97ee0f
|
3 |
size 557912620
|
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 4664
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:73dae393b2d260a70f60ef338ffd292b5f213775449dfde701757ecec893be35
|
3 |
size 4664
|