End of training
Browse files- README.md +12 -13
- logs/events.out.tfevents.1727531166.f41cea15f6e1.2276.2 +2 -2
- model.safetensors +1 -1
README.md
CHANGED
@@ -19,12 +19,12 @@ should probably proofread and complete it, then remove this comment. -->
|
|
19 |
|
20 |
This model is a fine-tuned version of [VietAI/vit5-base](https://huggingface.co/VietAI/vit5-base) on the None dataset.
|
21 |
It achieves the following results on the evaluation set:
|
22 |
-
- Loss: 0.
|
23 |
-
- Bleu Score:
|
24 |
-
- Precision:
|
25 |
-
- Recall:
|
26 |
-
- Gen Len: 12.
|
27 |
-
- Err:
|
28 |
|
29 |
## Model description
|
30 |
|
@@ -44,21 +44,20 @@ More information needed
|
|
44 |
|
45 |
The following hyperparameters were used during training:
|
46 |
- learning_rate: 0.0001
|
47 |
-
- train_batch_size:
|
48 |
-
- eval_batch_size:
|
49 |
- seed: 42
|
50 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
51 |
- lr_scheduler_type: linear
|
52 |
-
- num_epochs:
|
53 |
|
54 |
### Training results
|
55 |
|
56 |
| Training Loss | Epoch | Step | Validation Loss | Bleu Score | Precision | Recall | Gen Len | Err |
|
57 |
|:-------------:|:-----:|:----:|:---------------:|:----------:|:---------:|:-------:|:-------:|:-------:|
|
58 |
-
| 0.
|
59 |
-
| 0.
|
60 |
-
| 0.
|
61 |
-
| 0.0365 | 4.0 | 3352 | 0.2594 | 79.8663 | 58.4229 | 58.4229 | 12.7658 | 58.4229 |
|
62 |
|
63 |
|
64 |
### Framework versions
|
|
|
19 |
|
20 |
This model is a fine-tuned version of [VietAI/vit5-base](https://huggingface.co/VietAI/vit5-base) on the None dataset.
|
21 |
It achieves the following results on the evaluation set:
|
22 |
+
- Loss: 0.2178
|
23 |
+
- Bleu Score: 78.7261
|
24 |
+
- Precision: 54.5998
|
25 |
+
- Recall: 54.5998
|
26 |
+
- Gen Len: 12.7826
|
27 |
+
- Err: 54.5998
|
28 |
|
29 |
## Model description
|
30 |
|
|
|
44 |
|
45 |
The following hyperparameters were used during training:
|
46 |
- learning_rate: 0.0001
|
47 |
+
- train_batch_size: 16
|
48 |
+
- eval_batch_size: 16
|
49 |
- seed: 42
|
50 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
51 |
- lr_scheduler_type: linear
|
52 |
+
- num_epochs: 3
|
53 |
|
54 |
### Training results
|
55 |
|
56 |
| Training Loss | Epoch | Step | Validation Loss | Bleu Score | Precision | Recall | Gen Len | Err |
|
57 |
|:-------------:|:-----:|:----:|:---------------:|:----------:|:---------:|:-------:|:-------:|:-------:|
|
58 |
+
| 0.4635 | 1.0 | 419 | 0.2255 | 77.2678 | 49.5818 | 49.5818 | 12.7969 | 49.5818 |
|
59 |
+
| 0.166 | 2.0 | 838 | 0.2026 | 78.2851 | 53.1661 | 53.1661 | 12.8041 | 53.1661 |
|
60 |
+
| 0.0752 | 3.0 | 1257 | 0.2178 | 78.7261 | 54.5998 | 54.5998 | 12.7826 | 54.5998 |
|
|
|
61 |
|
62 |
|
63 |
### Framework versions
|
logs/events.out.tfevents.1727531166.f41cea15f6e1.2276.2
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:5d0004c85c0d06b6cb475fbed280595253055bd4f46d40c95a6d480c8dc3783a
|
3 |
+
size 7749
|
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 903834408
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:a3d56343698946c69ec8dc1217225d632d5bf35cd2f709a12aa3a7f2ae15623a
|
3 |
size 903834408
|