End of training
Browse files
README.md
CHANGED
@@ -16,12 +16,12 @@ should probably proofread and complete it, then remove this comment. -->
|
|
16 |
|
17 |
This model is a fine-tuned version of [facebook/mbart-large-50-many-to-many-mmt](https://huggingface.co/facebook/mbart-large-50-many-to-many-mmt) on the None dataset.
|
18 |
It achieves the following results on the evaluation set:
|
19 |
-
- Loss: 0.
|
20 |
- Rouge1: 0.0
|
21 |
- Rouge2: 0.0
|
22 |
- Rougel: 0.0
|
23 |
- Rougelsum: 0.0
|
24 |
-
- Gen Len: 6.
|
25 |
|
26 |
## Model description
|
27 |
|
@@ -53,10 +53,10 @@ The following hyperparameters were used during training:
|
|
53 |
|
54 |
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
|
55 |
|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
|
56 |
-
| No log | 1.0 | 40 |
|
57 |
-
| No log | 2.0 | 80 | 0.
|
58 |
-
| No log | 3.0 | 120 | 0.
|
59 |
-
| No log | 4.0 | 160 | 0.
|
60 |
|
61 |
|
62 |
### Framework versions
|
|
|
16 |
|
17 |
This model is a fine-tuned version of [facebook/mbart-large-50-many-to-many-mmt](https://huggingface.co/facebook/mbart-large-50-many-to-many-mmt) on the None dataset.
|
18 |
It achieves the following results on the evaluation set:
|
19 |
+
- Loss: 0.9755
|
20 |
- Rouge1: 0.0
|
21 |
- Rouge2: 0.0
|
22 |
- Rougel: 0.0
|
23 |
- Rougelsum: 0.0
|
24 |
+
- Gen Len: 6.4654
|
25 |
|
26 |
## Model description
|
27 |
|
|
|
53 |
|
54 |
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
|
55 |
|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
|
56 |
+
| No log | 1.0 | 40 | 1.0042 | 0.0 | 0.0 | 0.0 | 0.0 | 6.6101 |
|
57 |
+
| No log | 2.0 | 80 | 0.9566 | 0.0 | 0.0 | 0.0 | 0.0 | 6.3145 |
|
58 |
+
| No log | 3.0 | 120 | 0.9539 | 0.0 | 0.0 | 0.0 | 0.0 | 6.2893 |
|
59 |
+
| No log | 4.0 | 160 | 0.9755 | 0.0 | 0.0 | 0.0 | 0.0 | 6.4654 |
|
60 |
|
61 |
|
62 |
### Framework versions
|
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 2444578688
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:00aae353f311ea5ba7bf69d3c56b158b0ac1b1969028ccabd33fb1b53c562c69
|
3 |
size 2444578688
|
runs/Mar18_13-24-59_33f496c3afc6/events.out.tfevents.1710768300.33f496c3afc6.300.0
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:fffe1b670d687be18343e6cb6c807f126da4879173829328a8d37f333cbb8524
|
3 |
+
size 7897
|
tokenizer.json
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:73397494b7dbed83d0bdc990eaf128cf2397ff728c768a1d4225695383f28b62
|
3 |
+
size 17110040
|
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 5048
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:94d788b03cb5b8a100e75ddf061347763a9550870487cf0ccc572b191e6c4374
|
3 |
size 5048
|