meoo225
/

mT5_base

@@ -1,7 +1,7 @@
 ---
 library_name: transformers
 license: apache-2.0
-base_model: google/mt5-small
 tags:
 - generated_from_trainer
 metrics:
@@ -17,14 +17,14 @@ should probably proofread and complete it, then remove this comment. -->
 # mT5_base
-This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3417
-- Bleu Score: 47.0526
-- Precision: 17.2043
-- Recall: 17.2043
-- Gen Len: 16.8315
-- Err: 17.2043
 ## Model description
@@ -49,16 +49,15 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 4
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Bleu Score | Precision | Recall  | Gen Len | Err     |
 |:-------------:|:-----:|:----:|:---------------:|:----------:|:---------:|:-------:|:-------:|:-------:|
-| 2.798         | 1.0   | 838  | 0.5495          | 41.8683    | 7.7658    | 7.7658  | 16.7766 | 7.7658  |
-| 0.7216        | 2.0   | 1676 | 0.4311          | 44.9002    | 13.0227   | 13.0227 | 16.8148 | 13.0227 |
-| 0.5551        | 3.0   | 2514 | 0.3565          | 46.5247    | 16.0096   | 16.0096 | 16.816  | 16.0096 |
-| 0.4951        | 4.0   | 3352 | 0.3417          | 47.0526    | 17.2043   | 17.2043 | 16.8315 | 17.2043 |
 ### Framework versions

 ---
 library_name: transformers
 license: apache-2.0
+base_model: google/mt5-base
 tags:
 - generated_from_trainer
 metrics:
 # mT5_base
+This model is a fine-tuned version of [google/mt5-base](https://huggingface.co/google/mt5-base) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1703
+- Bleu Score: 51.176
+- Precision: 27.4791
+- Recall: 27.4791
+- Gen Len: 16.8805
+- Err: 27.4791
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 3
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Bleu Score | Precision | Recall  | Gen Len | Err     |
 |:-------------:|:-----:|:----:|:---------------:|:----------:|:---------:|:-------:|:-------:|:-------:|
+| 1.3269        | 1.0   | 838  | 0.2396          | 48.4521    | 20.7885   | 20.7885 | 16.8339 | 20.7885 |
+| 0.2831        | 2.0   | 1676 | 0.1861          | 50.5118    | 26.1649   | 26.1649 | 16.8781 | 26.1649 |
+| 0.2167        | 3.0   | 2514 | 0.1703          | 51.176     | 27.4791   | 27.4791 | 16.8805 | 27.4791 |
 ### Framework versions

logs/events.out.tfevents.1728145765.b035966b8ee0.4495.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:82f722805b7b2d262a80022df42133833f7caf40edd1c593fe8846ad3bac085b
-size 6684

 version https://git-lfs.github.com/spec/v1
+oid sha256:0628992bff50a5f319a9c8b6bf2c256091856babf7dd19abe6d40bf59171e75e
+size 7775