yoshidevs
/

t5-portuguese-quiz

Text2Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

yoshidevs commited on Sep 24

Commit

ddb6a16

•

1 Parent(s): d0b0614

End of training

Files changed (2) hide show

README.md +8 -13
model.safetensors +1 -1

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [unicamp-dl/ptt5-base-portuguese-vocab](https://huggingface.co/unicamp-dl/ptt5-base-portuguese-vocab) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.7499
 ## Model description
@@ -35,28 +35,23 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 5e-05
 - train_batch_size: 4
 - eval_batch_size: 4
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 10
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 7    | 38.9123         |
-| 37.0215       | 2.0   | 14   | 19.7826         |
-| 20.0237       | 3.0   | 21   | 9.3822          |
-| 20.0237       | 4.0   | 28   | 6.4034          |
-| 8.472         | 5.0   | 35   | 4.0273          |
-| 4.6509        | 6.0   | 42   | 2.0258          |
-| 4.6509        | 7.0   | 49   | 1.0862          |
-| 2.3018        | 8.0   | 56   | 0.8273          |
-| 1.5773        | 9.0   | 63   | 0.7654          |
-| 1.3641        | 10.0  | 70   | 0.7499          |
 ### Framework versions

 This model is a fine-tuned version of [unicamp-dl/ptt5-base-portuguese-vocab](https://huggingface.co/unicamp-dl/ptt5-base-portuguese-vocab) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 47.4780
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 1e-05
 - train_batch_size: 4
 - eval_batch_size: 4
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 5
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log        | 1.0   | 7    | 53.1720         |
+| 40.7453       | 2.0   | 14   | 50.8278         |
+| 38.9149       | 3.0   | 21   | 49.0209         |
+| 38.9149       | 4.0   | 28   | 47.8934         |
+| 37.3341       | 5.0   | 35   | 47.4780         |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:dbb238d6f836d79c54d71cac56bfd7123595cacbcb8b8112aa5f80555c6f2c40
 size 891644712

 version https://git-lfs.github.com/spec/v1
+oid sha256:23b328d147e497fe3437c2cbc23b5fb1a9091fba6d85cd3ae4b39ca51e0a473a
 size 891644712