tiagoblima
/

t5_base-qg-ap-nopeft

Text2Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

tiagoblima commited on Jan 15

Commit

bd7bcda

•

1 Parent(s): b85ab75

Model save

Files changed (2) hide show

README.md +13 -13
model.safetensors +1 -1

README.md CHANGED Viewed

@@ -3,8 +3,6 @@ license: mit
 base_model: unicamp-dl/ptt5-base-t5-vocab
 tags:
 - generated_from_trainer
-datasets:
-- tiagoblima/qg_squad_v1_pt
 model-index:
 - name: t5_base-qg-ap-nopeft
   results: []
@@ -15,9 +13,9 @@ should probably proofread and complete it, then remove this comment. -->
 # t5_base-qg-ap-nopeft
-This model is a fine-tuned version of [unicamp-dl/ptt5-base-t5-vocab](https://huggingface.co/unicamp-dl/ptt5-base-t5-vocab) on the tiagoblima/qg_squad_v1_pt dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.2067
 ## Model description
@@ -37,27 +35,29 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 0.0001
-- train_batch_size: 64
 - eval_batch_size: 4
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-06
 - lr_scheduler_type: linear
 - num_epochs: 5.0
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss |
-|:-------------:|:-----:|:----:|:---------------:|
-| 1.2123        | 1.0   | 808  | 1.2496          |
-| 1.1329        | 2.0   | 1616 | 1.2207          |
-| 1.0819        | 3.0   | 2424 | 1.2097          |
-| 1.0447        | 4.0   | 3232 | 1.2067          |
-| 1.0244        | 5.0   | 4040 | 1.2074          |
 ### Framework versions
 - Transformers 4.35.2
-- Pytorch 2.1.0+cu121
 - Datasets 2.15.0
 - Tokenizers 0.15.0

 base_model: unicamp-dl/ptt5-base-t5-vocab
 tags:
 - generated_from_trainer
 model-index:
 - name: t5_base-qg-ap-nopeft
   results: []
 # t5_base-qg-ap-nopeft
+This model is a fine-tuned version of [unicamp-dl/ptt5-base-t5-vocab](https://huggingface.co/unicamp-dl/ptt5-base-t5-vocab) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.2074
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 0.0001
+- train_batch_size: 4
 - eval_batch_size: 4
 - seed: 42
+- gradient_accumulation_steps: 4
+- total_train_batch_size: 16
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-06
 - lr_scheduler_type: linear
 - num_epochs: 5.0
 ### Training results
+| Training Loss | Epoch | Step  | Validation Loss |
+|:-------------:|:-----:|:-----:|:---------------:|
+| 1.1634        | 1.0   | 3231  | 1.2246          |
+| 1.0645        | 2.0   | 6463  | 1.2035          |
+| 0.991         | 3.0   | 9694  | 1.1980          |
+| 0.9459        | 4.0   | 12926 | 1.2027          |
+| 0.9191        | 5.0   | 16155 | 1.2074          |
 ### Framework versions
 - Transformers 4.35.2
+- Pytorch 2.0.0
 - Datasets 2.15.0
 - Tokenizers 0.15.0

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d01a0fb9e2585093fb82d0b8a53d8f01396f35cbd937fbd4b107c36c48c3a435
 size 891644712

 version https://git-lfs.github.com/spec/v1
+oid sha256:cf97f88034783bdca91b6175860096363329991cc6731c624e81579a669b69c5
 size 891644712