Sarathbabu-Karunanithi
/

SQLgpt2

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Sarathbabu-Karunanithi commited on Dec 11, 2024

Commit

cd224de

·

verified ·

1 Parent(s): 6d1cd6a

End of training

Files changed (3) hide show

README.md +7 -5
model.safetensors +1 -1
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.6103
 ## Model description
@@ -42,13 +42,15 @@ The following hyperparameters were used during training:
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
-- num_epochs: 1
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss |
-|:-------------:|:-----:|:----:|:---------------:|
-| 0.6319        | 1.0   | 2000 | 0.6103          |
 ### Framework versions

 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.4502
 ## Model description
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
+- num_epochs: 3
 ### Training results
+| Training Loss | Epoch | Step  | Validation Loss |
+|:-------------:|:-----:|:-----:|:---------------:|
+| 0.5039        | 1.0   | 15716 | 0.4827          |
+| 0.4513        | 2.0   | 31432 | 0.4577          |
+| 0.4199        | 3.0   | 47148 | 0.4502          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:bd4e5008d659e46d465e5518d8b587823f38716f94a5326983738cf8e1325eb1
 size 497774208

 version https://git-lfs.github.com/spec/v1
+oid sha256:985f74977b23d5a88b8ea2e2dfd2788b88e274740a09171dabe3b3765d2c2ad6
 size 497774208

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:79c3e01869a8c41dd1f069cee2ebd858d159ec519657b222c7aaa4d6b00930f8
 size 5176

 version https://git-lfs.github.com/spec/v1
+oid sha256:1a5315dd3ce9437b46e710afd68613c90b0e6599cd1fcbed87115cf70a778d35
 size 5176