Update README.md

lengoctuong committed
Commit • 40ac109 • 1 Parent(s): 9c9249b

README.md CHANGED
@@ -1,5 +1,6 @@
 ---
 license: mit
+base_model: gpt2
 datasets:
 - wikitext
 language:
@@ -12,7 +13,18 @@ pipeline_tag: text-generation
 tags:
 - code
 - text-generation-inference
+- generated_from_trainer
+model-index:
+- name: gpt2-finetuned-wikitext2
+  results: []
 ---
+
+# gpt2-finetuned-wikitext2
+
+This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on the wikitext dataset.
+It achieves the following results on the evaluation set:
+- Loss:
+
 # Model Card for Model ID
 
 <!-- Provide a quick summary of what the model is/does. -->
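The new intro positions gpt2-finetuned-wikitext2 as an ordinary text-generation checkpoint, so it should load through the standard transformers pipeline API. A minimal sketch follows; the repo id is an assumption inferred from the commit author and model name, not something this diff confirms:

```python
# Minimal inference sketch for the fine-tuned checkpoint.
# The repo id below is a hypothetical path inferred from the commit; adjust as needed.
from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="lengoctuong/gpt2-finetuned-wikitext2",  # assumption, not confirmed by the diff
)

# Generate a short continuation from a prompt.
result = generator("The history of natural language processing", max_new_tokens=40)
print(result[0]["generated_text"])
```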
@@ -103,6 +115,14 @@ Use the code below to get started with the model.
 
 - **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
 
+The following hyperparameters were used during training:
+- learning_rate: 5e-04
+- train_batch_size: 8
+- eval_batch_size: 8
+- optimizer: AdamW
+- lr_scheduler_type: linear
+- num_epochs: 2.0
+
 #### Speeds, Sizes, Times [optional]
 
 <!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
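The hyperparameters added in this hunk map almost one-to-one onto transformers TrainingArguments. The sketch below shows the implied configuration; only the listed values come from the card, and the output directory is an assumption:

```python
# Sketch of the training configuration implied by the card's hyperparameter list.
# Values marked "from the card" appear in the diff above; the rest are assumptions.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="gpt2-finetuned-wikitext2",  # assumed output path, not stated in the card
    learning_rate=5e-4,                     # from the card
    per_device_train_batch_size=8,          # from the card (train_batch_size: 8)
    per_device_eval_batch_size=8,           # from the card (eval_batch_size: 8)
    optim="adamw_torch",                    # from the card (optimizer: AdamW)
    lr_scheduler_type="linear",             # from the card
    num_train_epochs=2.0,                   # from the card
)
```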
@@ -135,7 +155,13 @@ Use the code below to get started with the model.
 
 ### Results
 
--
+| Epoch | Step | Validation Loss |
+|:-----:|:----:|:---------------:|
+| 1.0   | 1000 | 3.6487          |
+| 1.0   | 2000 | 3.6033          |
+| 2.0   | 1000 | 3.6578          |
+| 2.0   | 2000 | 3.6434          |
+
 
 #### Summary
 
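Trainer evaluation loss for a causal LM is normally mean per-token cross-entropy, so each table entry converts to perplexity as exp(loss); under that assumption the final checkpoint sits around exp(3.6434) ≈ 38.2 perplexity. A quick conversion:

```python
# Convert the reported validation losses to perplexity.
# Assumes each loss is mean per-token cross-entropy, the usual Trainer convention.
import math

for epoch, step, loss in [
    (1.0, 1000, 3.6487),
    (1.0, 2000, 3.6033),
    (2.0, 1000, 3.6578),
    (2.0, 2000, 3.6434),
]:
    print(f"epoch {epoch}, step {step}: perplexity ~ {math.exp(loss):.1f}")
```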