End of training
README.md
CHANGED
@@ -7,7 +7,7 @@ tags:
 datasets:
 - Aivesa/dataset_34ad7f9d-ff58-4068-8537-2d15a40438a9
 model-index:
-- name:
+- name: 1931e729-dd3f-4026-a118-7fef2d3afe59
 results: []
 ---
 
@@ -47,7 +47,7 @@ fsdp_config: null
 gradient_accumulation_steps: 4
 gradient_checkpointing: false
 group_by_length: false
-hub_model_id: Aivesa/
+hub_model_id: Aivesa/1931e729-dd3f-4026-a118-7fef2d3afe59
 hub_private_repo: true
 hub_repo: null
 hub_strategy: checkpoint
@@ -101,7 +101,7 @@ xformers_attention: null
 
 </details><br>
 
-#
+# 1931e729-dd3f-4026-a118-7fef2d3afe59
 
 This model is a fine-tuned version of [HuggingFaceH4/tiny-random-LlamaForCausalLM](https://huggingface.co/HuggingFaceH4/tiny-random-LlamaForCausalLM) on the Aivesa/dataset_34ad7f9d-ff58-4068-8537-2d15a40438a9 dataset.
 It achieves the following results on the evaluation set:
@@ -139,7 +139,7 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 10.
+| 10.3773 | 0.0013 | 3 | 10.3774 |
 | 10.3789 | 0.0026 | 6 | 10.3773 |
 | 10.3786 | 0.0038 | 9 | 10.3772 |
 