Tohrumi
/

MistralAI_iwslt15_10000_2

Generated from Trainer

Model card Files Files and versions Metrics Training metrics Community

Tohrumi commited on Apr 27

Commit

c510177

•

1 Parent(s): 19acb93

#1: First attempt

Files changed (1) hide show

README.md +17 -2

README.md CHANGED Viewed

@@ -2,9 +2,10 @@
 license: apache-2.0
 library_name: peft
 tags:
-- unsloth
 - trl
 - sft
 - generated_from_trainer
 base_model: unsloth/mistral-7b-bnb-4bit
 model-index:
@@ -17,7 +18,9 @@ should probably proofread and complete it, then remove this comment. -->
 # MistralAI_iwslt15_10000_2
-This model is a fine-tuned version of [unsloth/mistral-7b-bnb-4bit](https://huggingface.co/unsloth/mistral-7b-bnb-4bit) on an unknown dataset.
 ## Model description
@@ -48,6 +51,18 @@ The following hyperparameters were used during training:
 - num_epochs: 2
 - mixed_precision_training: Native AMP
 ### Framework versions
 - PEFT 0.10.0

 license: apache-2.0
 library_name: peft
 tags:
 - trl
 - sft
+- unsloth
+- translation
 - generated_from_trainer
 base_model: unsloth/mistral-7b-bnb-4bit
 model-index:
 # MistralAI_iwslt15_10000_2
+This model is a fine-tuned version of [unsloth/mistral-7b-bnb-4bit](https://huggingface.co/unsloth/mistral-7b-bnb-4bit) on the None dataset.
+It achieves the following results on the evaluation set:
+- Loss: 1.0438
 ## Model description
 - num_epochs: 2
 - mixed_precision_training: Native AMP
+### Training results
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| 1.1684        | 0.32  | 100  | 1.0926          |
+| 1.0883        | 0.64  | 200  | 1.0701          |
+| 1.0672        | 0.96  | 300  | 1.0498          |
+| 0.9315        | 1.28  | 400  | 1.0547          |
+| 0.8973        | 1.6   | 500  | 1.0495          |
+| 0.8831        | 1.92  | 600  | 1.0438          |
 ### Framework versions
 - PEFT 0.10.0