Tohrumi committed on
Commit
c510177
1 Parent(s): 19acb93

#1: First attempt

Files changed (1)
  1. README.md +17 -2
README.md CHANGED
@@ -2,9 +2,10 @@
  license: apache-2.0
  library_name: peft
  tags:
- - unsloth
  - trl
  - sft
+ - unsloth
+ - translation
  - generated_from_trainer
  base_model: unsloth/mistral-7b-bnb-4bit
  model-index:
@@ -17,7 +18,9 @@ should probably proofread and complete it, then remove this comment. -->
 
  # MistralAI_iwslt15_10000_2
 
- This model is a fine-tuned version of [unsloth/mistral-7b-bnb-4bit](https://huggingface.co/unsloth/mistral-7b-bnb-4bit) on an unknown dataset.
+ This model is a fine-tuned version of [unsloth/mistral-7b-bnb-4bit](https://huggingface.co/unsloth/mistral-7b-bnb-4bit) on the None dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 1.0438
 
  ## Model description
 
@@ -48,6 +51,18 @@ The following hyperparameters were used during training:
  - num_epochs: 2
  - mixed_precision_training: Native AMP
 
+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss |
+ |:-------------:|:-----:|:----:|:---------------:|
+ | 1.1684 | 0.32 | 100 | 1.0926 |
+ | 1.0883 | 0.64 | 200 | 1.0701 |
+ | 1.0672 | 0.96 | 300 | 1.0498 |
+ | 0.9315 | 1.28 | 400 | 1.0547 |
+ | 0.8973 | 1.6 | 500 | 1.0495 |
+ | 0.8831 | 1.92 | 600 | 1.0438 |
+
+
  ### Framework versions
 
  - PEFT 0.10.0
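
The numbers in the Training results table added by this commit can be sanity-checked with a few lines of Python. The sketch below is not part of the commit. It copies the six table rows by hand, picks the row with the lowest validation loss, and converts that loss to perplexity. It assumes the reported loss is a mean per-token cross-entropy in nats, which the README does not state. The names `ROWS`, `best_checkpoint`, and `perplexity` are hypothetical helpers introduced for illustration.

```python
import math

# Rows copied by hand from the "Training results" table in the diff:
# (training loss, epoch, step, validation loss)
ROWS = [
    (1.1684, 0.32, 100, 1.0926),
    (1.0883, 0.64, 200, 1.0701),
    (1.0672, 0.96, 300, 1.0498),
    (0.9315, 1.28, 400, 1.0547),
    (0.8973, 1.60, 500, 1.0495),
    (0.8831, 1.92, 600, 1.0438),
]


def best_checkpoint(rows):
    """Return the row with the lowest validation loss."""
    return min(rows, key=lambda row: row[3])


def perplexity(loss):
    """Convert a loss to perplexity.

    Assumption: the loss is a mean per-token cross-entropy in nats.
    """
    return math.exp(loss)


best = best_checkpoint(ROWS)
print(f"best step: {best[2]}, validation loss: {best[3]}")
print(f"perplexity at best step: {perplexity(best[3]):.2f}")
```

Under that assumption, the last logged step (600) has the lowest validation loss, which matches the `Loss: 1.0438` line the commit adds to the model summary.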