Farjfar committed
Commit e57cf12 · verified · 1 Parent(s): c4f995b

End of training

Files changed (2)
  1. README.md +17 -15
  2. adapter_model.safetensors +1 -1
README.md CHANGED
@@ -20,11 +20,11 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [NousResearch/Llama-2-7b-hf](https://huggingface.co/NousResearch/Llama-2-7b-hf) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.1114
- - Precision: 0.2802
- - Recall: 0.2684
- - F1: 0.2742
- - Accuracy: 0.9677
+ - Loss: 0.1511
+ - Precision: 0.4375
+ - Recall: 0.4789
+ - F1: 0.4573
+ - Accuracy: 0.9752
 
  ## Model description
 
@@ -43,26 +43,28 @@ More information needed
  ### Training hyperparameters
 
  The following hyperparameters were used during training:
- - learning_rate: 0.0001
+ - learning_rate: 0.0005
  - train_batch_size: 32
  - eval_batch_size: 32
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - num_epochs: 8
+ - num_epochs: 10
 
  ### Training results
 
  | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1 | Accuracy |
  |:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
- | No log | 1.0 | 39 | 0.1589 | 0.0 | 0.0 | 0.0 | 0.9677 |
- | No log | 2.0 | 78 | 0.1386 | 0.5714 | 0.0421 | 0.0784 | 0.9682 |
- | No log | 3.0 | 117 | 0.1257 | 0.3784 | 0.0737 | 0.1233 | 0.9678 |
- | No log | 4.0 | 156 | 0.1107 | 0.2941 | 0.1842 | 0.2265 | 0.9682 |
- | No log | 5.0 | 195 | 0.1243 | 0.4651 | 0.1053 | 0.1717 | 0.9691 |
- | No log | 6.0 | 234 | 0.1112 | 0.2775 | 0.2526 | 0.2645 | 0.9679 |
- | No log | 7.0 | 273 | 0.1093 | 0.2809 | 0.2632 | 0.2717 | 0.9677 |
- | No log | 8.0 | 312 | 0.1114 | 0.2802 | 0.2684 | 0.2742 | 0.9677 |
+ | No log | 1.0 | 39 | 0.1580 | 0.0 | 0.0 | 0.0 | 0.9677 |
+ | No log | 2.0 | 78 | 0.1313 | 0.2037 | 0.1158 | 0.1477 | 0.9654 |
+ | No log | 3.0 | 117 | 0.0948 | 0.3494 | 0.1526 | 0.2125 | 0.9698 |
+ | No log | 4.0 | 156 | 0.0952 | 0.2252 | 0.3947 | 0.2868 | 0.9676 |
+ | No log | 5.0 | 195 | 0.0850 | 0.3160 | 0.3526 | 0.3333 | 0.9733 |
+ | No log | 6.0 | 234 | 0.1012 | 0.2990 | 0.4895 | 0.3713 | 0.9690 |
+ | No log | 7.0 | 273 | 0.1282 | 0.4476 | 0.4947 | 0.4700 | 0.9739 |
+ | No log | 8.0 | 312 | 0.1400 | 0.4332 | 0.4947 | 0.4619 | 0.9748 |
+ | No log | 9.0 | 351 | 0.1492 | 0.4279 | 0.5 | 0.4612 | 0.9753 |
+ | No log | 10.0 | 390 | 0.1511 | 0.4375 | 0.4789 | 0.4573 | 0.9752 |
 
 
  ### Framework versions
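
Reviewer note on the README diff: the summary block reports the last-epoch row of the results table, and in the new run that row is not the best one by validation loss or by F1. The sketch below is only a reading aid, assuming the F1 column is the usual harmonic mean of precision and recall. It recomputes F1 from the new-run rows and picks out the best epochs. The names `ROWS`, `f1_score`, `best_by_loss`, and `best_by_f1` are illustrative and do not come from the repository.

```python
# Rows copied from the new (+) side of the "Training results" table:
# (epoch, validation_loss, precision, recall, f1)
ROWS = [
    (1, 0.1580, 0.0, 0.0, 0.0),
    (2, 0.1313, 0.2037, 0.1158, 0.1477),
    (3, 0.0948, 0.3494, 0.1526, 0.2125),
    (4, 0.0952, 0.2252, 0.3947, 0.2868),
    (5, 0.0850, 0.3160, 0.3526, 0.3333),
    (6, 0.1012, 0.2990, 0.4895, 0.3713),
    (7, 0.1282, 0.4476, 0.4947, 0.4700),
    (8, 0.1400, 0.4332, 0.4947, 0.4619),
    (9, 0.1492, 0.4279, 0.5, 0.4612),
    (10, 0.1511, 0.4375, 0.4789, 0.4573),
]


def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (0.0 when both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Largest difference between the reported F1 and the recomputed one.
# Small gaps are expected because the table values are rounded.
max_f1_gap = max(abs(f1_score(p, r) - f1) for _, _, p, r, f1 in ROWS)

# Epochs that a "keep the best checkpoint" policy would select.
best_by_loss = min(ROWS, key=lambda row: row[1])
best_by_f1 = max(ROWS, key=lambda row: row[4])

print(f"max |reported F1 - recomputed F1| = {max_f1_gap:.5f}")
print(f"lowest validation loss: epoch {best_by_loss[0]} ({best_by_loss[1]})")
print(f"highest F1: epoch {best_by_f1[0]} ({best_by_f1[4]})")
```

On these rows the lowest validation loss is at epoch 5 (0.0850) and the highest F1 at epoch 7 (0.4700), while the README summary reflects epoch 10. Validation loss rises from epoch 5 onward while F1 levels off, which may indicate overfitting. The diff does not say which checkpoint the uploaded adapter corresponds to.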
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:0e85fb71562e15de881456ac110cc3503fab687ddef21cf9d2be1f822a57b56b
+ oid sha256:f6d473daaf5c931f63e0d3c77f9e0198cf03254947ebafb1f9324d0a2bab3a5f
  size 33596518
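
Reviewer note on the `adapter_model.safetensors` diff: what changed here is a Git LFS pointer file, not the weights themselves. Only the `oid` differs, and the size stays at 33596518 bytes, which is what you would expect from retraining an adapter of the same shape. As a minimal sketch, assuming the standard three-field pointer layout shown in the diff, a downloaded file could be checked against the new pointer as follows. The helpers `parse_lfs_pointer` and `matches_pointer` are hypothetical names, not part of any Git LFS or Hugging Face API.

```python
import hashlib

# New pointer contents, copied from the (+) side of the diff.
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:f6d473daaf5c931f63e0d3c77f9e0198cf03254947ebafb1f9324d0a2bab3a5f
size 33596518
"""


def parse_lfs_pointer(text: str) -> dict:
    """Parse 'key value' lines of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data: bytes, pointer: dict) -> bool:
    """Return True if the bytes have the size and SHA-256 the pointer records."""
    if pointer["algo"] != "sha256":
        raise ValueError(f"unsupported hash algorithm: {pointer['algo']}")
    return (
        len(data) == pointer["size"]
        and hashlib.sha256(data).hexdigest() == pointer["digest"]
    )


pointer = parse_lfs_pointer(NEW_POINTER)
print(pointer["algo"], pointer["size"])
```

To check a real download, read the file in binary mode and pass its bytes to `matches_pointer`. For larger files, feeding `hashlib.sha256` in chunks would avoid loading everything into memory at once.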