avidoavid committed on
Commit e1da659 · 1 Parent(s): c1af145

update model card README.md

Files changed (1)
  1. README.md +9 -12
README.md CHANGED
@@ -5,7 +5,6 @@ tags:
 model-index:
 - name: RWKV-1b5-finetuned-overfit
 results: []
-library_name: peft
 ---
 
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -15,7 +14,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [RWKV/rwkv-raven-1b5](https://huggingface.co/RWKV/rwkv-raven-1b5) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 67.8669
+- Loss: 68.7560
 
 ## Model description
 
@@ -46,20 +45,18 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 1.6836 | 1.0 | 1 | 1.4181 |
-| 1.5358 | 2.0 | 2 | 1.4378 |
-| 0.7673 | 3.0 | 3 | 7.2988 |
-| 0.3153 | 4.0 | 4 | 28.9362 |
-| 0.0964 | 5.0 | 5 | 60.8725 |
-| 0.0242 | 6.0 | 6 | 68.6885 |
-| 0.0119 | 7.0 | 7 | 67.8234 |
-| 0.0088 | 8.0 | 8 | 67.8669 |
+| 1.6836 | 1.0 | 1 | 1.4341 |
+| 1.5494 | 2.0 | 2 | 1.7198 |
+| 0.7595 | 3.0 | 3 | 9.1981 |
+| 0.3142 | 4.0 | 4 | 35.6430 |
+| 0.1007 | 5.0 | 5 | 68.5554 |
+| 0.0256 | 6.0 | 6 | 69.8436 |
+| 0.0119 | 7.0 | 7 | 69.2797 |
+| 0.0082 | 8.0 | 8 | 68.7560 |
 
 
 ### Framework versions
 
-- PEFT 0.4.0
-- PEFT 0.4.0
 - Transformers 4.31.0
 - Pytorch 2.0.1+cu117
 - Datasets 2.13.1