ColleenMacklin committed on
Commit
24db40e
1 Parent(s): 917aa57

End of training

Files changed (5)
  1. README.md +7 -8
  2. config.json +1 -1
  3. generation_config.json +1 -1
  4. pytorch_model.bin +2 -2
  5. training_args.bin +2 -2
README.md CHANGED
@@ -13,9 +13,9 @@ should probably proofread and complete it, then remove this comment. -->
 
  # gpt-neo-125M-couples_therapist_full
 
- This model is a fine-tuned version of [EleutherAI/gpt-neo-125M](https://huggingface.co/EleutherAI/gpt-neo-125M) on the None dataset.
+ This model is a fine-tuned version of [EleutherAI/gpt-neo-125M](https://huggingface.co/EleutherAI/gpt-neo-125M) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 2.8731
+ - Loss: 2.9661
 
  ## Model description
 
@@ -46,14 +46,13 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:-----:|:----:|:---------------:|
- | No log | 1.0 | 330 | 2.9649 |
- | 3.0431 | 2.0 | 660 | 2.8893 |
- | 3.0431 | 3.0 | 990 | 2.8731 |
+ | No log | 1.0 | 48 | 3.1405 |
+ | No log | 2.0 | 96 | 2.9985 |
+ | No log | 3.0 | 144 | 2.9661 |
 
 
  ### Framework versions
 
- - Transformers 4.34.0
- - Pytorch 2.0.1+cu118
- - Datasets 2.14.5
+ - Transformers 4.34.1
+ - Pytorch 2.1.0+cu118
  - Tokenizers 0.14.1
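The final validation loss in the README.md diff moves from 2.8731 to 2.9661. As a rough aid to reading those numbers, the sketch below converts each loss to a perplexity. It assumes the reported loss is the mean per-token cross-entropy in nats, which the diff does not state, and the names `OLD_EVAL_LOSS`, `NEW_EVAL_LOSS`, and `perplexity` are illustrative only. The step counts per epoch also differ between the two runs (330 versus 48), so the runs may not share the same data and the comparison is indicative at best.

```python
import math

# Final-epoch validation losses copied from the README.md diff.
OLD_EVAL_LOSS = 2.8731  # parent commit 917aa57
NEW_EVAL_LOSS = 2.9661  # this commit, 24db40e


def perplexity(mean_nll_nats: float) -> float:
    """Perplexity under the ASSUMPTION that the loss is the mean
    per-token negative log-likelihood measured in nats."""
    return math.exp(mean_nll_nats)


old_ppl = perplexity(OLD_EVAL_LOSS)
new_ppl = perplexity(NEW_EVAL_LOSS)

# The ratio depends only on the loss difference: exp(new - old).
print(f"old: {old_ppl:.2f}  new: {new_ppl:.2f}  ratio: {new_ppl / old_ppl:.3f}")
```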
config.json CHANGED
@@ -48,7 +48,7 @@
  "summary_type": "cls_index",
  "summary_use_proj": true,
  "torch_dtype": "float32",
- "transformers_version": "4.34.0",
+ "transformers_version": "4.34.1",
  "use_cache": true,
  "vocab_size": 50257,
  "window_size": 256
generation_config.json CHANGED
@@ -2,5 +2,5 @@
  "_from_model_config": true,
  "bos_token_id": 50256,
  "eos_token_id": 50256,
- "transformers_version": "4.34.0"
+ "transformers_version": "4.34.1"
  }
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:c484827e662abadd51a735303783deed0949c97f86317d7d6c44aece579a0cb0
- size 500847181
+ oid sha256:7de5057836cb1690bcdef57c05405e9a7c45b248d76ef43812665327765f369c
+ size 500847626
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:2eb2f6f9a4c7e306e29705920646bdbb296c5de9e23a79532bffeffe495272f2
- size 4091
+ oid sha256:65235da15ff86e7c3fc61c8040f0f4db583456b5eebc68856b91d30959120b0f
+ size 4536
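
The two `.bin` entries above are Git LFS pointer files, not the binaries themselves: each records a spec version, a SHA-256 object id, and a byte size. The following is a minimal sketch of how a downloaded file could be checked against such a pointer. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of any library, and the sketch only handles `sha256` object ids.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: Path, pointer: dict) -> bool:
    """Return True if the file's byte size and SHA-256 match the pointer.

    Only sha256 object ids are supported in this sketch.
    """
    if pointer["algo"] != "sha256" or path.stat().st_size != pointer["size"]:
        return False
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Hash in 1 MiB chunks so large checkpoints are not read into memory at once.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest() == pointer["digest"]


# New training_args.bin pointer, copied from the diff above.
NEW_TRAINING_ARGS_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:65235da15ff86e7c3fc61c8040f0f4db583456b5eebc68856b91d30959120b0f
size 4536
"""

pointer = parse_lfs_pointer(NEW_TRAINING_ARGS_POINTER)
```

With a local copy of the file, `matches_pointer(Path("training_args.bin"), pointer)` would report whether the download corresponds to this commit; the path here is a placeholder.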