LDJnr commited on
Commit
59b4b7f
·
1 Parent(s): 156b11f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +13 -10
README.md CHANGED
@@ -16,7 +16,6 @@ For other faster or more accurate quantization methods, please check out Eachade
16
 
17
  ![puffin](https://i.imgur.com/R2xTHMb.png)
18
 
19
-
20
  ## **Redmond-Puffin-13b-V1.3**
21
 
22
  **The first commercially available language model released by Nous Research!**
@@ -53,11 +52,6 @@ Optional reccomended pre-prompt / system prompt:
53
  ### response: Sure! sounds good.
54
  ```
55
 
56
- ## Improvements over previous version:
57
-
58
- The original Puffin model was loved by many, however it was quickly discovered to have dataset errors in a significant amount of the conversations.
59
- Puffin-V1.3 dataset solves this issue and the resulting fixed model has now fully finished training!
60
-
61
  ## When should I use Puffin or Hermes 2?
62
 
63
  Puffin and Hermes-2 both beat previous SOTA for GPT4ALL benchmarks, with Hermes-2 winning by a 0.1% margin over Puffin.
@@ -68,9 +62,17 @@ Puffin and Hermes-2 both beat previous SOTA for GPT4ALL benchmarks, with Hermes-
68
 
69
  For these reasons, it's reccomended to give Puffin a try if you want to have multi-turn conversations and/or long context communication.
70
 
71
- That being said, it's important to note that the commonly referenced benchmarks are all single-turn tests, and despite this, Puffin reaches within 0.1% of the Hermes-2 GPT4All average score.
 
 
72
 
73
- Puffin also beats Hermes-2 for the #1 spot in Arc-E, Hella swag and Winogrande! as well as perfectly tying with Hermes-2 in PIQA for the exact score of 80.69 (PIQA is a single-turn benchmark for common-sense reasoning of the physical world)
 
 
 
 
 
 
74
 
75
  ## Notable Features:
76
 
@@ -113,9 +115,9 @@ New Sota: Puffin - 69.9 (+1.1)
113
  note: After release, Puffin has since had its average GPT4All score beaten by 0.1%, by Nous' very own Model Hermes-2!
114
  Latest SOTA w/ Hermes 2- 70.0 (+0.1 over Puffins 69.9 score)
115
 
116
- That being said, Puffin still ends up supplanting even Hermes-2 for the #1 spot in Arc-E, HellaSwag and Winogrande!
117
 
118
- Puffin also perfectly ties with Hermes in PIQA.
119
 
120
  GPT4all :
121
 
@@ -184,3 +186,4 @@ AGI Eval:
184
  |agieval_sat_math | 0|acc |0.3364|± |0.0319|
185
  | | |acc_norm|0.2773|± |0.0302|
186
  ```
 
 
16
 
17
  ![puffin](https://i.imgur.com/R2xTHMb.png)
18
 
 
19
  ## **Redmond-Puffin-13b-V1.3**
20
 
21
  **The first commercially available language model released by Nous Research!**
 
52
  ### response: Sure! sounds good.
53
  ```
54
 
 
 
 
 
 
55
  ## When should I use Puffin or Hermes 2?
56
 
57
  Puffin and Hermes-2 both beat previous SOTA for GPT4ALL benchmarks, with Hermes-2 winning by a 0.1% margin over Puffin.
 
62
 
63
  For these reasons, it's reccomended to give Puffin a try if you want to have multi-turn conversations and/or long context communication.
64
 
65
+ ## Example Outputs!:
66
+
67
+ ![puffin](https://i.imgur.com/P0MsN8B.png)
68
 
69
+ ![puffin](https://i.imgur.com/8EO3ThV.png)
70
+
71
+ ![puffin](https://i.imgur.com/5IWolFw.png)
72
+
73
+ ![puffin](https://i.imgur.com/TQui8m7.png)
74
+
75
+ ![puffin](https://i.imgur.com/tderIfl.png)
76
 
77
  ## Notable Features:
78
 
 
115
  note: After release, Puffin has since had its average GPT4All score beaten by 0.1%, by Nous' very own Model Hermes-2!
116
  Latest SOTA w/ Hermes 2- 70.0 (+0.1 over Puffins 69.9 score)
117
 
118
+ That being said, Puffin supplants Hermes-2 for the #1 spot in Arc-E, HellaSwag and Winogrande!
119
 
120
+ Puffin also perfectly ties with Hermes in PIQA, however Hermes-2 still excels in much of Big Bench and AGIEval, so it's highly reccomended you give it a try as well!
121
 
122
  GPT4all :
123
 
 
186
  |agieval_sat_math | 0|acc |0.3364|± |0.0319|
187
  | | |acc_norm|0.2773|± |0.0302|
188
  ```
189
+