Update README.md
Browse files
README.md
CHANGED
@@ -16,7 +16,6 @@ For other faster or more accurate quantization methods, please check out Eachade
|
|
16 |
|
17 |
![puffin](https://i.imgur.com/R2xTHMb.png)
|
18 |
|
19 |
-
|
20 |
## **Redmond-Puffin-13b-V1.3**
|
21 |
|
22 |
**The first commercially available language model released by Nous Research!**
|
@@ -53,11 +52,6 @@ Optional reccomended pre-prompt / system prompt:
|
|
53 |
### response: Sure! sounds good.
|
54 |
```
|
55 |
|
56 |
-
## Improvements over previous version:
|
57 |
-
|
58 |
-
The original Puffin model was loved by many, however it was quickly discovered to have dataset errors in a significant amount of the conversations.
|
59 |
-
Puffin-V1.3 dataset solves this issue and the resulting fixed model has now fully finished training!
|
60 |
-
|
61 |
## When should I use Puffin or Hermes 2?
|
62 |
|
63 |
Puffin and Hermes-2 both beat previous SOTA for GPT4ALL benchmarks, with Hermes-2 winning by a 0.1% margin over Puffin.
|
@@ -68,9 +62,17 @@ Puffin and Hermes-2 both beat previous SOTA for GPT4ALL benchmarks, with Hermes-
|
|
68 |
|
69 |
For these reasons, it's reccomended to give Puffin a try if you want to have multi-turn conversations and/or long context communication.
|
70 |
|
71 |
-
|
|
|
|
|
72 |
|
73 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
74 |
|
75 |
## Notable Features:
|
76 |
|
@@ -113,9 +115,9 @@ New Sota: Puffin - 69.9 (+1.1)
|
|
113 |
note: After release, Puffin has since had its average GPT4All score beaten by 0.1%, by Nous' very own Model Hermes-2!
|
114 |
Latest SOTA w/ Hermes 2- 70.0 (+0.1 over Puffins 69.9 score)
|
115 |
|
116 |
-
That being said, Puffin
|
117 |
|
118 |
-
Puffin also perfectly ties with Hermes in PIQA
|
119 |
|
120 |
GPT4all :
|
121 |
|
@@ -184,3 +186,4 @@ AGI Eval:
|
|
184 |
|agieval_sat_math | 0|acc |0.3364|± |0.0319|
|
185 |
| | |acc_norm|0.2773|± |0.0302|
|
186 |
```
|
|
|
|
16 |
|
17 |
![puffin](https://i.imgur.com/R2xTHMb.png)
|
18 |
|
|
|
19 |
## **Redmond-Puffin-13b-V1.3**
|
20 |
|
21 |
**The first commercially available language model released by Nous Research!**
|
|
|
52 |
### response: Sure! sounds good.
|
53 |
```
|
54 |
|
|
|
|
|
|
|
|
|
|
|
55 |
## When should I use Puffin or Hermes 2?
|
56 |
|
57 |
Puffin and Hermes-2 both beat previous SOTA for GPT4ALL benchmarks, with Hermes-2 winning by a 0.1% margin over Puffin.
|
|
|
62 |
|
63 |
For these reasons, it's reccomended to give Puffin a try if you want to have multi-turn conversations and/or long context communication.
|
64 |
|
65 |
+
## Example Outputs!:
|
66 |
+
|
67 |
+
![puffin](https://i.imgur.com/P0MsN8B.png)
|
68 |
|
69 |
+
![puffin](https://i.imgur.com/8EO3ThV.png)
|
70 |
+
|
71 |
+
![puffin](https://i.imgur.com/5IWolFw.png)
|
72 |
+
|
73 |
+
![puffin](https://i.imgur.com/TQui8m7.png)
|
74 |
+
|
75 |
+
![puffin](https://i.imgur.com/tderIfl.png)
|
76 |
|
77 |
## Notable Features:
|
78 |
|
|
|
115 |
note: After release, Puffin has since had its average GPT4All score beaten by 0.1%, by Nous' very own Model Hermes-2!
|
116 |
Latest SOTA w/ Hermes 2- 70.0 (+0.1 over Puffins 69.9 score)
|
117 |
|
118 |
+
That being said, Puffin supplants Hermes-2 for the #1 spot in Arc-E, HellaSwag and Winogrande!
|
119 |
|
120 |
+
Puffin also perfectly ties with Hermes in PIQA, however Hermes-2 still excels in much of Big Bench and AGIEval, so it's highly reccomended you give it a try as well!
|
121 |
|
122 |
GPT4all :
|
123 |
|
|
|
186 |
|agieval_sat_math | 0|acc |0.3364|± |0.0319|
|
187 |
| | |acc_norm|0.2773|± |0.0302|
|
188 |
```
|
189 |
+
|