Add README
Browse files
README.md
CHANGED
@@ -1,3 +1,50 @@
|
|
1 |
---
|
2 |
-
license:
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
+
license: llama2
|
3 |
+
language:
|
4 |
+
- en
|
5 |
+
pipeline_tag: text-generation
|
6 |
+
inference: false
|
7 |
+
tags:
|
8 |
+
- dare
|
9 |
+
- super mario merge
|
10 |
+
- pytorch
|
11 |
+
- llama
|
12 |
+
- llama-2
|
13 |
---
|
14 |
+
|
15 |
+
# llama 13b mega merge v2
|
16 |
+
|
17 |
+
The following were merged with DARE using [https://github.com/martyn/safetensors-merge-supermario](https://github.com/martyn/safetensors-merge-supermario)
|
18 |
+
|
19 |
+
## Mergelist
|
20 |
+
|
21 |
+
```
|
22 |
+
meta-llama/Llama-2-13b-hf
|
23 |
+
ajibawa-2023/Code-13B
|
24 |
+
ajibawa-2023/Python-Code-13B
|
25 |
+
meta-math/MetaMath-13B-V1.0
|
26 |
+
rombodawg/LosslessMegaCoder-llama2-13b-mini
|
27 |
+
NousResearch/Nous-Hermes-Llama2-13b
|
28 |
+
allenai/digital-socrates-13b
|
29 |
+
migtissera/Synthia-13B
|
30 |
+
Gryphe/MythoLogic-L2-13b
|
31 |
+
allenai/tulu-2-dpo-13b
|
32 |
+
FPHam/Free_Sydney_13b_HF
|
33 |
+
FPHam/Free_Sydney_V2_13b_HF
|
34 |
+
FPHam/Sydney_Overthinker_13b_HF
|
35 |
+
KoboldAI/LLaMA2-13B-Psyfighter2
|
36 |
+
Undi95/Unholy-v1-12L-13B
|
37 |
+
athirdpath/Eileithyia-13B
|
38 |
+
athirdpath/Orca-2-13b-Alpaca-Uncensored
|
39 |
+
```
|
40 |
+
|
41 |
+
## Merge command
|
42 |
+
|
43 |
+
```
|
44 |
+
python hf_merge.py mergelist.txt 13b-merge-v2 -p 0.11 -lambda 2.1
|
45 |
+
```
|
46 |
+
|
47 |
+
## Notes
|
48 |
+
|
49 |
+
* seems to generalize instruct styles
|
50 |
+
* `p` and `lambda` are still a guess
|