llmixer commited on
Commit
799a20c
1 Parent(s): 27a5cfc

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +55 -0
README.md ADDED
@@ -0,0 +1,55 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: llama2
3
+ language:
4
+ - en
5
+ pipeline_tag: conversational
6
+ tags:
7
+ - frankenmerge
8
+ - merge
9
+ - 124b
10
+ ---
11
+ # BigWeave v7.1 124b
12
+
13
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/65a6db055c58475cf9e6def1/4CbbAN-X7ZWj702JrcCGH.png" width=600>
14
+
15
+ The BigWeave models aim to experimentally identify merge settings for increasing model performance. The version number merely tracks various attempts and is not a quality indicator. Only results demonstrating good performance are retained and shared.
16
+
17
+ # Prompting Format
18
+ Vicuna and Alpaca.
19
+
20
+ # Merge process
21
+ This is a merge of Xwin-LM/Xwin-LM-70B-V0.1 and Sao10K/Euryale-1.3-L2-70B. It uses the same configuration as alpindale/goliath-120b but with the ranges "fixed" since goliath omits some layers (see [this thread](https://huggingface.co/ChuckMcSneed/WinterGoliath-123b/discussions/2#65d324693ecda975d83089d3)).
22
+
23
+ Merge configuration:
24
+ ```
25
+ slices:
26
+ - sources:
27
+ - model: Xwin-LM/Xwin-LM-70B-V0.1
28
+ layer_range: [0,16]
29
+ - sources:
30
+ - model: Sao10K/Euryale-1.3-L2-70B
31
+ layer_range: [8,24]
32
+ - sources:
33
+ - model: Xwin-LM/Xwin-LM-70B-V0.1
34
+ layer_range: [16,32]
35
+ - sources:
36
+ - model: Sao10K/Euryale-1.3-L2-70B
37
+ layer_range: [24,40]
38
+ - sources:
39
+ - model: Xwin-LM/Xwin-LM-70B-V0.1
40
+ layer_range: [32,48]
41
+ - sources:
42
+ - model: Sao10K/Euryale-1.3-L2-70B
43
+ layer_range: [40,56]
44
+ - sources:
45
+ - model: Xwin-LM/Xwin-LM-70B-V0.1
46
+ layer_range: [48,64]
47
+ - sources:
48
+ - model: Sao10K/Euryale-1.3-L2-70B
49
+ layer_range: [56,72]
50
+ - sources:
51
+ - model: Xwin-LM/Xwin-LM-70B-V0.1
52
+ layer_range: [64,80]
53
+ merge_method: passthrough
54
+ dtype: float16
55
+ ```