---
license: other
license_name: yi-license
license_link: https://huggingface.co/01-ai/Yi-34B/blob/main/LICENSE
language:
- en
library_name: transformers
pipeline_tag: text-generation
tags:
- merge
---

# Obsolete, succeeded by a new merge: **https://huggingface.co/brucethemoose/CaPlatTessDolXaBoros-Yi-34B-200K-DARE-Ties-HighDensity**

***

**NousResearch/Nous-Capybara-34B** and **migtissera/Tess-M-Creative-v1.0**, TIES-merged with mergekit.

For local inference with 40K+ context in 24GB, I suggest one of the exllama versions:

https://huggingface.co/brucethemoose/Capybara-Tess-Yi-34B-200K-exl2-4bpw-fiction

https://huggingface.co/brucethemoose/Capybara-Tess-Yi-34B-200K-exl2-31bpw-fiction

Merged with the following config:

```
models:
  - model: /home/alpha/Storage/Models/Raw/larryvrh_Yi-34B-200K-Llamafied
    # no parameters necessary for base model
  - model: /home/alpha/Storage/Models/Raw/migtissera_Tess-M-v1.0
    parameters:
      density: 0.6
      weight: 1.0
  - model: /home/alpha/Storage/Models/Raw/Nous-Capybara-34B
    parameters:
      density: 0.6
      weight: 1.0
merge_method: ties
base_model: /home/alpha/Storage/Models/Raw/larryvrh_Yi-34B-200K-Llamafied
parameters:
  normalize: true
  int8_mask: true
dtype: float16
```
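To give a rough feel for what `density`, `weight`, and `normalize` do in the config above, here is a toy sketch of a TIES-style merge on flat Python lists. It is an illustration under simplifying assumptions, not mergekit's actual implementation: the function names `trim` and `ties_merge` are hypothetical, the "models" are tiny lists rather than tensors, and `int8_mask` and `dtype` are ignored.

```python
def trim(delta, density):
    """Keep the top `density` fraction of entries by magnitude; zero the rest."""
    k = int(round(density * len(delta)))
    if k <= 0:
        return [0.0] * len(delta)
    order = sorted(range(len(delta)), key=lambda i: abs(delta[i]), reverse=True)
    keep = set(order[:k])
    return [d if i in keep else 0.0 for i, d in enumerate(delta)]


def ties_merge(base, finetuned, densities, weights, normalize=True):
    """Toy TIES-style merge of several fine-tuned weight lists onto a base."""
    n = len(base)
    # 1. Task vectors: what each fine-tune changed relative to the base.
    deltas = [[f[i] - base[i] for i in range(n)] for f in finetuned]
    # 2. Trim each task vector to its largest-magnitude entries, then weight it.
    weighted = [
        [w * v for v in trim(d, dens)]
        for d, dens, w in zip(deltas, densities, weights)
    ]
    merged = []
    for i in range(n):
        column = [wv[i] for wv in weighted]
        # 3. Elect a sign per parameter from the summed weighted deltas.
        sign = 1.0 if sum(column) >= 0 else -1.0
        # 4. Combine only the entries that agree with the elected sign.
        agreeing = [(v, w) for v, w in zip(column, weights) if v * sign > 0]
        update = sum(v for v, _ in agreeing)
        if normalize and agreeing:
            update /= sum(w for _, w in agreeing)
        merged.append(base[i] + update)
    return merged


base = [0.0, 0.0, 0.0, 0.0, 0.0]
model_a = [1.0, -2.0, 0.1, 3.0, 0.0]
model_b = [1.0, 2.5, -0.2, -1.0, 0.0]
result = ties_merge(base, [model_a, model_b], densities=[0.6, 0.6], weights=[1.0, 1.0])
print(result)
```

With `density: 0.6`, each model keeps only its three largest changes out of five. Where the two models disagree on direction, the sign with the larger summed magnitude wins and the other model's change is dropped for that parameter.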

Both are 200K context models with Vicuna syntax, so: 

# Prompt Format:

```
SYSTEM: ...
USER: ...
ASSISTANT: ...
```
Like Capybara, the model sometimes "spells out" the stop token as the literal text `</s>`, so you may need to add `</s>` as an additional stopping condition.
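As a minimal sketch of the prompt format and the extra stopping condition, the helpers below build a Vicuna-style prompt and cut generated text at a literal `</s>`. The names `build_prompt` and `truncate_at_stop` are hypothetical, and joining turns with a single newline is an assumption based on the layout shown above. Most inference frontends also accept a custom stop string directly, which is usually the simpler option.

```python
def build_prompt(system, history, user_message):
    """Build a Vicuna-style prompt.

    `history` is a list of (user, assistant) pairs from earlier turns.
    Turns are joined with single newlines (an assumption, see above).
    """
    lines = [f"SYSTEM: {system}"]
    for user, assistant in history:
        lines.append(f"USER: {user}")
        lines.append(f"ASSISTANT: {assistant}")
    lines.append(f"USER: {user_message}")
    lines.append("ASSISTANT:")  # left open for the model to complete
    return "\n".join(lines)


def truncate_at_stop(text, stop_strings=("</s>",)):
    """Cut generated text at the earliest occurrence of any stop string."""
    cut = len(text)
    for stop in stop_strings:
        index = text.find(stop)
        if index != -1:
            cut = min(cut, index)
    return text[:cut].rstrip()


prompt = build_prompt("You are a helpful assistant.", [], "Write a haiku.")
raw_output = "Autumn wind rises.</s>USER: another one"
clean_output = truncate_at_stop(raw_output)
```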

***

Credits:

https://github.com/cg123/mergekit

https://huggingface.co/NousResearch/Nous-Capybara-34B/discussions

https://huggingface.co/migtissera/Tess-M-Creative-v1.0

https://huggingface.co/larryvrh/Yi-34B-200K-Llamafied

https://huggingface.co/01-ai/Yi-34B-200K