---
base_model:
- huihui-ai/MicroThinker-1B-Preview
- Hjgugugjhuhjggg/llama-3.2-1B-spinquant-hf
- huihui-ai/Llama-3.2-1B-Instruct-abliterated
library_name: transformers
tags:
- mergekit
- merge

---
# merge

This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

## Merge Details
### Merge Method

This model was merged with the [linear](https://arxiv.org/abs/2203.05482) merge method, with [huihui-ai/Llama-3.2-1B-Instruct-abliterated](https://huggingface.co/huihui-ai/Llama-3.2-1B-Instruct-abliterated) as the base.
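As a rough illustration of what a linear merge does, the sketch below takes a weighted average of matching parameters across models. It is not mergekit's implementation: `linear_merge`, the flat-list `Tensor` stand-in, and the toy parameter name `w` are hypothetical names chosen for this example, and the `normalize` flag is assumed to divide each weight by the sum of all weights.

```python
from typing import Dict, List, Sequence

# Hypothetical stand-in for a real weight tensor: a flat list of floats.
Tensor = List[float]


def linear_merge(
    state_dicts: Sequence[Dict[str, Tensor]],
    weights: Sequence[float],
    normalize: bool = True,
) -> Dict[str, Tensor]:
    """Weighted average of same-named parameters (illustrative sketch only)."""
    if len(state_dicts) != len(weights):
        raise ValueError("need exactly one weight per model")

    if normalize:
        total = sum(weights)
        if total == 0:
            raise ValueError("weights sum to zero, cannot normalize")
        # Assumption: normalization rescales the weights so they sum to 1.
        scale = [w / total for w in weights]
    else:
        scale = list(weights)

    merged: Dict[str, Tensor] = {}
    for name in state_dicts[0]:
        # Walk the parameter element by element across all models.
        columns = zip(*(sd[name] for sd in state_dicts))
        merged[name] = [
            sum(s * v for s, v in zip(scale, column)) for column in columns
        ]
    return merged


if __name__ == "__main__":
    toy_models = [{"w": [0.0, 3.0]}, {"w": [3.0, 3.0]}, {"w": [6.0, 3.0]}]
    print(linear_merge(toy_models, weights=[1, 1, 1]))
```

With `normalize=False` the weights are applied as given, so the result is a weighted sum rather than an average.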

### Models Merged

The following models were included in the merge:
* [huihui-ai/MicroThinker-1B-Preview](https://huggingface.co/huihui-ai/MicroThinker-1B-Preview)
* [Hjgugugjhuhjggg/llama-3.2-1B-spinquant-hf](https://huggingface.co/Hjgugugjhuhjggg/llama-3.2-1B-spinquant-hf)

### Configuration

The following YAML configuration was used to produce this model:

```yaml
models:
  - layer_range: [0, 1]
    model: huihui-ai/Llama-3.2-1B-Instruct-abliterated
    parameters:
      weight: 1
      density: 0.9
      gamma: 0.01
      normalize: true
      int8_mask: true
      random_seed: 0
      temperature: 0.5
      top_p: 0.65
      inference: true
      max_tokens: 999999999
      stream: true
      quantization:
        - method: int8
          value: 100
        - method: int4
          value: 100

  - layer_range: [0, 1]
    model: huihui-ai/MicroThinker-1B-Preview
    parameters:
      weight: 1
      density: 0.9
      gamma: 0.01
      normalize: true
      int8_mask: true
      random_seed: 0
      temperature: 0.5
      top_p: 0.65
      inference: true
      max_tokens: 999999999
      stream: true
      quantization:
        - method: int8
          value: 100
        - method: int4
          value: 100

  - layer_range: [0, 1]
    model: Hjgugugjhuhjggg/llama-3.2-1B-spinquant-hf
    parameters:
      weight: 1
      density: 0.9
      gamma: 0.01
      normalize: true
      int8_mask: true
      random_seed: 0
      temperature: 0.5
      top_p: 0.65
      inference: true
      max_tokens: 999999999
      stream: true
      quantization:
        - method: int8
          value: 100
        - method: int4
          value: 100

merge_method: linear
base_model: huihui-ai/Llama-3.2-1B-Instruct-abliterated
weight: 1
density: 0.9
gamma: 0.01
normalize: true
int8_mask: true
random_seed: 0
temperature: 0.5
top_p: 0.65
inference: true
max_tokens: 999999999
stream: true
quantization:
- method: int8
  value: 100
- method: int4
  value: 100
parameters:
  weight: 1
  density: 0.9
  gamma: 0.01
  normalize: true
  int8_mask: true
  random_seed: 0
  temperature: 0.5
  top_p: 0.65
  inference: true
  max_tokens: 999999999
  stream: true
  quantization:
    - method: int8
      value: 100
    - method: int4
      value: 100
dtype: float16

```
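
For reference, the arithmetic implied by the `weight` and `normalize` values above can be written out as follows. This assumes that only `weight` and `normalize` enter the linear formula and that normalization divides by the sum of the weights. The symbols $\theta_{\text{abl}}$, $\theta_{\text{micro}}$, and $\theta_{\text{spin}}$ are shorthand introduced here for the parameters of the abliterated, MicroThinker, and spinquant models respectively.

```latex
\theta_{\text{merged}}
  = \frac{\sum_{i} w_i \,\theta_i}{\sum_{i} w_i}
  = \frac{1 \cdot \theta_{\text{abl}} + 1 \cdot \theta_{\text{micro}} + 1 \cdot \theta_{\text{spin}}}{1 + 1 + 1}
  = \tfrac{1}{3}\left(\theta_{\text{abl}} + \theta_{\text{micro}} + \theta_{\text{spin}}\right)
```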