Azazelle
/

L3-Hecate-8B-v1.1

@@ -5,41 +5,86 @@ library_name: transformers
 tags:
 - mergekit
 - merge
 ---
-# GP26jJg
 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
-## Merge Details
-### Merge Method
-This model was merged using the [task arithmetic](https://arxiv.org/abs/2212.04089) merge method using [NousResearch/Meta-Llama-3-8B-Instruct](https://huggingface.co/NousResearch/Meta-Llama-3-8B-Instruct) as a base.
-### Models Merged
-The following models were included in the merge:
-* output/hq_rp
 ### Configuration
 The following YAML configuration was used to produce this model:
 ```yaml
 base_model: NousResearch/Meta-Llama-3-8B-Instruct
 dtype: float32
-merge_method: task_arithmetic
-parameters:
-  normalize: 0.0
-slices:
-- sources:
-  - layer_range: [0, 32]
-    model: output/hq_rp
     parameters:
       weight:
-      - filter: mlp
-        value: 1.25
-      - value: 1.1
-  - layer_range: [0, 32]
-    model: NousResearch/Meta-Llama-3-8B-Instruct
 ```

 tags:
 - mergekit
 - merge
+- llama
+- conversational
+license: llama3
 ---
+# L3-Hecate-8B-v1.0
+![Hecate](https://huggingface.co/Azazelle/L3-Hecate-8B-v1.0/resolve/main/IhBchsAoR4ao0D2C2AEKuw.jpg)
+## About:
 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
+**Recommended Samplers:**
+```
+Temperature - 1.0
+TFS - 0.75
+Smoothing Factor - 0.3
+Smoothing Curve - 1.1
+Repetition Penalty - 1.08
+```
+### Merge Method
+This model was merged a series of model stock and lora merges, followed by ExPO. It uses a mix of smart and roleplay centered models to improve performance.
 ### Configuration
 The following YAML configuration was used to produce this model:
 ```yaml
+---
+# Mopey RP Mix
+models:
+  - model: failspy/Llama-3-8B-Instruct-MopeyMule+Azazelle/Llama-3-Sunfall-8b-lora
+  - model: failspy/Llama-3-8B-Instruct-MopeyMule+Azazelle/Llama-3-8B-Abomination-LORA
+  - model: failspy/Llama-3-8B-Instruct-MopeyMule+Azazelle/llama3-8b-hikikomori-v0.4
+  - model: failspy/Llama-3-8B-Instruct-MopeyMule+Azazelle/Llama-3-Instruct-LiPPA-LoRA-8B
+  - model: failspy/Llama-3-8B-Instruct-MopeyMule+Azazelle/BlueMoon_Llama3
+  - model: failspy/Llama-3-8B-Instruct-MopeyMule+Azazelle/Llama3_RP_ORPO_LoRA
+  - model: failspy/Llama-3-8B-Instruct-MopeyMule+Azazelle/Llama-3-LongStory-LORA
+merge_method: model_stock
+base_model: failspy/Llama-3-8B-Instruct-MopeyMule
+dtype: float32
+vocab_type: bpe
+name: mopey_rp
+---
+models:
+  - model: Nitral-AI/Hathor_Tahsin-L3-8B-v0.85
+  - model: Sao10K/L3-8B-Tamamo-v1
+  - model: Sao10K/L3-8B-Niitama-v1
+  - model: cycy233/L3-base-v2-e3.0
+  - model: Azazelle/L3-Hecate-8B-v1.0
+  - model: R136a1/Bungo-L3-8B
+  - model: Jellywibble/meseca-20062024-c1
+  - model: NeverSleep/Llama-3-Lumimaid-8B-v0.1-OAS
+  - model: Jellywibble/lora_120k_pref_data_ep2
+  - model: Nitral-AI/Hathor_Stable-v0.2-L3-8B
+  - model: mopey_rp
+merge_method: model_stock
 base_model: NousResearch/Meta-Llama-3-8B-Instruct
 dtype: float32
+vocab_type: bpe
+name: hq_rp
+---
+# ExPO
+models:
+  - model: hq_rp
     parameters:
       weight:
+        - filter: mlp
+          value: 1.25
+        - value: 1.1
+merge_method: task_arithmetic
+base_model: NousResearch/Meta-Llama-3-8B-Instruct
+parameters:
+  normalize: false
+dtype: float32
+vocab_type: bpe
 ```