KaraKaraWitch
/

LLENN-v0.75-Qwen2.5-72b

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

KaraKaraWitch commited on Nov 6, 2024

Commit

35708d9

·

verified ·

1 Parent(s): e367c36

Create README.md

Files changed (1) hide show

README.md +80 -0

README.md ADDED Viewed

	@@ -0,0 +1,80 @@

+---
+license: other
+license_name: qwen
+license_link: https://huggingface.co/Qwen/Qwen2.5-72B/blob/main/LICENSE
+base_model:
+- rombodawg/Rombos-LLM-V2.5-Qwen-72b
+- abacusai/Dracarys2-72B-Instruct
+- EVA-UNIT-01/EVA-Qwen2.5-72B-v0.0
+- ZeusLabs/Chronos-Platinum-72B
+- Qwen/Qwen2.5-72B
+- m8than/banana-2-b-72b
+language:
+- en
+pipeline_tag: text-generation
+library_name: transformers
+tags:
+- mergekit
+- merge
+---
+# LLENN-v0.75-Qwen2.5-72b
+[![image/png](https://cdn-uploads.huggingface.co/production/uploads/633e85093a17ab61de8d9073/mYiG-Ndxzqu8ofaBGbOIZ.png)](https://www.youtube.com/watch?v=PaEPo1sUc4Y "Cute Girl with a gun!")
+I liked the previous model, but didn't *exactly* liked the claude vibes it's giving me. So I removed magnum. Other than that, there isn't any new model to merge in so the rest is kept as-is.
+**Please do not ask for quants, contact others instead.**
+*All models are ready for testing on [featherless.ai](https://featherless.ai) as soon as it goes live.*
+### Models Merged
+The following models were included in the merge:
+* [rombodawg/Rombos-LLM-V2.5-Qwen-72b](https://huggingface.co/rombodawg/Rombos-LLM-V2.5-Qwen-72b)
+* [abacusai/Dracarys2-72B-Instruct](https://huggingface.co/abacusai/Dracarys2-72B-Instruct)
+* [EVA-UNIT-01/EVA-Qwen2.5-72B-v0.0](https://huggingface.co/EVA-UNIT-01/EVA-Qwen2.5-72B-v0.0)
+* [ZeusLabs/Chronos-Platinum-72B](https://huggingface.co/ZeusLabs/Chronos-Platinum-72B)
+* [m8than/banana-2-b-72b](https://huggingface.co/m8than/banana-2-b-72b)
+### Configuration
+The following YAML configuration was used to produce this model:
+```yaml
+models:
+  - model: EVA-UNIT-01/EVA-Qwen2.5-72B-v0.0
+  - model: ZeusLabs/Chronos-Platinum-72B
+  - model: abacusai/Dracarys2-72B-Instruct
+  - model: rombodawg/Rombos-LLM-V2.5-Qwen-72b
+  - model: m8than/banana-2-b-72b
+merge_method: model_stock
+base_model: Qwen/Qwen2.5-72B
+parameters:
+  normalize: true
+dtype: bfloat16
+```
+### Prompt Format
+ChatML works for the most part.
+### Sampler Settings
+Personally I use the following:
+```
+Temp: 1.2
+Min P: 0.07
+Rep Pen: 1.1
+```
+Others have suggested the following:
+```
+Temp: 1.1
+Top P: 0.98
+Min P: 0.05
+```