Maziyar Panahi's picture

Maziyar Panahi PRO

MaziyarPanahi

AI & ML interests

Fine-Tuning, RLHF, Merging, Quantizations, Leaderboards

Recent Activity

View all activity

Organizations

Spark NLP's profile picture ONNXConfig for all's profile picture Blog-explorers's profile picture Arcee AI's profile picture ZeroGPU Explorers's profile picture LocalLLaMA's profile picture Unofficial Mistral Community's profile picture Top Contributors: Model Downloads's profile picture John Snow Labs's profile picture Calme's profile picture Arxflix's profile picture Arcee Training Org's profile picture Arcee Globe's profile picture rg-preview's profile picture Agentica's profile picture open/ acc's profile picture BIO CALME's profile picture

MaziyarPanahi's activity

replied to sometimesanotion's post about 6 hours ago
replied to sometimesanotion's post about 11 hours ago
reacted to sometimesanotion's post with 🔥🔥👍 about 11 hours ago
view post
Post
2721
I'd like to draw your attention to a Lamarck-based experiment which uses Arcee AI's newly published arcee_fusion merge method for three out of its four merges. Yes, just four. This is a simple one, and its recipe is fully open:

sometimesanotion/Lamarck-14B-v0.7-Fusion

It unifies three branches, all of which feature models which bring Lamarck-14B-v0.7 and Qwenvergence-14B-v12-Prose together. One side features @jpacifico 's jpacifico/Chocolatine-2-14B-Instruct-v2.0.3 and the other features @suayptalha 's suayptalha/Lamarckvergence-14B paired with my models which were their merge ancestors.

A fusion merge - of a fusion merge and a SLERP of a fusion and older merge - should demonstrate the new merge method's behavior in interesting ways, especially in the first 1/4th of the model where the SLERP has less impact.

I welcome you to kick the tires and learn from it. It has prose quality near Qwenvergence v12's - as you'd expect.

Thank you, @mradermacher and @MaziyarPanahi , for the first-day quantizations! Your work helped get me started. https://huggingface.co/models?other=base_model:quantized:sometimesanotion/Lamarck-14B-v0.7-Fusion
·
New activity in MaziyarPanahi/WizardLM-2-8x22B-AWQ about 11 hours ago

Update tokenizer_config.json

1
#5 opened about 19 hours ago by
Epiphany
New activity in MaziyarPanahi/Sombrero-Opus-14B-Elite6-GGUF about 17 hours ago
New activity in MaziyarPanahi/ZYH-LLM-Qwen2.5-14B-V3-GGUF about 19 hours ago