sometimesanotion
/

Lamarck-14B-v0.6

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

sometimesanotion commited on 26 days ago

Commit

0f6450f

·

verified ·

1 Parent(s): 04277a3

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -27,7 +27,7 @@ Lamarck 14B v0.6:  A generalist merge focused on multi-step reasoning, prose, mu
 The tempo of Lamarck releases slowed because improving IFEVAL while maintaining other scores is no small task.  Previous releases were based on a SLERP merge of model_stock->della branches focused on reasoning and prose.  The prose branch got surprisingly good at reasoning, and the reasoning branch became a strong generalist in its own right.  Some of you have already downloaded it as [sometimesanotion/Qwen2.5-14B-Vimarckoso-v3](https://huggingface.co/sometimesanotion/Qwen2.5-14B-Vimarckoso-v3).
-Lamarck 0.6 aims to build upon Vimarckoso v3's all-around strength with improvements to prose and translation quality, and strong reasoning for its class.  Updates to come as leaderboards become available to evaluate it in-depth.  Even now, initial testing is showing solid translation, problem-solving, and prose capability.
 The strengths Lamarck has combined from its immediate ancestors are in turn derived from select finetunes and merges.  Kudoes to @arcee-ai, @CultriX, @sthenno-com, @Krystalan, @underwoods, @VAGOSolutions, and @rombodawg whose models had the most influence, as [Vimarckoso v3](https://huggingface.co/sometimesanotion/Qwen2.5-14B-Vimarckoso-v3)'s model card will show.

 The tempo of Lamarck releases slowed because improving IFEVAL while maintaining other scores is no small task.  Previous releases were based on a SLERP merge of model_stock->della branches focused on reasoning and prose.  The prose branch got surprisingly good at reasoning, and the reasoning branch became a strong generalist in its own right.  Some of you have already downloaded it as [sometimesanotion/Qwen2.5-14B-Vimarckoso-v3](https://huggingface.co/sometimesanotion/Qwen2.5-14B-Vimarckoso-v3).
+Lamarck 0.6 aims to build upon Vimarckoso v3's all-around strengths by using breadcrumbs and DELLA merges, with highly targeted weight/density gradients for every four layers and special andling for the first and final two layers.  This approach selectively merges the strongest aspects of its ancestors.
 The strengths Lamarck has combined from its immediate ancestors are in turn derived from select finetunes and merges.  Kudoes to @arcee-ai, @CultriX, @sthenno-com, @Krystalan, @underwoods, @VAGOSolutions, and @rombodawg whose models had the most influence, as [Vimarckoso v3](https://huggingface.co/sometimesanotion/Qwen2.5-14B-Vimarckoso-v3)'s model card will show.