Update README.md
README.md CHANGED
@@ -1,41 +1,26 @@
Removed:

---
base_model:
library_name: transformers
tags:
- mergekit
- merge
---

#

This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

## Merge Details
### Merge Method

This model was merged using the [Arcee Fusion](https://arcee.ai) merge method using sometimesanotion/Slerp-Lamarckvevergence as a base.

### Models Merged

The following models were included in the merge:
* sometimesanotion/Chocolatine-Fusion-Qwenvergence

### Configuration

The following YAML configuration was used to produce this model:

```yaml
name: Lamarck-14B-v0.7-Fusion
merge_method: arcee_fusion
base_model: sometimesanotion/Slerp-Lamarckvevergence
tokenizer_source: base
parameters:
  int8_mask: true
  normalize: true
  rescale: false
dtype: bfloat16
out_dtype: bfloat16
models:
  - model: sometimesanotion/Chocolatine-Fusion-Qwenvergence
```
Added:

---
base_model:
- sometimesanotion/Lamarck-14B-v0.7
- sometimesanotion/Qwenvergence-14B-v12-Prose-DS
- jpacifico/Chocolatine-2-14B-Instruct-v2.0.3
- suayptalha/Lamarckvergence-14B
library_name: transformers
tags:
- mergekit
- merge
license: apache-2.0
language:
- en
---

# EXPERIMENTAL:
So what's this new arcee_fusion merge method, and what can we do with it? This model aims to find out: it is a multi-stage merge in which 3 of the 4 steps are fusions (example configs for a fusion stage and the SLERP stage are sketched after the list):
* A fusion of [Lamarck-14B-v0.7](http://huggingface.co/sometimesanotion/Lamarck-14B-v0.7) and @suayptalha's [Lamarckvergence SLERP merge](http://huggingface.co/suayptalha/Lamarckvergence-14B) of Lamarck-14B-v0.7 and [Qwenvergence-14B-v12-Prose-DS](http://huggingface.co/sometimesanotion/Qwenvergence-14B-v12-Prose-DS).
* A SLERP of Lamarck-14B-v0.7-Fusionvergence with Qwenvergence-14B-v12-Prose-DS, the latter emphasized in later layers.
* A fusion of @jpacifico's [Chocolatine-2-14B-Instruct-v2.0.3](http://huggingface.co/jpacifico/Chocolatine-2-14B-Instruct-v2.0.3), itself a finetune of Arcee's [Virtuoso-Small-v2](https://huggingface.co/arcee-ai/Virtuoso-Small-v2) with Lamarck-14B-v0.7 and Qwenvergence-14B-v12-Prose-DS, fusion-merged with - you guessed it - Qwenvergence-14B-v12-Prose-DS.
* A fusion of the previous two.
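
To make the fusion steps concrete, here is a minimal sketch of what the first stage might look like as a mergekit config. It mirrors the shape of the arcee_fusion configuration shown in the removed card above; the output name and parameter values are assumptions for illustration, not the released recipe.

```yaml
# Hypothetical sketch of the first stage: Lamarck-14B-v0.7 fused with
# suayptalha's Lamarckvergence-14B. Output name and parameters are assumptions.
name: Lamarck-14B-v0.7-Fusionvergence
merge_method: arcee_fusion
base_model: sometimesanotion/Lamarck-14B-v0.7
tokenizer_source: base
parameters:
  int8_mask: true
  normalize: true
  rescale: false
dtype: bfloat16
out_dtype: bfloat16
models:
  - model: suayptalha/Lamarckvergence-14B
```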
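The SLERP stage would use mergekit's slerp method instead. Another hedged sketch: the output name is a guess, and the rising t gradient is only one way to express "the latter emphasized in later layers".

```yaml
# Hypothetical sketch of the SLERP stage. t interpolates from the base model
# (t=0) toward Qwenvergence-14B-v12-Prose-DS (t=1), so a rising gradient
# emphasizes Prose-DS in later layers. Names and values are assumptions.
name: Slerp-Lamarckvevergence
merge_method: slerp
base_model: sometimesanotion/Lamarck-14B-v0.7-Fusionvergence
tokenizer_source: base
parameters:
  t:
    - value: [0.2, 0.3, 0.5, 0.7, 0.8]
dtype: bfloat16
models:
  - model: sometimesanotion/Lamarck-14B-v0.7-Fusionvergence
  - model: sometimesanotion/Qwenvergence-14B-v12-Prose-DS
```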

I've seen strong prose from this model, which is natural considering its re-emphasis of Qwenvergence-14B-v12-Prose-DS. A full evaluation will be queued shortly.
This is actually a lot simpler than a mainline Lamarck release, and where it fits in the effort towards a Lamarck v0.8 depends greatly on evaluation and feedback.