evo_model_test
This is a merge of pre-trained language models created using mergekit.
Merge Details
Merge Method
This model was merged using the DARE TIES merge method using ./evolve_merges/input_models/merge-10162024_972739363 as a base.
Models Merged
The following models were included in the merge:
- ./evolve_merges/input_models/Magnum-Picaro-0.7-v2-12b_3809452655
- ./evolve_merges/input_models/Chronos-Gold-12B-1.0_1861025797
- ./evolve_merges/input_models/MN-12B-Mag-Mell-R1_399051020
Configuration
The following YAML configuration was used to produce this model:
base_model: ./evolve_merges/input_models/merge-10162024_972739363
dtype: bfloat16
merge_method: dare_ties
parameters:
int8_mask: 1.0
normalize: 1.0
slices:
- sources:
- layer_range: [0, 4]
model: ./evolve_merges/input_models/merge-10162024_972739363
parameters:
density:
- filter: self_attn
value: 0.6617851833521375
- filter: mlp
value: 1.0
- value: 0.7758506135029611
weight:
- filter: self_attn
value: 0.06553850894305135
- filter: mlp
value: 0.32372893196093133
- value: 0.24761893893703177
- layer_range: [0, 4]
model: ./evolve_merges/input_models/Magnum-Picaro-0.7-v2-12b_3809452655
parameters:
density:
- filter: self_attn
value: 0.8619096186212604
- filter: mlp
value: 0.9632945037149085
- value: 1.0
weight:
- filter: self_attn
value: 0.5496368676404241
- filter: mlp
value: 0.2817627768141395
- value: 0.2831242003449033
- layer_range: [0, 4]
model: ./evolve_merges/input_models/Chronos-Gold-12B-1.0_1861025797
parameters:
density:
- filter: self_attn
value: 1.0
- filter: mlp
value: 1.0
- value: 0.9238831652008582
weight:
- filter: self_attn
value: 0.6983534009784523
- filter: mlp
value: 0.7786486269006042
- value: 0.3362711484417948
- layer_range: [0, 4]
model: ./evolve_merges/input_models/MN-12B-Mag-Mell-R1_399051020
parameters:
density:
- filter: self_attn
value: 1.0
- filter: mlp
value: 1.0
- value: 0.897712174766424
weight:
- filter: self_attn
value: 0.6494468053120542
- filter: mlp
value: 0.11769817501358182
- value: 0.23745407940550356
- sources:
- layer_range: [4, 8]
model: ./evolve_merges/input_models/merge-10162024_972739363
parameters:
density:
- filter: self_attn
value: 0.768056839478356
- filter: mlp
value: 0.7392675781352855
- value: 1.0
weight:
- filter: self_attn
value: 0.4137398667324908
- filter: mlp
value: 0.5364761127195374
- value: -0.06120952450996993
- layer_range: [4, 8]
model: ./evolve_merges/input_models/Magnum-Picaro-0.7-v2-12b_3809452655
parameters:
density:
- filter: self_attn
value: 0.9328263901133284
- filter: mlp
value: 1.0
- value: 1.0
weight:
- filter: self_attn
value: 0.512662918449004
- filter: mlp
value: 0.8133160093541117
- value: 0.09518477923218693
- layer_range: [4, 8]
model: ./evolve_merges/input_models/Chronos-Gold-12B-1.0_1861025797
parameters:
density:
- filter: self_attn
value: 1.0
- filter: mlp
value: 1.0
- value: 1.0
weight:
- filter: self_attn
value: 0.6534355737222919
- filter: mlp
value: -0.2733724467069448
- value: 0.35896371241039604
- layer_range: [4, 8]
model: ./evolve_merges/input_models/MN-12B-Mag-Mell-R1_399051020
parameters:
density:
- filter: self_attn
value: 1.0
- filter: mlp
value: 0.9645408518441749
- value: 0.9920721804462888
weight:
- filter: self_attn
value: 0.043888879112993606
- filter: mlp
value: 0.37533863309727755
- value: 0.32692015564467836
- sources:
- layer_range: [8, 12]
model: ./evolve_merges/input_models/merge-10162024_972739363
parameters:
density:
- filter: self_attn
value: 0.9340306321054911
- filter: mlp
value: 1.0
- value: 0.7968276665543247
weight:
- filter: self_attn
value: 0.14846986084920036
- filter: mlp
value: 0.3955452929300913
- value: 0.4270837195831495
- layer_range: [8, 12]
model: ./evolve_merges/input_models/Magnum-Picaro-0.7-v2-12b_3809452655
parameters:
density:
- filter: self_attn
value: 1.0
- filter: mlp
value: 1.0
- value: 1.0
weight:
- filter: self_attn
value: 0.3649415030710907
- filter: mlp
value: 0.16275044387393922
- value: 0.2758727640654811
- layer_range: [8, 12]
model: ./evolve_merges/input_models/Chronos-Gold-12B-1.0_1861025797
parameters:
density:
- filter: self_attn
value: 0.8295983370283204
- filter: mlp
value: 0.7788134370117827
- value: 0.9398894811483364
weight:
- filter: self_attn
value: 0.28746483121862637
- filter: mlp
value: 0.3358374043922244
- value: 0.2275533582239845
- layer_range: [8, 12]
model: ./evolve_merges/input_models/MN-12B-Mag-Mell-R1_399051020
parameters:
density:
- filter: self_attn
value: 1.0
- filter: mlp
value: 1.0
- value: 0.727821766634972
weight:
- filter: self_attn
value: 0.3081244623443608
- filter: mlp
value: 0.45014674558784984
- value: 0.11047219740073362
- sources:
- layer_range: [12, 16]
model: ./evolve_merges/input_models/merge-10162024_972739363
parameters:
density:
- filter: self_attn
value: 0.6489316039694529
- filter: mlp
value: 1.0
- value: 0.8272372022626591
weight:
- filter: self_attn
value: 0.470708064142626
- filter: mlp
value: -0.047129110924588186
- value: 0.42971949234723295
- layer_range: [12, 16]
model: ./evolve_merges/input_models/Magnum-Picaro-0.7-v2-12b_3809452655
parameters:
density:
- filter: self_attn
value: 1.0
- filter: mlp
value: 0.6616234442454084
- value: 1.0
weight:
- filter: self_attn
value: 0.26282202905677127
- filter: mlp
value: 0.4448525732857457
- value: 0.2229765978922556
- layer_range: [12, 16]
model: ./evolve_merges/input_models/Chronos-Gold-12B-1.0_1861025797
parameters:
density:
- filter: self_attn
value: 1.0
- filter: mlp
value: 0.6135513085208061
- value: 0.9581737790930396
weight:
- filter: self_attn
value: 0.24444794214178578
- filter: mlp
value: 0.07937992720612315
- value: -0.05228450555064985
- layer_range: [12, 16]
model: ./evolve_merges/input_models/MN-12B-Mag-Mell-R1_399051020
parameters:
density:
- filter: self_attn
value: 1.0
- filter: mlp
value: 1.0
- value: 1.0
weight:
- filter: self_attn
value: 0.1719406804216106
- filter: mlp
value: 0.0934880168140769
- value: 0.35045642161724166
- sources:
- layer_range: [16, 20]
model: ./evolve_merges/input_models/merge-10162024_972739363
parameters:
density:
- filter: self_attn
value: 0.5446785752563841
- filter: mlp
value: 0.8810586946591301
- value: 0.9152297583356134
weight:
- filter: self_attn
value: -0.0016341576761690624
- filter: mlp
value: -0.14493024949671152
- value: 0.26832439639581773
- layer_range: [16, 20]
model: ./evolve_merges/input_models/Magnum-Picaro-0.7-v2-12b_3809452655
parameters:
density:
- filter: self_attn
value: 1.0
- filter: mlp
value: 0.5944606032155147
- value: 0.9302142529770252
weight:
- filter: self_attn
value: 0.35950618403078893
- filter: mlp
value: 0.11051887834512175
- value: 0.42291230769302385
- layer_range: [16, 20]
model: ./evolve_merges/input_models/Chronos-Gold-12B-1.0_1861025797
parameters:
density:
- filter: self_attn
value: 1.0
- filter: mlp
value: 0.6546859569496538
- value: 0.8503723026949942
weight:
- filter: self_attn
value: 0.35331354069135923
- filter: mlp
value: 0.11666399796526544
- value: 0.027977616826786067
- layer_range: [16, 20]
model: ./evolve_merges/input_models/MN-12B-Mag-Mell-R1_399051020
parameters:
density:
- filter: self_attn
value: 0.8237153213010172
- filter: mlp
value: 0.7779880619326531
- value: 1.0
weight:
- filter: self_attn
value: 0.7145318763470817
- filter: mlp
value: 0.4104048815986916
- value: 0.07468194955613425
- sources:
- layer_range: [20, 24]
model: ./evolve_merges/input_models/merge-10162024_972739363
parameters:
density:
- filter: self_attn
value: 0.5231923060339636
- filter: mlp
value: 1.0
- value: 0.9856713754180749
weight:
- filter: self_attn
value: 0.4081014822719611
- filter: mlp
value: 0.09758488254406042
- value: 0.3348194266336727
- layer_range: [20, 24]
model: ./evolve_merges/input_models/Magnum-Picaro-0.7-v2-12b_3809452655
parameters:
density:
- filter: self_attn
value: 1.0
- filter: mlp
value: 1.0
- value: 1.0
weight:
- filter: self_attn
value: 0.7490383834336071
- filter: mlp
value: 0.4662047924812158
- value: -0.24858277913931304
- layer_range: [20, 24]
model: ./evolve_merges/input_models/Chronos-Gold-12B-1.0_1861025797
parameters:
density:
- filter: self_attn
value: 1.0
- filter: mlp
value: 1.0
- value: 0.8502797089454639
weight:
- filter: self_attn
value: 0.276884170342346
- filter: mlp
value: 0.633656940319029
- value: 0.5235799339573071
- layer_range: [20, 24]
model: ./evolve_merges/input_models/MN-12B-Mag-Mell-R1_399051020
parameters:
density:
- filter: self_attn
value: 1.0
- filter: mlp
value: 0.8562223977334964
- value: 0.9716150483673114
weight:
- filter: self_attn
value: 0.5270260765195226
- filter: mlp
value: 0.32711936701658684
- value: 0.05670152518434478
- sources:
- layer_range: [24, 28]
model: ./evolve_merges/input_models/merge-10162024_972739363
parameters:
density:
- filter: self_attn
value: 1.0
- filter: mlp
value: 1.0
- value: 0.8553635955278736
weight:
- filter: self_attn
value: 0.35406982791511876
- filter: mlp
value: -0.11643971781340703
- value: 0.20075532527415488
- layer_range: [24, 28]
model: ./evolve_merges/input_models/Magnum-Picaro-0.7-v2-12b_3809452655
parameters:
density:
- filter: self_attn
value: 1.0
- filter: mlp
value: 0.87297120460794
- value: 1.0
weight:
- filter: self_attn
value: 0.07480839031742999
- filter: mlp
value: 0.18311115096539785
- value: 0.3625508152553395
- layer_range: [24, 28]
model: ./evolve_merges/input_models/Chronos-Gold-12B-1.0_1861025797
parameters:
density:
- filter: self_attn
value: 1.0
- filter: mlp
value: 1.0
- value: 1.0
weight:
- filter: self_attn
value: 0.494667527482752
- filter: mlp
value: 0.3944202674139632
- value: -0.19227439649461792
- layer_range: [24, 28]
model: ./evolve_merges/input_models/MN-12B-Mag-Mell-R1_399051020
parameters:
density:
- filter: self_attn
value: 1.0
- filter: mlp
value: 1.0
- value: 1.0
weight:
- filter: self_attn
value: 0.06851638816347627
- filter: mlp
value: 0.431372227001768
- value: 0.1747985843980182
- sources:
- layer_range: [28, 32]
model: ./evolve_merges/input_models/merge-10162024_972739363
parameters:
density:
- filter: self_attn
value: 0.9094528371038374
- filter: mlp
value: 1.0
- value: 0.6090545725123906
weight:
- filter: self_attn
value: 0.25309591486694805
- filter: mlp
value: -0.263292487608102
- value: 0.1323202337738385
- layer_range: [28, 32]
model: ./evolve_merges/input_models/Magnum-Picaro-0.7-v2-12b_3809452655
parameters:
density:
- filter: self_attn
value: 0.6494843615875994
- filter: mlp
value: 1.0
- value: 0.7515064103597758
weight:
- filter: self_attn
value: 0.07729701084822604
- filter: mlp
value: 0.2170958326731126
- value: 0.22214702687265422
- layer_range: [28, 32]
model: ./evolve_merges/input_models/Chronos-Gold-12B-1.0_1861025797
parameters:
density:
- filter: self_attn
value: 0.8431056158343985
- filter: mlp
value: 0.8838909258744341
- value: 0.35295455870641634
weight:
- filter: self_attn
value: 0.6551015978225493
- filter: mlp
value: 0.016410780482769546
- value: 0.6370635339121399
- layer_range: [28, 32]
model: ./evolve_merges/input_models/MN-12B-Mag-Mell-R1_399051020
parameters:
density:
- filter: self_attn
value: 1.0
- filter: mlp
value: 1.0
- value: 1.0
weight:
- filter: self_attn
value: 0.04318024669287196
- filter: mlp
value: 0.7642269685567962
- value: 0.26850603466331324
- sources:
- layer_range: [32, 36]
model: ./evolve_merges/input_models/merge-10162024_972739363
parameters:
density:
- filter: self_attn
value: 1.0
- filter: mlp
value: 1.0
- value: 0.579520070097527
weight:
- filter: self_attn
value: -0.051737601944818495
- filter: mlp
value: 0.3503787657405606
- value: 0.08607827555366553
- layer_range: [32, 36]
model: ./evolve_merges/input_models/Magnum-Picaro-0.7-v2-12b_3809452655
parameters:
density:
- filter: self_attn
value: 1.0
- filter: mlp
value: 1.0
- value: 1.0
weight:
- filter: self_attn
value: 0.28766985337224327
- filter: mlp
value: 0.3046959778412749
- value: -0.0005520428411238121
- layer_range: [32, 36]
model: ./evolve_merges/input_models/Chronos-Gold-12B-1.0_1861025797
parameters:
density:
- filter: self_attn
value: 1.0
- filter: mlp
value: 0.915429997855087
- value: 1.0
weight:
- filter: self_attn
value: 0.440410051026902
- filter: mlp
value: -0.21574554516791783
- value: 0.15656972383477347
- layer_range: [32, 36]
model: ./evolve_merges/input_models/MN-12B-Mag-Mell-R1_399051020
parameters:
density:
- filter: self_attn
value: 1.0
- filter: mlp
value: 1.0
- value: 1.0
weight:
- filter: self_attn
value: 0.3263876152481672
- filter: mlp
value: -0.040618303294953154
- value: 0.47900376528192473
- sources:
- layer_range: [36, 40]
model: ./evolve_merges/input_models/merge-10162024_972739363
parameters:
density:
- filter: self_attn
value: 0.9171778237104341
- filter: mlp
value: 0.7229727777891508
- value: 0.9122033861491662
weight:
- filter: self_attn
value: 0.6154987734241069
- filter: mlp
value: 0.3910860949496661
- value: 0.5286422728941228
- layer_range: [36, 40]
model: ./evolve_merges/input_models/Magnum-Picaro-0.7-v2-12b_3809452655
parameters:
density:
- filter: self_attn
value: 0.6023409600465159
- filter: mlp
value: 1.0
- value: 1.0
weight:
- filter: self_attn
value: 0.39644253937030505
- filter: mlp
value: 0.7570672338863116
- value: 0.10261227723433294
- layer_range: [36, 40]
model: ./evolve_merges/input_models/Chronos-Gold-12B-1.0_1861025797
parameters:
density:
- filter: self_attn
value: 1.0
- filter: mlp
value: 1.0
- value: 0.8342554461687561
weight:
- filter: self_attn
value: 0.4563403174251752
- filter: mlp
value: 0.313992481082509
- value: 0.022583139471508834
- layer_range: [36, 40]
model: ./evolve_merges/input_models/MN-12B-Mag-Mell-R1_399051020
parameters:
density:
- filter: self_attn
value: 1.0
- filter: mlp
value: 0.9211392650515542
- value: 1.0
weight:
- filter: self_attn
value: -0.17092104595693997
- filter: mlp
value: 0.13032109680489912
- value: -0.03480332269062497
- Downloads last month
- 185
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.