grimulkan/Llama-3.2-11B-Vision-Instruct-Hermes-3-lorablated


grimulkan committed on Oct 21, 2024

Commit 73796a3 (verified) · 1 Parent(s): 1876892

Update README.md

Files changed (1)

README.md +3 -1


@@ -8,4 +8,6 @@ Please respect the respective licenses of Meta Llama & Nous Research.
 The method I used is detailed in [this post](https://www.reddit.com/r/LocalLLaMA/comments/1fzduyx/merging_llama_32_vision_adapters_onto_31_finetunes/). I also merged the tokenizer and generation configs.
 Example python code for weight merging is available in [merge_vision_example.py](https://huggingface.co/grimulkan/Llama-3.2-90B-Vision-Hermes-3-lorablated-merge/blob/main/merge_vision_example.py), which works for both 11B and 90B.
-A 90B version of this merge is [available here](https://huggingface.co/grimulkan/Llama-3.2-90B-Vision-Hermes-3-lorablated-merge).
+This 11B merge is less stable than the 90B (which is very stable). Keep `temperature <= 0.7`.
+The 90B version of this merge is [available here](https://huggingface.co/grimulkan/Llama-3.2-90B-Vision-Hermes-3-lorablated-merge).
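
The `temperature <= 0.7` advice added in this commit can be enforced in code rather than remembered. The following is a minimal sketch, not part of the repository: the helper name `stable_sampling_kwargs` and the constant `MAX_STABLE_TEMPERATURE` are hypothetical, and it assumes the resulting dictionary is passed to a Hugging Face `generate()` call, which accepts `do_sample` and `temperature`.

```python
# Hypothetical helper: cap the sampling temperature at the README's limit.
MAX_STABLE_TEMPERATURE = 0.7  # value taken from the README note for the 11B merge


def stable_sampling_kwargs(temperature=MAX_STABLE_TEMPERATURE, **overrides):
    """Return generation kwargs with temperature capped at 0.7.

    Extra keyword arguments (e.g. max_new_tokens) are passed through unchanged.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive when sampling")
    kwargs = {
        "do_sample": True,
        "temperature": min(temperature, MAX_STABLE_TEMPERATURE),
    }
    kwargs.update(overrides)
    return kwargs


# Assumed usage with an already loaded model and prepared inputs:
# output = model.generate(**inputs, **stable_sampling_kwargs(1.0, max_new_tokens=256))
print(stable_sampling_kwargs(1.0, max_new_tokens=256))
```

Capping instead of raising an error on high values keeps existing call sites working while still respecting the recommended limit.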