Update README.md
Browse files
README.md
CHANGED
@@ -8,4 +8,6 @@ Please respect the respective licenses of Meta Llama & Nous Research.
|
|
8 |
The method I used is detailed in [this post](https://www.reddit.com/r/LocalLLaMA/comments/1fzduyx/merging_llama_32_vision_adapters_onto_31_finetunes/). I also merged the tokenizer and generation configs.
|
9 |
Example python code for weight merging is available in [merge_vision_example.py](https://huggingface.co/grimulkan/Llama-3.2-90B-Vision-Hermes-3-lorablated-merge/blob/main/merge_vision_example.py), which works for both 11B and 90B.
|
10 |
|
11 |
-
|
|
|
|
|
|
8 |
The method I used is detailed in [this post](https://www.reddit.com/r/LocalLLaMA/comments/1fzduyx/merging_llama_32_vision_adapters_onto_31_finetunes/). I also merged the tokenizer and generation configs.
|
9 |
Example python code for weight merging is available in [merge_vision_example.py](https://huggingface.co/grimulkan/Llama-3.2-90B-Vision-Hermes-3-lorablated-merge/blob/main/merge_vision_example.py), which works for both 11B and 90B.
|
10 |
|
11 |
+
This 11B merge is less stable than the 90B (which is very stable). Keep `temperature <= 0.7`.
|
12 |
+
|
13 |
+
The 90B version of this merge is [available here](https://huggingface.co/grimulkan/Llama-3.2-90B-Vision-Hermes-3-lorablated-merge).
|