|
--- |
|
base_model: |
|
- unsloth/Qwen2.5-7B-bnb-4bit |
|
- unsloth/gemma-2-9b-it-bnb-4bit |
|
tags: |
|
- text-generation-inference |
|
- transformers |
|
- unsloth |
|
- qwen2 |
|
- gemma2 |
|
- trl |
|
license: apache-2.0 |
|
language: |
|
- en |
|
datasets: |
|
- Sweaterdog/MindCraft-LLM-tuning |
|
--- |
|
|
|
# Uploaded model |
|
|
|
- **Developed by:** Sweaterdog |
|
- **License:** apache-2.0 |
|
- **Finetuned from model :** unsloth/Qwen2.5-7B-bnb-4bit |
|
|
|
The MindCraft LLM tuning CSV file can be found here, this can be tweaked as needed. [MindCraft-LLM](https://huggingface.co/datasets/Sweaterdog/MindCraft-LLM-tuning/raw/main/Gemini-Minecraft%20-%20training_data_minecraft_updated.csv) |
|
|
|
# What is the Purpose? |
|
|
|
This model is built and designed to play Minecraft via the extension named "[MindCraft](https://github.com/kolbytn/mindcraft)" Which allows language models, like the ones provided in the files section, to play Minecraft. |
|
- Why a new model? |
|
# |
|
While, yes, models that aren't fine tuned to play Minecraft *Can* play Minecraft, most are slow, innaccurate, and not as smart, in the fine tuning, it expands reasoning, conversation examples, and command (tool) usage. |
|
- What kind of Dataset was used? |
|
# |
|
I'm deeming this model *"Hermes"*, it was trained for reasoning by using examples of in-game "Vision" as well as examples of spacial reasoning, for expanding thinking, I also added puzzle examples where the model broke down the process step by step to reach the goal. |
|
- Why choose Qwen2.5 for the base model? |
|
# |
|
During testing, to find the best local LLM for playing Minecraft, I came across two, Gemma 2, and Qwen2.5, these two were by far the best at playing Minecraft before fine-tuning, and I knew, once tuned, it would become better. |
|
|
|
# How to Use |
|
In order to use this model, A, download the GGUF file of the version you want, either a Qwen, or Gemma model, and then the Modelfile, after you download both, in the Modelfile, change the directory of the model, to your model. Here is a simple guide if needed for the rest: |
|
# |
|
1.Download the .gguf Model u want. For this example it is in the standard Windows "Download" Folder |
|
|
|
2.Download the Modelfile |
|
|
|
3.Open the Modelfile with / in notepad, or you can rename it to Modelfile.txt, and change the GGUF path, for example, this is my PATH "C:\Users\SweaterDog\OneDrive\Documents\Raw GGUF Files\Hermes-1.0\Hermes-1.Q8_0.gguf" |
|
|
|
4.Safe + Close Modelfile |
|
|
|
5.Rename "Modelfile.txt" into "Modelfile" if you changed it before-hand |
|
|
|
6.Open CMD and type in "ollama create Hermes1 -f Modelfile" (You can change the name to anything you'd like, for this example, I am just using the same name as the GGUF) |
|
|
|
7.Wait until finished |
|
|
|
8.In the CMD window, type "ollama run Hermes1" (replace the 1 in Hermes with whatever version you downloaded) |
|
|
|
# |
|
|
|
I'm aware it does say there are multiple Qwen2.5 files, even though there are two, and it also says there are Gemma2 models, even though there isn't, I am aware and have been trying to train the rest of these models. |
|
|
|
# |
|
|
|
This qwen2 and gemma2 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library. |
|
|
|
[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth) |