|
--- |
|
license: apache-2.0 |
|
datasets: |
|
- Sweaterdog/Andy-3.5 |
|
language: |
|
- en |
|
base_model: |
|
- deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B |
|
tags: |
|
- Minecraft |
|
--- |
|
|
|
# 🚀 Welcome to a new generation of Minecraft with Andy 3.5 🚀 |
|
## Andy 3.5 is a collection of LOCAL LLM's designed for playing Minecraft |
|
*Andy 3.5 is designed to be used with MindCraft, and is not designed nor intended to be used for any other applications* |
|
|
|
### How to Install |
|
|
|
1. Select the model you would like to use *(Larger is better)* |
|
2. Download a Modelfile *(One is directly for the tuned model, while the other is for the base model) |
|
3. Once downloaded, open Modelfile in a text editor, and change the path to the download location of the gguf file |
|
4. When changed, save the file, and open command terminal |
|
5. *(Optional if CMD is opened via file explorer)* Navigate to the correct directory using "cd" |
|
6. Run the command ollama create Andy-3.5 -f Modelfile |
|
7. Go to a profile in MindCraft |
|
8. Change the model to be Andy-3.5 |
|
9. Enjoy playing with an AI |
|
|
|
|
|
## How was model trained? |
|
|
|
The model was trained on a dataset of ~9,000 messages coming directly from MindCraft, ensuring quality data, not the newer version with ~12,000 prompts |
|
|
|
## What are capabilities and Limitations? |
|
|
|
The smaller model *(The preview ones at least)* had 1/3 of the parameters tuned, the larger preview model has not been released yet. |
|
Andy-3.5 was trained on EVERYTHING regarding Minecraft and MindCraft, it knows how to use commands natively without a system prompt. |
|
Andy-3.5 also knows how to build / use !newAction to perform commands, it was trained on lots of building, as well as, using !newAction to do tasks like manually making something or strip mining. |
|
|
|
****Know this is a PREVIEW model, it is NOT finished!**** |
|
|
|
## Why a preview model? |
|
|
|
Andy-3.5-preview was made to test the intelligence of a Minecraft Ai with the current dataset, it was meant to see the progress of the training and what area's are needed for the future |
|
DO NOT expect this model to be able to do everything perfectly, it only knows as much as the dataset told it, as well as the other 2/3 of the untouched parameters allow. |
|
The model *may* experience bugs, such as not saying your name, getting previous messages confused, or other small things. |
|
|
|
# What models can I choose? |
|
|
|
There are going to be 2 *(maybe 3)* model sizes avaliable, Regular, Mini *(And Maybe large)* |
|
* Regular is a 7B parameter model, tuned from [Deepseek-R1 Distilled](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B) |
|
* Mini is a 1.5B parameter model, also tuned from [Deepseek-R1 Distilled](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B) |
|
* Large *(Might)* be a 32b parameter model, again tuned from [Deepseek-R1 Distilled](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B) *- This model may not exist,* ***ever*** |
|
|
|
Out of all of the models, Teensy had the largest percent of parameters tuned, being 1/2 the models total size |
|
|
|
## Safety and FAQ |
|
|
|
Q: Is this model safe to use? |
|
A. Yes, this model is non-volatile, and cannot generate malicous content |
|
|
|
Q. Can this model be used on a server? |
|
A. Yes, In theory and practice the model is only capable of building and performing manual tasks via newAction |
|
|
|
Q. Who is responsible if this model does generate malicous content? |
|
A. You are responsible, even though the model was never trained to be able to make malicous content, there is a ***very very slight chance*** it still generates malicous code. |
|
|
|
Q. If I make media based on this model, like photos / videos, do I have to mention the Creator? |
|
A. No, if you are making a post about MindCraft, and using this model, you only have to mention the creator if you mention the model being used. |
|
|
|
## Important notes and considerations |
|
|
|
The preview model of Andy-3.5, is Andy-3.5-teensy, a small model and tune with only 360 million parameters, it ***"understand Minecraft"***. |
|
I would not recommend Andy-3.5-teensy, I felt like making a joke, and a joke was made, *(The Andy-3.5-teensy model was a big hope, but it sucks, try out the q2_k model!)* |
|
|
|
|
|
When the full versions of Andy-3.5 and Andy-3.5-mini *(And possibly Andy-3.5-large)* release, they will both be trained on a context length of 32,000 to ensure proper usage during playing. |