Andy-3.5 / README.md

Update README.md

8b9294e verified 8 days ago

4.27 kB

	---
	license: apache-2.0
	datasets:
	- Sweaterdog/Andy-3.5
	language:
	- en
	base_model:
	- deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
	tags:
	- Minecraft
	---

	# 🚀 Welcome to a new generation of Minecraft with Andy 3.5 🚀
	## Andy 3.5 is a collection of LOCAL LLM's designed for playing Minecraft
	Andy 3.5 is designed to be used with MindCraft, and is not designed nor intended to be used for any other applications

	### How to Install

	1. Select the model you would like to use (Larger is better)
	2. Download a Modelfile *(One is directly for the tuned model, while the other is for the base model)
	3. Once downloaded, open Modelfile in a text editor, and change the path to the download location of the gguf file
	4. When changed, save the file, and open command terminal
	5. (Optional if CMD is opened via file explorer) Navigate to the correct directory using "cd"
	6. Run the command ollama create Andy-3.5 -f Modelfile
	7. Go to a profile in MindCraft
	8. Change the model to be Andy-3.5
	9. Enjoy playing with an AI


	## How was model trained?

	The model was trained on a dataset of ~9,000 messages coming directly from MindCraft, ensuring quality data, not the newer version with ~12,000 prompts

	## What are capabilities and Limitations?

	The smaller model (The preview ones at least) had 1/3 of the parameters tuned, the larger preview model has not been released yet.
	Andy-3.5 was trained on EVERYTHING regarding Minecraft and MindCraft, it knows how to use commands natively without a system prompt.
	Andy-3.5 also knows how to build / use !newAction to perform commands, it was trained on lots of building, as well as, using !newAction to do tasks like manually making something or strip mining.

	**Know this is a PREVIEW model, it is NOT finished!**

	## Why a preview model?

	Andy-3.5-preview was made to test the intelligence of a Minecraft Ai with the current dataset, it was meant to see the progress of the training and what area's are needed for the future
	DO NOT expect this model to be able to do everything perfectly, it only knows as much as the dataset told it, as well as the other 2/3 of the untouched parameters allow.
	The model may experience bugs, such as not saying your name, getting previous messages confused, or other small things.

	# What models can I choose?

	There are going to be 2 (maybe 3) model sizes avaliable, Regular, Mini (And Maybe large)
	* Regular is a 7B parameter model, tuned from [Deepseek-R1 Distilled](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B)
	* Mini is a 1.5B parameter model, also tuned from [Deepseek-R1 Distilled](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B)
	* Large (Might) be a 32b parameter model, again tuned from [Deepseek-R1 Distilled](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B) - This model may not exist, *ever*

	Out of all of the models, Teensy had the largest percent of parameters tuned, being 1/2 the models total size

	## Safety and FAQ

	Q: Is this model safe to use?
	A. Yes, this model is non-volatile, and cannot generate malicous content

	Q. Can this model be used on a server?
	A. Yes, In theory and practice the model is only capable of building and performing manual tasks via newAction

	Q. Who is responsible if this model does generate malicous content?
	A. You are responsible, even though the model was never trained to be able to make malicous content, there is a *very very slight chance* it still generates malicous code.

	Q. If I make media based on this model, like photos / videos, do I have to mention the Creator?
	A. No, if you are making a post about MindCraft, and using this model, you only have to mention the creator if you mention the model being used.

	## Important notes and considerations

	The preview model of Andy-3.5, is Andy-3.5-teensy, a small model and tune with only 360 million parameters, it *"understand Minecraft"*.
	I would not recommend Andy-3.5-teensy, I felt like making a joke, and a joke was made, (The Andy-3.5-teensy model was a big hope, but it sucks, try out the q2_k model!)


	When the full versions of Andy-3.5 and Andy-3.5-mini (And possibly Andy-3.5-large) release, they will both be trained on a context length of 32,000 to ensure proper usage during playing.