awoo
Signed-off-by: Balazs Horvath <[email protected]>
README.md
CHANGED
@@ -26,6 +26,7 @@ The Yiff Toolkit is a comprehensive set of tools designed to enhance your creati
|
|
26 |
- [Table of Contents](#table-of-contents)
|
27 |
- [Dataset Tools](#dataset-tools)
|
28 |
- [Dataset Preparation](#dataset-preparation)
|
29 |
- [Grabber](#grabber)
|
30 |
- [Manual Method](#manual-method)
|
31 |
- [LoRA Training Guide](#lora-training-guide)
|
@@ -149,15 +150,17 @@ Before you begin collecting your dataset you will need to decide what you want t
|
|
149 |
|
150 |
For now let's imagine you want to teach your model *wickerbeasts* so you can generate your VRChat avatar every night. For this we'll make good use of the furry <abbr title="image board">booru</abbr> [e621.net](https://e621.net/). There are two nice ways to download data from this site with the metadata intact. I'll start with the fastest, and then I will explain how you can selectively browse around the site and get the images you like one by one.
|
151 |
|
152 |
### Grabber
|
153 |
|
154 |
-
[Grabber](https://github.com/Bionus/imgbrd-grabber) makes your life easier when trying to compile datasets quickly.
|
155 |
|
156 |
[![A screenshot of Grabber.](https://huggingface.co/k4d3/yiff_toolkit/resolve/main/static/tutorial/grabber1.png)](https://huggingface.co/k4d3/yiff_toolkit/resolve/main/static/tutorial/grabber1.png)
|
157 |
|
158 |
-
Clicking on the `Add` button
|
159 |
-
At the Download tab, `Tags` will be the column you can type in
|
160 |
-
`wickerbeast solo -meme -animated order:score`
|
161 |
|
162 |
You should also enable `Separate log files` for e621; this will download the metadata automatically alongside the pictures.
|
163 |
|
@@ -165,6 +168,9 @@ You should also enable `Separate log files` for e621, this will download the met
|
|
165 |
|
166 |
For Pony I've set up the Text file content like so: `rating_%rating%, %all:separator=^, %`. For other models, you might want to replace `rating_%rating%` with just `%rating%`.
|
167 |
|
168 |
---
|
169 |
|
170 |
### Manual Method
|
|
|
26 |
- [Table of Contents](#table-of-contents)
|
27 |
- [Dataset Tools](#dataset-tools)
|
28 |
- [Dataset Preparation](#dataset-preparation)
|
29 |
+
- [Create the `training_dir` Directory](#create-the-training_dir-directory)
|
30 |
- [Grabber](#grabber)
|
31 |
- [Manual Method](#manual-method)
|
32 |
- [LoRA Training Guide](#lora-training-guide)
|
|
|
150 |
|
151 |
For now let's imagine you want to teach your model *wickerbeasts* so you can generate your VRChat avatar every night. For this we'll make good use of the furry <abbr title="image board">booru</abbr> [e621.net](https://e621.net/). There are two nice ways to download data from this site with the metadata intact. I'll start with the fastest, and then I will explain how you can selectively browse around the site and get the images you like one by one.
|
152 |
|
153 |
+
### Create the `training_dir` Directory
|
154 |
+
|
155 |
+
Before
|
156 |
+
|
157 |
### Grabber
|
158 |
|
159 |
+
[Grabber](https://github.com/Bionus/imgbrd-grabber) makes your life easier when trying to compile datasets quickly from imageboards.
|
160 |
|
161 |
[![A screenshot of Grabber.](https://huggingface.co/k4d3/yiff_toolkit/resolve/main/static/tutorial/grabber1.png)](https://huggingface.co/k4d3/yiff_toolkit/resolve/main/static/tutorial/grabber1.png)
|
162 |
|
163 |
+
Clicking the `Add` button on the Download tab lets you add a `group` to download. `Tags` is where you type in the search parameters, just like you would on e621.net. For example, the string `wickerbeast solo -comic -meme -animated order:score` searches for solo wickerbeast pictures, excludes comics, memes, and animated posts, and sorts the results in descending order of score. For training SDXL LoRAs you usually won't need more than 50 images, so set the `Image Limit` of the solo group to `40`, then add a second group that uses `-solo` instead of `solo` with an `Image Limit` of `10` to include some images with other characters in them. This will help the model learn a lot better!
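
As a rough illustration of how the two groups described above differ, the following Python sketch builds both search strings together with their image limits. The `build_query` helper and the dictionary layout are hypothetical and only meant for illustration; Grabber itself is configured through its GUI, not through this code.

```python
# Hypothetical helper: assembles an e621-style search string.
def build_query(include, exclude, order="score"):
    """Join included tags, negated tags, and an order: metatag."""
    parts = list(include) + [f"-{tag}" for tag in exclude]
    parts.append(f"order:{order}")
    return " ".join(parts)


# Exclusions shared by both groups, taken from the example query.
common_exclude = ["comic", "meme", "animated"]

groups = [
    # Solo pictures make up most of the dataset.
    {"tags": build_query(["wickerbeast", "solo"], common_exclude),
     "image_limit": 40},
    # A smaller group with other characters present (-solo).
    {"tags": build_query(["wickerbeast"], ["solo"] + common_exclude),
     "image_limit": 10},
]

for group in groups:
    print(group["image_limit"], group["tags"])
```

Each `tags` string can be pasted into the `Tags` column of its own group, with `image_limit` entered as that group's `Image Limit`.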
|
164 |
|
165 |
You should also enable `Separate log files` for e621; this will download the metadata automatically alongside the pictures.
|
166 |
|
|
|
168 |
|
169 |
For Pony I've set up the Text file content like so: `rating_%rating%, %all:separator=^, %`. For other models, you might want to replace `rating_%rating%` with just `%rating%`.
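
To make the resulting text file more concrete, here is a small Python sketch of what the template above is expected to produce. It assumes that `^,` is an escaped comma, so the tags end up joined with `, `. The `format_caption` function, the `pony` switch, and the sample rating and tags are made up for illustration and are not part of Grabber.

```python
# Hypothetical re-creation of the caption layout; not Grabber's own code.
def format_caption(rating, tags, pony=True):
    """Build a caption line: rating token first, then the tags."""
    # Pony-style captions prefix the rating with "rating_";
    # other models may want the bare rating instead.
    prefix = f"rating_{rating}" if pony else rating
    return ", ".join([prefix] + list(tags))


# Sample values only; real ratings and tags come from the downloaded post.
sample_tags = ["wickerbeast", "solo"]
print(format_caption("safe", sample_tags))
print(format_caption("safe", sample_tags, pony=False))
```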
|
170 |
|
171 |
+
You should also set the `Folder` into which the images will get downloaded. Let's try to use
|
172 |
+
Now you are ready to right-click on each group and download the images.
|
173 |
+
|
174 |
---
|
175 |
|
176 |
### Manual Method
|