k4d3 commited on
Commit
3e2908a
1 Parent(s): 535affd

Signed-off-by: Balazs Horvath <[email protected]>

Files changed (1) hide show
  1. README.md +10 -4
README.md CHANGED
@@ -26,6 +26,7 @@ The Yiff Toolkit is a comprehensive set of tools designed to enhance your creati
26
  - [Table of Contents](#table-of-contents)
27
  - [Dataset Tools](#dataset-tools)
28
  - [Dataset Preparation](#dataset-preparation)
 
29
  - [Grabber](#grabber)
30
  - [Manual Method](#manual-method)
31
  - [LoRA Training Guide](#lora-training-guide)
@@ -149,15 +150,17 @@ Before you begin collecting your dataset you will need to decide what you want t
149
 
150
  For now let's imagine you want to teach your model *wicerbeasts* so you can generate your VRChat avatar every night. For this we'll make good use of the furry <abbr title="image board">booru</abbr> [e621.net](https://e621.net/). There are two nice ways to download data from this site with the metadata intact, I'll start with the fastest and then I will explain how you can selectively browse around the site and get the images you like one by one.
151
 
 
 
 
 
152
  ### Grabber
153
 
154
- [Grabber](https://github.com/Bionus/imgbrd-grabber) makes your life easier when trying to compile datasets quickly.
155
 
156
  [![A screenshot of Grabber.](https://huggingface.co/k4d3/yiff_toolkit/resolve/main/static/tutorial/grabber1.png)](https://huggingface.co/k4d3/yiff_toolkit/resolve/main/static/tutorial/grabber1.png)
157
 
158
- Clicking on the `Add` button
159
- At the Download tab, `Tags` will be the column you can type in
160
- `wickerbeast solo -meme -animated order:score`
161
 
162
  You should also enable `Separate log files` for e621, this will download the metadata automatically alongside the pictures.
163
 
@@ -165,6 +168,9 @@ You should also enable `Separate log files` for e621, this will download the met
165
 
166
  For Pony I've set up the Text file content like so: `rating_%rating%, %all:separator=^, %` for other models you might want to replace `rating_%rating%` with just `%rating%`.
167
 
 
 
 
168
  ---
169
 
170
  ### Manual Method
 
26
  - [Table of Contents](#table-of-contents)
27
  - [Dataset Tools](#dataset-tools)
28
  - [Dataset Preparation](#dataset-preparation)
29
+ - [Create the `training_dir` Directory](#create-the-training_dir-directory)
30
  - [Grabber](#grabber)
31
  - [Manual Method](#manual-method)
32
  - [LoRA Training Guide](#lora-training-guide)
 
150
 
151
  For now let's imagine you want to teach your model *wicerbeasts* so you can generate your VRChat avatar every night. For this we'll make good use of the furry <abbr title="image board">booru</abbr> [e621.net](https://e621.net/). There are two nice ways to download data from this site with the metadata intact, I'll start with the fastest and then I will explain how you can selectively browse around the site and get the images you like one by one.
152
 
153
+ ### Create the `training_dir` Directory
154
+
155
+ Before
156
+
157
  ### Grabber
158
 
159
+ [Grabber](https://github.com/Bionus/imgbrd-grabber) makes your life easier when trying to compile datasets quickly from imageboards.
160
 
161
  [![A screenshot of Grabber.](https://huggingface.co/k4d3/yiff_toolkit/resolve/main/static/tutorial/grabber1.png)](https://huggingface.co/k4d3/yiff_toolkit/resolve/main/static/tutorial/grabber1.png)
162
 
163
+ Clicking on the `Add` button on the Download tab lets you add a `group` which will get downloaded, `Tags` will be the where you can type in the search parameters like you would on e621.net, so for example the string `wickerbeast solo -comic -meme -animated order:score` will search for solo wickerbeast pictures without including comics, memes, and animated posts in descending order of their scores. For training SDXL LoRAs you usually won't need more than 50 images, but you should set the solo group to `40` and add a new group with `-solo` instead of `solo` and set the `Image Limit` to `10` for it to include some images with other characters in it. This will help the model learn a lot better!
 
 
164
 
165
  You should also enable `Separate log files` for e621, this will download the metadata automatically alongside the pictures.
166
 
 
168
 
169
  For Pony I've set up the Text file content like so: `rating_%rating%, %all:separator=^, %` for other models you might want to replace `rating_%rating%` with just `%rating%`.
170
 
171
+ You should also set the `Folder` into which the images will get downloaded. Let's try to use
172
+ Now you are ready to right-click on each group and download the images.
173
+
174
  ---
175
 
176
  ### Manual Method