File size: 8,336 Bytes
afa09d1 c699101 94ec711 12a76ac afa09d1 c699101 94ec711 c699101 a076d34 c699101 a076d34 530c208 c699101 a076d34 c699101 a076d34 c699101 a076d34 c699101 a076d34 c699101 717befe c699101 a076d34 c699101 b172224 c699101 9d15ea3 530c208 f1f2df7 530c208 c699101 9f3ec14 61b6324 c699101 a7135ff 11b6794 08ea1aa 11b6794 a7135ff 763ceb8 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 |
---
license: openrail++
tags:
- text-to-image
- stable-diffusion
- diffusers
widget:
- text: 1boy, male focus, holding drink, holding, drink, toned male, toned, pectorals, jacket, open jacket, open clothes, tank top, chain necklace, necklace, stud earrings, earrings, jewelry, cafe, plant, indoors, lens flare, solo, looking at viewer, open mouth, fang, white hair, yellow eyes, short hair, best quality, amazing quality, best aesthetic, absurdres, year 2023
parameters:
negative_prompt: lowres, bad, text, error, missing, extra, fewer, cropped, jpeg artifacts, worst quality, bad quality, watermark, bad aesthetic, unfinished, chromatic aberration, scan, scan artifacts, 1girl, breasts
output:
url: images/sample01.png
example_title: sample01
- text: 1boy, male focus, bishounen, holding sword, holding weapon, katana, sword, japanese clothes, haori, east asian architecture, solo, looking at viewer, expressionless, blue hair, purple eyes, long hair, best quality, amazing quality, best aesthetic, absurdres
parameters:
negative_prompt: lowres, bad, text, error, missing, extra, fewer, cropped, jpeg artifacts, worst quality, bad quality, watermark, bad aesthetic, unfinished, chromatic aberration, scan, scan artifacts
output:
url: images/sample02.png
example_title: sample02
- text: 1boy, male focus, sky, star (sky), night, pointing up, night sky, hood down, starry sky, hood, blue theme, outdoors, long sleeves, shooting star, hoodie, short hair, jacket, scenery, cloud, from behind, blue eyes, best quality, amazing quality, best aesthetic, absurdres
parameters:
negative_prompt: lowres, bad, text, error, missing, extra, fewer, cropped, jpeg artifacts, worst quality, bad quality, watermark, bad aesthetic, unfinished, chromatic aberration, scan, scan artifacts
output:
url: images/sample03.png
example_title: sample03
- text: 2boys, male focus, multiple boys, yaoi, couple, princess carry, carrying, shirt, pants, looking at another, smile, indoors, best quality, amazing quality, best aesthetic, absurdres
parameters:
negative_prompt: lowres, bad, text, error, missing, extra, fewer, cropped, jpeg artifacts, worst quality, bad quality, watermark, bad aesthetic, unfinished, chromatic aberration, scan, scan artifacts
output:
url: images/sample04.png
example_title: sample04
- text: 1boy, male focus, dark-skinned male, dark skin, squatting, heart hands, bara, wooden floor, floor, indoors, gym uniform, sneakers, shoes, solo, looking at viewer, frown, sweatdrop, very short hair, best quality, amazing quality, best aesthetic, absurdres, year 2023
parameters:
negative_prompt: lowres, bad, text, error, missing, extra, fewer, cropped, jpeg artifacts, worst quality, bad quality, watermark, bad aesthetic, unfinished, chromatic aberration, scan, scan artifacts
output:
url: images/sample05.png
example_title: sample05
- text: 1boy, male focus, short hair, blue hair, blue eyes, graphic t-shirt, punk t-shirt, digital illustration, cyan and black, looking at viewer, busy city street, belt, black pants, atmospheric lighting, midriff peek, night, blurry, best quality, amazing quality, best aesthetic, absurdres
parameters:
negative_prompt: lowres, bad, text, error, missing, extra, fewer, cropped, jpeg artifacts, worst quality, bad quality, watermark, bad aesthetic, unfinished, chromatic aberration, scan, scan artifacts
output:
url: images/sample06.png
example_title: sample06
---
# AnimeBoysXL v3.0
**It takes substantial time and efforts to bake models. If you appreciate my models, I would be grateful if you could support me on [Ko-fi](https://ko-fi.com/koolchh) ☕.**
<Gallery />
## Features
- ✔️ **Good for inference**: AnimeBoysXL v3.0 is a flexible model which is good at generating images of anime boys and males-only content in a wide range of styles.
- ✔️ **Good for training**: AnimeBoysXL v3.0 is suitable for further training, thanks to its neutral style and ability to recognize a great deal of concepts. Feel free to train your own anime boy model/LoRA from AnimeBoysXL.
## Inference Guide
- **Prompt**: Use tag-based prompts to describe your subject.
- Tag ordering matters. It is highly recommended to structure your prompt with the following templates:
```
1boy, male focus, character name, series name, anything else you'd like to describe, best quality, amazing quality, best aesthetic, absurdres
```
```
2boys, male focus, multiple boys, character name(s), series name, anything else you'd like to describe, best quality, amazing quality, best aesthetic, absurdres
```
- **Negative prompt**: Choose from one of the following two presets.
1. Heavy (*recommended*):
```
lowres, bad, text, error, missing, extra, fewer, cropped, jpeg artifacts, worst quality, bad quality, watermark, bad aesthetic, unfinished, chromatic aberration, scan, scan artifacts
```
2. Light:
```
lowres, jpeg artifacts, worst quality, watermark, blurry, bad aesthetic
```
- **VAE**: Make sure you're using [SDXL VAE](https://huggingface.co/stabilityai/sdxl-vae/tree/main).
- **Sampling method, sampling steps and CFG scale**: I find **(Euler a, 28, 8.5)** good. You are encouraged to experiment with other settings.
- **Width and height**: **832*1216** for portrait, **1024*1024** for square, and **1216*832** for landscape.
## 🧨Diffusers Example Usage
```python
import torch
from diffusers import DiffusionPipeline
pipe = DiffusionPipeline.from_pretrained("Koolchh/AnimeBoysXL-v3.0", torch_dtype=torch.float16, use_safetensors=True, variant="fp16")
pipe.to("cuda")
prompt = "1boy, male focus, shirt, solo, looking at viewer, smile, black hair, brown eyes, short hair, best quality, amazing quality, best aesthetic, absurdres"
negative_prompt = "lowres, bad, text, error, missing, extra, fewer, cropped, jpeg artifacts, worst quality, bad quality, watermark, bad aesthetic, unfinished, chromatic aberration, scan, scan artifacts"
image = pipe(
prompt=prompt,
negative_prompt=negative_prompt,
width=1024,
height=1024,
guidance_scale=8.5,
num_inference_steps=28
).images[0]
```
## Training Details
AnimeBoysXL v3.0 is trained from [Pony Diffusion V6 XL](https://civitai.com/models/257749/pony-diffusion-v6-xl), on ~516k images.
The following tags are attached to the training data to make it easier to steer toward either more aesthetic or more flexible results.
### Quality tags
| tag | score |
|-------------------|-----------|
| `best quality` | >= 150 |
| `amazing quality` | [75, 150) |
| `great quality` | [25, 75) |
| `normal quality` | [0, 25) |
| `bad quality` | (-5, 0) |
| `worst quality` | <= -5 |
### Aesthetic tags
The aesthetic tags of AnimeBoysXL v3.0 reflect my aesthetic preference.
| tag |
|---------------------|
| `best aesthetic` |
| `amazing aesthetic` |
| `great aesthetic` |
| `normal aesthetic` |
| `bad aesthetic` |
### Rating tags
| tag | rating |
|-----------------|--------------|
| `sfw` | general |
| `slightly nsfw` | sensitive |
| `fairly nsfw` | questionable |
| `very nsfw` | explicit |
### Year tags
`year YYYY` where `YYYY` is in the range of [2005, 2023].
### Training configurations
- Hardware: 4 * Nvidia A100 80GB GPUs
- Optimizer: AdaFactor
- Gradient accumulation steps: 8
- Batch size: 4 * 8 * 4 = 128
- Learning rates:
- 8e-6 for U-Net
- 5.2e-6 for text encoder 1 (CLIP ViT-L)
- 4.8e-6 for text encoder 2 (OpenCLIP ViT-bigG)
- Learning rate schedule: constant with 250 warmup steps
- Mixed precision training type: FP16
- Epochs: 40
### Changes from v2.0
- Change the base model from Stable Diffusion XL Base 1.0 to Pony Diffusion V6 XL.
- Revamp the dataset's aesthetic tags based on the developer's preference.
- Update the criterion of quality tags.
- Use FP16 mixed-precision training.
- Train for more epochs.
## Special thanks
**chefFromSpace** for his assistance with the showcase images.
## License
Since AnimeBoysXL v3.0 is a derivative model of [Pony Diffusion V6 XL](https://civitai.com/models/257749/pony-diffusion-v6-xl) by PurpleSmartAI, it has a different license from the previous versions. Please read their license before using the model. |