File size: 2,813 Bytes
8910580
 
 
56dd234
8910580
 
cae4580
a2605b2
 
013454c
a2605b2
 
5fd4d05
a2605b2
 
 
e1e6ece
a2605b2
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
---
pipeline_tag: text-to-image
license: other
license_name: faipl-1.0-sd
license_link: LICENSE
prior:
- Disty0/sotediffusion-wuerstchen3-alpha1
---

# SoteDiffusion Wuerstchen3

Anime finetune of Würstchen V3.  
Currently is in early state in training.  
No commercial use thanks to StabilityAI. 

# Usage
Please refer to the main model: https://huggingface.co/Disty0/sotediffusion-wuerstchen3-alpha1


## Dataset

Used the same dataset as Disty0/sote-diffusion-cascade-decoder-alpha0.  
Changed the training parameters.  
Trained with 98K~ images.  

## Training:

**GPU used for training**: 1x AMD RX 7900 XTX 24GB  

**Software used**: https://github.com/2kpr/StableCascade  

### Config:
```
experiment_id: sotediffusion-sc-b_3b
model_version: 3B
dtype: bfloat16
use_fsdp: False

batch_size: 16
grad_accum_steps: 16
updates: 6125
backup_every: 512
save_every: 256
warmup_updates: 100

lr: 1.0e-5
optimizer_type: Adafactor
adaptive_loss_weight: False
stochastic_rounding: True

image_size: 768
multi_aspect_ratio: [1/1, 1/2, 1/3, 2/3, 3/4, 1/5, 2/5, 3/5, 4/5, 1/6, 5/6, 9/16]
shift: 4

checkpoint_path: /mnt/DataSSD/AI/SoteDiffusion/StableCascade/
output_path: /mnt/DataSSD/AI/SoteDiffusion/StableCascade/
webdataset_path: file:/mnt/DataSSD/AI/anime_image_dataset/best/newest_best.tar

effnet_checkpoint_path: /mnt/DataSSD/AI/models/sd-cascade/effnet_encoder.safetensors
stage_a_checkpoint_path: /mnt/DataSSD/AI/models/sd-cascade/stage_a.safetensors
generator_checkpoint_path: /mnt/DataSSD/AI/SoteDiffusion/StableCascade/sotediffusion-sc_3b-stage_b-alpha0.safetensors
```


## Limitations and Bias

### Bias

- This model is intended for anime illustrations.  
  Realistic capabilites are not tested at all.  
- Still underbaked.  

### Limitations
- Far shot eyes are can bad.  


## License
(This part is copied directly from Animagine V3.1 and modified.)

SoteDiffusion models falls under [Fair AI Public License 1.0-SD](https://freedevproject.org/faipl-1.0-sd/) license, which is compatible with Stable Diffusion models’ license. Key points:

1. **Modification Sharing:** If you modify SoteDiffusion models, you must share both your changes and the original license.
2. **Source Code Accessibility:** If your modified version is network-accessible, provide a way (like a download link) for others to get the source code. This applies to derived models too.
3. **Distribution Terms:** Any distribution must be under this license or another with similar rules.
4. **Compliance:** Non-compliance must be fixed within 30 days to avoid license termination, emphasizing transparency and adherence to open-source values.

**Notes**: Anything not covered by Fair AI license is inherited from Stability AI Non-Commercial license which is named as LICENSE_INHERIT. Meaning, still no commercial use of any kind.