metadata
library_name: diffusers
pipeline_tag: text-to-image
Warning: This model is NOT suitable for use by minors. The model can/will generate X-rated/NSFW content.
E621 Rising Stable Diffusion 2.1 Model [epoch 19]
- Guaranteed NSFW or your money back
- Fine-tuned from Stable Diffusion v2-1-base
- 19 epochs of 450,000 images each, collected from E621 and curated based on scores, favorite counts, and tag filtering.
- Trained with 5,356 tags
- 512x512px
- Compatible with 🤗 diffusers
- Compatible with stable-diffusion-webui
- Likely compatible with anything that accepts .ckpt and .yaml files
Getting Started
Examples


Example Prompt
anthro solo female standing rating:questionable
species:equine biped
two_tone_fur grey_body grey_fur white_fur white_snout white_markings gloves_marking white_tail
blue_eyes facial_markings white_hair white_mane evil_grin
athletic_female
meta:shaded
meta:digital_media_artwork
meta:detailed
meta:digital_painting_artwork
seductive looking_at_viewer
tomboy
tomb raider outfit
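The example prompt above could be run through 🤗 diffusers roughly as follows. This is a minimal sketch, not the author's own setup: `model_path` is a placeholder for wherever the model weights live (a local directory or a Hub repository ID), and the step count, guidance scale, and seed are illustrative assumptions rather than recommended settings.

```python
# Hypothetical helper names; only the diffusers/torch calls are real library APIs.
EXAMPLE_PROMPT_LINES = [
    "anthro solo female standing rating:questionable",
    "species:equine biped",
    "two_tone_fur grey_body grey_fur white_fur white_snout white_markings gloves_marking white_tail",
    "blue_eyes facial_markings white_hair white_mane evil_grin",
    "athletic_female",
    "meta:shaded",
    "meta:digital_media_artwork",
    "meta:detailed",
    "meta:digital_painting_artwork",
    "seductive looking_at_viewer",
    "tomboy",
    "tomb raider outfit",
]


def join_prompt(lines):
    """Collapse a multi-line prompt into one space-separated string."""
    return " ".join(line.strip() for line in lines if line.strip())


def generate(model_path, prompt, seed=0):
    """Render one 512x512 image. Assumes a CUDA GPU is available."""
    # Imported lazily so join_prompt works without torch/diffusers installed.
    import torch
    from diffusers import StableDiffusionPipeline

    pipe = StableDiffusionPipeline.from_pretrained(model_path, torch_dtype=torch.float16)
    pipe = pipe.to("cuda")
    generator = torch.Generator("cuda").manual_seed(seed)
    result = pipe(
        prompt,
        height=512,               # the model was trained at 512x512px
        width=512,
        num_inference_steps=30,   # assumption: illustrative value
        guidance_scale=7.5,       # assumption: illustrative value
        generator=generator,
    )
    return result.images[0]
```

Usage would look like `generate("path/to/model", join_prompt(EXAMPLE_PROMPT_LINES)).save("out.png")`, with the path replaced by the actual model location.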
Changes From E621
See a complete list of tags here.
- Symbols have been prefixed with symbol:, e.g. symbol:<3
- All categories except general have been prefixed with the category name, e.g. copyright:somename. The categories are:
  - artist
  - copyright
  - character
  - species
  - invalid
  - meta
  - lore
- Tag names are all lowercase and only contain the characters a-z, 0-9, /, and _
- A colon (:) is used to separate the category name from the tag
Additional Tags
- Image rating
  - rating:explicit
  - rating:questionable
  - rating:safe
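A rating tag can be attached to a prompt with a small helper such as the one below. It is a hypothetical convenience function, and appending the rating at the end of the prompt is an assumption; the model card does not say that tag position matters.

```python
VALID_RATINGS = ("explicit", "questionable", "safe")


def with_rating(prompt, rating):
    """Return the prompt with exactly one rating:<value> tag."""
    if rating not in VALID_RATINGS:
        raise ValueError(f"rating must be one of {VALID_RATINGS}, got {rating!r}")
    # Drop any rating tag already present so the prompt never carries two.
    tags = [tag for tag in prompt.split() if not tag.startswith("rating:")]
    return " ".join(tags + [f"rating:{rating}"])
```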
Training Procedure
- 204-272 images per batch (varies by epoch)
- 512x512px image size
- Adam optimizer
  - Beta1 = 0.9
  - Beta2 = 0.999
  - Weight decay = 1e-2
  - Epsilon = 1e-08
- Constant learning rate of 4e-6
- bf16 mixed precision
- 12 epochs of samples stretched to 512x512px (ignore aspect ratio)
- 4 epochs of samples resized to 512xH or Wx512px with center crop (maintain aspect ratio)
- 3 epochs of samples resized to < 512x512px (maintain aspect ratio)
- Tags for each sample are shuffled for each epoch, starting from epoch 16
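The three resizing strategies and the per-epoch tag shuffle can be sketched in plain Python. This is a reading of the list above, not the actual training code. The function names, the mode labels ("stretch", "crop", "fit"), the rounding behavior, the space-separated caption format, and the seeding scheme are all assumptions. In particular, the "fit" mode here also scales small images up to 512px on their longer side, which the model card does not confirm.

```python
import random


def resize_plan(width, height, mode, target=512):
    """Return ((new_width, new_height), crop_box) for one sample.

    crop_box is (left, top, right, bottom) or None when no crop is needed.
    """
    if mode == "stretch":
        # Ignore aspect ratio: force the image to target x target.
        return (target, target), None
    if mode == "crop":
        # Shorter side becomes `target`, then center-crop the longer side.
        scale = target / min(width, height)
        new_w = max(target, round(width * scale))
        new_h = max(target, round(height * scale))
        left = (new_w - target) // 2
        top = (new_h - target) // 2
        return (new_w, new_h), (left, top, left + target, top + target)
    if mode == "fit":
        # Longer side becomes `target`, so the image fits inside target x target.
        scale = target / max(width, height)
        return (max(1, round(width * scale)), max(1, round(height * scale))), None
    raise ValueError(f"unknown mode: {mode!r}")


def caption_for_epoch(tags, epoch, sample_id, shuffle_from_epoch=16):
    """Join tags into a caption, shuffling their order from epoch 16 onward."""
    tags = list(tags)
    if epoch >= shuffle_from_epoch:
        # Seeded per (epoch, sample) so each epoch sees a different but reproducible order.
        random.Random(f"{epoch}:{sample_id}").shuffle(tags)
    return " ".join(tags)
```

For a 1024x768 source image, "stretch" yields 512x512 directly, "crop" resizes to 683x512 and crops the middle 512 columns, and "fit" yields 512x384.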