sd21-e621-rising-v1 / README.md
hearmeneigh's picture
Update README.md
01ecac8
|
raw
history blame
3.01 kB
metadata
library_name: diffusers
pipeline_tag: text-to-image

Warning: THIS model is NOT suitable for use by minors. The model can/will generate X-rated/NFSW content.

E621 Rising Stable Diffusion 2.1 Model [epoch 19]

  • Guaranteed NSFW or your money back
  • Fine-tuned from Stable Diffusion v2-1-base
  • 19 epochs of 450,000 images each, collected from E621 and curated based on scores, favorite counts, and tag filtering.
  • Trained with 5,356 tags
  • 512x512px
  • Compatible with 🤗 diffusers
  • Compatible with stable-diffusion-webui
  • Likely compatible with anything that accepts .ckpt and .yaml files

Getting Started

Examples

Example Prompt

anthro solo female standing rating:questionable

species:equine biped
two_tone_fur grey_body grey_fur white_fur white_snout white_markings gloves_marking white_tail
blue_eyes facial_markings white_hair white_mane evil_grin 
athletic_female

meta:shaded
meta:digital_media_artwork
meta:detailed
meta:digital_painting_artwork

seductive looking_at_viewer
tomboy
tomb raider outfit

Changes From E621

See a complete list of tags here.

  • Symbols have been prefixed with symbol:, e.g. symbol:<3
  • All categories except general have been prefixed with the category name, e.g. copyright:somename. The categories are:
    • artist
    • copyright
    • character
    • species
    • invalid
    • meta
    • lore
  • Tag names are all lowercase and only contain a-z, 0-9, /, and _ letters
  • : is used to separate the category name from the tag

Additional Tags

  • Image rating
    • rating:explicit
    • rating:questionable
    • rating:safe

Training Procedure

  • 204-272 images per batch (epoch variant)
  • 512x512px image size
  • Adam optimizer
    • Beta1 = 0.9
    • Beta2 = 0.999
    • Weight decay = 1e-2
    • Epsilon = 1e-08
  • Constant learning rate 4e-6
  • bf16 mixed precision
  • 12 epochs of samples stretched to 512x512px (ignore aspect ratio)
  • 4 epochs of samples resized to 512xH or Wx512px with center crop (maintain aspect ratio)
  • 3 epochs of samples resized to < 512x512px (maintain aspect ratio)
  • Tags for each sample are shuffled for each epoch, starting from epoch 16