Comparison of Stable Diffusion XL (SDXL) 0.9 vs 1.0 For DreamBooth Training - Surprising Results

#59
by MonsterMMORPG - opened

You can download SDXL 0.9 here: https://huggingface.co/stabilityai/stable-diffusion-xl-base-0.9/tree/main

SDXL 0.9 was the first released beta version of Stable Diffusion XL.

I trained with the Kohya SS GUI, using the config I shared here: https://www.patreon.com/posts/89213064

Video on how to use the config: https://youtu.be/EEV8RPohsbw

For training I used 15 training images (shown below), 140 repeats, and 1 epoch, for a total of 15 x 140 x 2 = 4200 steps. This takes less than 2 hours on an RTX 3090 and uses 17 GB of VRAM. The regularization images are the real, manually collected Unsplash images from here: https://www.patreon.com/posts/massive-4k-woman-87700469
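The step count above can be reproduced with a few lines of Python. This is only a sketch of the arithmetic: the function name and its parameters are mine, not Kohya's, and I am assuming batch size 1 and that the factor of 2 comes from each training image being paired with a regularization image.

```python
def total_training_steps(num_images, repeats, epochs, reg_multiplier=2, batch_size=1):
    """Rough step count for a DreamBooth run.

    reg_multiplier=2 is an assumption: with regularization images enabled,
    each training image step is matched by one regularization image step.
    batch_size=1 is also an assumption, not taken from the shared config.
    """
    return (num_images * repeats * reg_multiplier * epochs) // batch_size


# Values from this post: 15 images, 140 repeats, 1 epoch.
steps = total_training_steps(num_images=15, repeats=140, epochs=1)
print(steps)  # 4200
```

If you change the number of images or repeats, the same function gives a quick estimate of how long a run will be relative to this one.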

Exactly the same training parameters and configuration were used for both SDXL 0.9 and SDXL 1.0. For SDXL 0.9 I used the embedded VAE, and for SDXL 1.0 I used the later-released VAE, which is supposed to be the same as the SDXL 0.9 VAE.

You can download the original full-resolution (6194 x 4034 pixels), full-quality PNG images from the attachments and view their PNG info in the PNG Info tab of Automatic1111 SD Web UI. PNG info is available only for the PNG files; some PNG uploads failed, so I uploaded those images as JPG.
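If you want to compare the generation settings programmatically instead of reading them in the PNG Info tab, the settings line of each PNG info block can be split into key/value pairs. The snippet below is a minimal sketch, not the Web UI's own parser: `parse_settings` and the regular expression are my own, and the sample is a shortened copy of the settings line from Prompt 1. It assumes values containing commas are wrapped in double quotes, as the X Values field is.

```python
import re

# "Key: value" pairs separated by commas. A value is either a double-quoted
# string (which may itself contain commas) or plain text up to the next comma.
PAIR = re.compile(r'\s*(\w[\w \-/]*):\s*("(?:\\.|[^\\"])+"|[^,]*)(?:,|$)')


def parse_settings(line):
    """Turn one PNG info settings line into a dict of strings."""
    settings = {}
    for key, value in PAIR.findall(line):
        if len(value) >= 2 and value.startswith('"') and value.endswith('"'):
            value = value[1:-1]  # drop the surrounding quotes
        settings[key] = value.strip()
    return settings


# Shortened settings line from Prompt 1 (ADetailer fields omitted for brevity).
sample = (
    'Steps: 20, Sampler: DPM++ 2M SDE Karras, CFG scale: 7, Seed: 3103186800, '
    'Size: 1024x1024, Script: X/Y/Z plot, X Type: Checkpoint name, '
    r'X Values: "model\140_epoch_sdxl_0_9.safetensors [1f6557fa7c],'
    r'model\140_epoch_sdxl_1_0.safetensors [cdaf2f236f]", Version: v1.7.0'
)

settings = parse_settings(sample)
checkpoints = settings["X Values"].split(",")
print(settings["Sampler"], settings["Seed"], len(checkpoints))
```

The two entries in `checkpoints` are the SDXL 0.9 and SDXL 1.0 trained models that the X/Y/Z plot script rendered side by side with the same seed.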

Prompt 1 PNG Info:

Medium shot photo of ohwx man wearing a very expensive suit in a studio with good lightning , hd, hdr, 2k, 4k, uhd
Negative prompt: cartoon, drawing, ugly, deformed, noisy, blurry, low contrast, realistic, 3d, cgi, render, anime, blender, graphic, drawing, digital art, sketch, line art, disfigured, mutated, abstract, 2d, minimalist, vintage, distorted, glitch, manga, Blurred, Hazy
Steps: 20, Sampler: DPM++ 2M SDE Karras, CFG scale: 7, Seed: 3103186800, Size: 1024x1024, Model hash: 1f6557fa7c, Model: 140_epoch_sdxl_0_9, ADetailer model: face_yolov8n.pt, ADetailer prompt: photo of ohwx man, ADetailer confidence: 0.3, ADetailer dilate erode: 4, ADetailer mask blur: 4, ADetailer denoising strength: 0.5, ADetailer inpaint only masked: True, ADetailer inpaint padding: 32, ADetailer use separate steps: True, ADetailer steps: 70, ADetailer version: 24.1.1, Script: X/Y/Z plot, X Type: Checkpoint name, X Values: "model\140_epoch_sdxl_0_9.safetensors [1f6557fa7c],model\140_epoch_sdxl_1_0.safetensors [cdaf2f236f]", Version: v1.7.0

prompt_1_50percent.png

Prompt 2 PNG Info:

closeshot photo of ohwx man wearing a suit in a surreal outworldly garden, sunlight, hd, hdr, 2k, 4k, uhd
Negative prompt: sunglasses, cartoon, drawing, ugly, deformed, noisy, blurry, low contrast, realistic, 3d, cgi, render, anime, blender, graphic, drawing, digital art, sketch, line art, disfigured, mutated, abstract, 2d, minimalist, vintage, distorted, glitch, manga, Blurred, Hazy
Steps: 20, Sampler: DPM++ 2M SDE Karras, CFG scale: 7, Seed: 3103186800, Size: 1024x1024, Model hash: 1f6557fa7c, Model: 140_epoch_sdxl_0_9, ADetailer model: face_yolov8n.pt, ADetailer prompt: photo of ohwx man, ADetailer confidence: 0.3, ADetailer dilate erode: 4, ADetailer mask blur: 4, ADetailer denoising strength: 0.5, ADetailer inpaint only masked: True, ADetailer inpaint padding: 32, ADetailer use separate steps: True, ADetailer steps: 70, ADetailer version: 24.1.1, Script: X/Y/Z plot, X Type: Checkpoint name, X Values: "model\140_epoch_sdxl_0_9.safetensors [1f6557fa7c],model\140_epoch_sdxl_1_0.safetensors [cdaf2f236f]", Version: v1.7.0

prompt_2_50percent.png

Prompt 3 PNG Info:

cinematic photo ohwx man riding dinosaur in a jungle with mud, sunny day shiny clear sky 35mm photograph,film,professional,4k,highly detailed
Negative prompt: sunglasses, cartoon, drawing, ugly, deformed, noisy, blurry, low contrast, realistic, 3d, cgi, render, anime, blender, graphic, drawing, digital art, sketch, line art, disfigured, mutated, abstract, 2d, minimalist, vintage, distorted, glitch, manga, Blurred, Hazy
Steps: 20, Sampler: DPM++ 2M SDE Karras, CFG scale: 7, Seed: 3103186800, Size: 1024x1024, Model hash: 1f6557fa7c, Model: 140_epoch_sdxl_0_9, ADetailer model: face_yolov8n.pt, ADetailer prompt: photo of ohwx man, ADetailer confidence: 0.3, ADetailer dilate erode: 4, ADetailer mask blur: 4, ADetailer denoising strength: 0.5, ADetailer inpaint only masked: True, ADetailer inpaint padding: 32, ADetailer use separate steps: True, ADetailer steps: 70, ADetailer version: 24.1.1, Script: X/Y/Z plot, X Type: Checkpoint name, X Values: "model\140_epoch_sdxl_0_9.safetensors [1f6557fa7c],model\140_epoch_sdxl_1_0.safetensors [cdaf2f236f]", Version: v1.7.0

prompt_3_50percent.png

Prompt 4 PNG Info:

picture of (ohwx man) wearing a suit near a lake, simple flat color, 2 dimensional, flat 2d art style, cartoon
Negative prompt: photo, photograph, ugly, deformed, noisy, blurry, low contrast, realistic, distant shot, close shot, medium shot, 3d, cgi, render, studio shot, studio, shot, camera
Steps: 20, Sampler: DPM++ 2M SDE Karras, CFG scale: 7, Seed: 3103186800, Size: 1024x1024, Model hash: 1f6557fa7c, Model: 140_epoch_sdxl_0_9, ADetailer model: face_yolov8n.pt, ADetailer prompt: "picture of (ohwx man), simple flat color, 2 dimensional, flat 2d art style", ADetailer confidence: 0.3, ADetailer dilate erode: 4, ADetailer mask blur: 4, ADetailer denoising strength: 0.5, ADetailer inpaint only masked: True, ADetailer inpaint padding: 32, ADetailer use separate steps: True, ADetailer steps: 70, ADetailer version: 24.1.1, Script: X/Y/Z plot, X Type: Checkpoint name, X Values: "model\140_epoch_sdxl_0_9.safetensors [1f6557fa7c],model\140_epoch_sdxl_1_0.safetensors [cdaf2f236f]", Version: v1.7.0

prompt_4_50percent.png

Prompt 5 PNG Info:

closeshot handsome photo of (ohwx man) (in a warrior armor ) in a coliseum, hdr, canon, hd, 8k, 4k, sharp focus
Negative prompt: cartoon, drawing, ugly, deformed, noisy, blurry, low contrast, realistic, 3d, cgi, render, anime, blender, graphic, drawing, digital art, sketch, line art, disfigured, mutated, abstract, 2d, minimalist, vintage, distorted, glitch, manga, Blurred, Hazy
Steps: 20, Sampler: DPM++ 2M SDE Karras, CFG scale: 7, Seed: 129509750, Size: 1024x1024, Model hash: 1f6557fa7c, Model: 140_epoch_sdxl_0_9, ADetailer model: face_yolov8n.pt, ADetailer prompt: photo of ohwx man, ADetailer confidence: 0.3, ADetailer mask only top k largest: 1, ADetailer dilate erode: 4, ADetailer mask blur: 4, ADetailer denoising strength: 0.5, ADetailer inpaint only masked: True, ADetailer inpaint padding: 32, ADetailer use separate steps: True, ADetailer steps: 70, ADetailer version: 24.1.1, Script: X/Y/Z plot, X Type: Checkpoint name, X Values: "model\140_epoch_sdxl_0_9.safetensors [1f6557fa7c],model\140_epoch_sdxl_1_0.safetensors [cdaf2f236f]", Version: v1.7.0

prompt_5_50percent.png

Prompt 6 PNG Info:

photo of warrior ohwx man with a pet dragon , epic, cinematic, sunlight, hd, hdr, 2k, 4k, uhd
Negative prompt: cartoon, drawing, ugly, deformed, noisy, blurry, low contrast, realistic, 3d, cgi, render, anime, blender, graphic, drawing, digital art, sketch, line art, disfigured, mutated, abstract, 2d, minimalist, vintage, distorted, glitch, manga, Blurred, Hazy
Steps: 20, Sampler: DPM++ 2M SDE Karras, CFG scale: 7, Seed: 2991427470, Size: 1024x1024, Model hash: 1f6557fa7c, Model: 140_epoch_sdxl_0_9, ADetailer model: face_yolov8n.pt, ADetailer prompt: photo of ohwx man, ADetailer confidence: 0.3, ADetailer mask only top k largest: 1, ADetailer dilate erode: 4, ADetailer mask blur: 4, ADetailer denoising strength: 0.5, ADetailer inpaint only masked: True, ADetailer inpaint padding: 32, ADetailer use separate steps: True, ADetailer steps: 70, ADetailer version: 24.1.1, Script: X/Y/Z plot, X Type: Checkpoint name, X Values: "model\140_epoch_sdxl_0_9.safetensors [1f6557fa7c],model\140_epoch_sdxl_1_0.safetensors [cdaf2f236f]", Version: v1.7.0

prompt_6_50percent.png

Prompt 7 PNG Info:

handsome portrait photo of (ohwx man) wearing a space armor on a space station, hdr, canon, hd, 8k, 4k, sharp focus
Negative prompt: cartoon, drawing, ugly, deformed, noisy, blurry, low contrast, realistic, 3d, cgi, render, anime, blender, graphic, drawing, digital art, sketch, line art, disfigured, mutated, abstract, 2d, minimalist, vintage, distorted, glitch, manga, Blurred, Hazy
Steps: 20, Sampler: DPM++ 2M SDE Karras, CFG scale: 7, Seed: 2897227315, Size: 1024x1024, Model hash: 1f6557fa7c, Model: 140_epoch_sdxl_0_9, ADetailer model: face_yolov8n.pt, ADetailer prompt: photo of ohwx man, ADetailer confidence: 0.3, ADetailer mask only top k largest: 1, ADetailer dilate erode: 4, ADetailer mask blur: 4, ADetailer denoising strength: 0.5, ADetailer inpaint only masked: True, ADetailer inpaint padding: 32, ADetailer use separate steps: True, ADetailer steps: 70, ADetailer version: 24.1.1, Script: X/Y/Z plot, X Type: Checkpoint name, X Values: "model\140_epoch_sdxl_0_9.safetensors [1f6557fa7c],model\140_epoch_sdxl_1_0.safetensors [cdaf2f236f]", Version: v1.7.0

prompt_7_50percent.png
