metadata
base_model: PixArt-alpha/PixArt-Sigma-XL-2-1024-MS
library_name: diffusers
license: creativeml-openrail-m
tags:
- stable-diffusion
- stable-diffusion-diffusers
- text-to-image
- diffusers
- full
- pixart
- pixart sigma
inference: true
widget:
- text: >-
A blonde sexy girl, wearing glasses at latex shirt and a blue beanie with
a tattoo, blue and white, highly detailed, sublime, extremely beautiful,
sharp focus, refined, cinematic, intricate, elegant, dynamic, rich deep
colors, bright color, shining light, attractive, cute, pretty, background
full, epic composition, dramatic atmosphere, radiant, professional,
stunning
parameters:
negative_prompt: blurry, cropped, ugly
output:
url: ./assets/1.png
- text: >-
a wizard with a glowing staff and a glowing hat, colorful magic, dramatic
atmosphere, sharp focus, highly detailed, cinematic, original composition,
fine detail, intricate, elegant, creative, color spread, shiny, amazing,
symmetry, illuminated, inspired, pretty, attractive, artistic, dynamic
background, relaxed, professional, extremely inspirational, beautiful,
determined, cute, adorable, best
parameters:
negative_prompt: blurry, cropped, ugly
output:
url: ./assets/2.png
- text: >-
girl in modern car, intricate, elegant, highly detailed, extremely
complimentary colors, beautiful, glowing aesthetic, pretty, dramatic
light, sharp focus, perfect composition, clear artistic color, calm
professional background, precise, joyful, emotional, unique, cute, best,
gorgeous, great delicate, expressive, thought, iconic, fine, awesome,
creative, winning, charming, enhanced
parameters:
negative_prompt: blurry, cropped, ugly
output:
url: ./assets/3.png
- text: >-
A girl stands amidst scattered glass shards, surrounded by a beautifully
crafted and expansive world. The scene is depicted from a dynamic angle,
emphasizing her determined expression. The background features vast
landscapes with floating crystals and soft, glowing lights that create a
mystical and grand atmosphere.
parameters:
negative_prompt: blurry, cropped, ugly
output:
url: ./assets/ComfyUI_PixArt_00040_.png
- text: >-
A girl stands amidst scattered glass shards, surrounded by a beautifully
crafted and expansive world. The scene is depicted from a dynamic angle,
emphasizing her determined expression. The background features vast
landscapes with floating crystals and soft, glowing lights that create a
mystical and grand atmosphere.
parameters:
negative_prompt: blurry, cropped, ugly
output:
url: ./assets/ComfyUI_PixArt_00036_.png
- text: >-
A close-up shot of a beautiful girl in a serene world. She has white hair
and is blindfolded, with a calm expression. Her hands are pressed together
in a prayer pose, with fingers interlaced and palms touching. The
background is softly blurred, enhancing her ethereal presence.
parameters:
negative_prompt: blurry, cropped, ugly
output:
url: ./assets/ComfyUI_PixArt_00041_.png
SigmaJourney: PixartSigma + MidJourney v6
Inference
ComfyUI
- Download the model file transformer/diffusion_pytorch_model.safetensors and put it into ComfyUI/models/checkpoints
- Use the ExtraModels node: https://github.com/city96/ComfyUI_ExtraModels?tab=readme-ov-file#pixart
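The download step above can also be scripted. The following is a minimal sketch, assuming the huggingface_hub package is installed; the helper names (checkpoint_path, install_checkpoint) and the target file name SigmaJourney.safetensors are hypothetical choices for illustration, not something the model or ComfyUI requires.

```python
import shutil
from pathlib import Path

REPO_ID = "TensorFamily/SigmaJourney"
MODEL_FILE = "transformer/diffusion_pytorch_model.safetensors"


def checkpoint_path(comfyui_root, name="SigmaJourney.safetensors"):
    """Return where the checkpoint would live inside a ComfyUI install.

    The default file name is a hypothetical choice for this sketch.
    """
    return Path(comfyui_root) / "models" / "checkpoints" / name


def install_checkpoint(comfyui_root):
    """Download the transformer weights and copy them into ComfyUI."""
    # Imported here so checkpoint_path() works without huggingface_hub.
    from huggingface_hub import hf_hub_download

    # hf_hub_download returns the local path of the cached file.
    cached = hf_hub_download(repo_id=REPO_ID, filename=MODEL_FILE)
    target = checkpoint_path(comfyui_root)
    target.parent.mkdir(parents=True, exist_ok=True)
    shutil.copy(cached, target)
    return target


# Example (not run here): install_checkpoint("/path/to/ComfyUI")
```

Copying out of the Hugging Face cache keeps the cached download intact, so a later diffusers load of the same repository should not need to fetch the file again.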
import torch
from diffusers import DiffusionPipeline, EulerAncestralDiscreteScheduler
from diffusers.models import PixArtTransformer2DModel

model_id = "TensorFamily/SigmaJourney"
negative_prompt = "blurry, cropped, ugly"
# Pick the device once so the pipeline and the generator always agree.
device = "cuda" if torch.cuda.is_available() else "mps" if torch.backends.mps.is_available() else "cpu"

# Load the base PixArt-Sigma pipeline, then swap in the fine-tuned transformer.
pipeline = DiffusionPipeline.from_pretrained("PixArt-alpha/PixArt-Sigma-XL-2-1024-MS", torch_dtype=torch.float16)
pipeline.transformer = PixArtTransformer2DModel.from_pretrained(model_id, subfolder="transformer", torch_dtype=torch.float16)
pipeline.scheduler = EulerAncestralDiscreteScheduler.from_config(pipeline.scheduler.config)
pipeline.to(device)

prompt = "On the left, there is a red cube. On the right, there is a blue sphere. On top of the red cube is a dog. On top of the blue sphere is a cat"
image = pipeline(
    prompt=prompt,
    negative_prompt=negative_prompt,
    num_inference_steps=30,
    generator=torch.Generator(device=device).manual_seed(1641421826),
    width=1024,
    height=1024,
    guidance_scale=5.5,
).images[0]
# Save as PNG so the file contents match the .png extension.
image.save("output.png", format="PNG")