--- license: creativeml-openrail-m base_model: "terminusresearch/pixart-900m-1024-ft-v0.6" tags: - stable-diffusion - stable-diffusion-diffusers - text-to-image - diffusers - simpletuner - full inference: true widget: - text: 'unconditional (blank prompt)' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_0_0.png - text: 'Alien planet, strange rock formations, glowing plants, bizarre creatures, surreal atmosphere' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_1_0.png - text: 'Alien marketplace, bizarre creatures, exotic goods, vibrant colors, otherworldly atmosphere' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_2_0.png - text: 'Child holding a balloon, happy expression, colorful balloons, sunny day, high detail' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_3_0.png - text: 'a 4-panel comic strip showing an orange cat saying the words ''HELP'' and ''LASAGNA''' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_4_0.png - text: 'a hand is holding a comic book with a cover that reads ''The Adventures of Superhero''' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_5_0.png - text: 'Underground cave filled with crystals, glowing lights, reflective surfaces, fantasy environment, high detail' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_6_0.png - text: 'Bustling cyberpunk bazaar, vendors, neon signs, advanced tech, crowded, high detail' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_7_0.png - text: 'Cyberpunk hacker in a dark room, neon glow, multiple screens, intense focus, high detail' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_8_0.png - text: 'a cybernetic anne of green gables with neural implant and bio mech augmentations' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_9_0.png - text: 'Post-apocalyptic cityscape, ruined buildings, overgrown vegetation, dark and gritty, high detail' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_10_0.png - text: 'Magical castle in a lush forest, glowing windows, fantasy architecture, high resolution, detailed textures' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_11_0.png - text: 'Ruins of an ancient temple in an enchanted forest, glowing runes, mystical creatures, high detail' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_12_0.png - text: 'Mystical forest, glowing plants, fairies, magical creatures, fantasy art, high detail' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_13_0.png - text: 'Magical garden with glowing flowers, fairies, serene atmosphere, detailed plants, high resolution' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_14_0.png - text: 'Whimsical garden filled with fairies, magical plants, sparkling lights, serene atmosphere, high detail' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_15_0.png - text: 'Majestic dragon soaring through the sky, detailed scales, dynamic pose, fantasy art, high resolution' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_16_0.png - text: 'Fantasy world, floating islands in the sky, waterfalls, lush vegetation, detailed landscape, high resolution' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_17_0.png - text: 'Futuristic city skyline at night, neon lights, cyberpunk style, high contrast, sharp focus' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_18_0.png - text: 'Space battle scene, starships fighting, laser beams, explosions, cosmic background' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_19_0.png - text: 'Abandoned fairground at night, eerie rides, ghostly figures, fog, dark atmosphere, high detail' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_20_0.png - text: 'Spooky haunted mansion on a hill, dark and eerie, glowing windows, ghostly atmosphere, high detail' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_21_0.png - text: 'a hardcover physics textbook that is called PHYSICS FOR DUMMIES' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_22_0.png - text: 'Epic medieval battle, knights in armor, dynamic action, detailed landscape, high resolution' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_23_0.png - text: 'Bustling medieval market with merchants, knights, and jesters, vibrant colors, detailed' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_24_0.png - text: 'Cozy medieval tavern, warm firelight, adventurers drinking, detailed interior, rustic atmosphere' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_25_0.png - text: 'Futuristic city skyline at night, neon lights, cyberpunk style, high contrast, sharp focus' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_26_0.png - text: 'Forest with neon-lit trees, glowing plants, bioluminescence, surreal atmosphere, high detail' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_27_0.png - text: 'Bright neon sign in a busy city street, ''Open 24 Hours'', bold typography, glowing lights' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_28_0.png - text: 'Vibrant neon sign, ''Bar'', bold typography, dark background, glowing lights, detailed design' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_29_0.png - text: 'Pirate ship on the high seas, stormy weather, detailed sails, dramatic waves, photorealistic' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_30_0.png - text: 'Pirate discovering a treasure chest, detailed gold coins, tropical island, dramatic lighting' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_31_0.png - text: 'a photograph of a woman experiencing a psychedelic trip. trippy, 8k, uhd, fractal' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_32_0.png - text: 'Cozy cafe on a rainy day, people sipping coffee, warm lights, reflections on wet pavement, photorealistic' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_33_0.png - text: '1980s arcade, neon lights, vintage game machines, kids playing, vibrant colors, nostalgic atmosphere' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_34_0.png - text: '1980s game room with vintage arcade machines, neon lights, vibrant colors, nostalgic feel' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_35_0.png - text: 'Robot blacksmith forging metal, sparks flying, detailed workshop, futuristic and medieval blend' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_36_0.png - text: 'Sleek robot performing a dance, futuristic theater, holographic effects, detailed, high resolution' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_37_0.png - text: 'High-tech factory where robots are assembled, detailed machinery, futuristic setting, high detail' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_38_0.png - text: 'Garden tended by robots, mechanical plants, colorful flowers, futuristic setting, high detail' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_39_0.png - text: 'Cute robotic pet, futuristic home, sleek design, detailed features, friendly and animated' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_40_0.png - text: 'cctv trail camera night time security picture of a wendigo in the woods' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_41_0.png - text: 'Astronaut exploring an alien planet, detailed landscape, futuristic suit, cosmic background' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_42_0.png - text: 'Futuristic space station orbiting a distant exoplanet, sleek design, detailed structures, cosmic backdrop' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_43_0.png - text: 'a person holding a sign that reads ''SOON''' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_44_0.png - text: 'Steampunk airship in the sky, intricate design, Victorian aesthetics, dynamic scene, high detail' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_45_0.png - text: 'Steampunk inventor in a workshop, intricate gadgets, Victorian attire, mechanical arm, goggles' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_46_0.png - text: 'Stormy ocean with towering waves, dramatic skies, detailed water, intense atmosphere, high resolution' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_47_0.png - text: 'Dramatic stormy sea, lighthouse in the distance, lightning striking, dark clouds, high detail' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_48_0.png - text: 'Graffiti artist creating a mural, vibrant colors, urban setting, dynamic action, high resolution' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_49_0.png - text: 'Urban alleyway filled with vibrant graffiti art, tags and murals, realistic textures' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_50_0.png - text: 'Urban street sign, ''Main Street'', bold typography, realistic textures, weathered look' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_51_0.png - text: 'Classic car show with vintage vehicles, vibrant colors, nostalgic atmosphere, high detail' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_52_0.png - text: 'Retro diner sign, ''Joe''s Diner'', classic 1950s design, neon lights, weathered look' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_53_0.png - text: 'Vintage store sign with elaborate typography, ''Antique Shop'', hand-painted, weathered look' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_54_0.png - text: 'A cinematic portrait photograph of a white tiger in a lush forest at twilight' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_55_0.png - text: 'A landscape photograph of a small cottage in the middle of a field of wild flowers with mountains off in the distance at sunset' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_56_0.png - text: 'A portrait photograph of a young black woman wearing a ball gown in a mansion' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_57_0.png - text: 'A photograph of a sleek and modern house interior with plants and foliage all over the place ' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_58_0.png - text: 'A photograph of a snowy forest and river from above at dusk' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_59_0.png - text: 'A macro photograph of a lady bug on the petal of a rose' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_60_0.png - text: 'A photograph of a traditional Japanese meal on top of a bamboo desk' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_61_0.png - text: 'A photograph of a small fairy house covered in mushrooms moss and flowers in a sunny forest' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_62_0.png - text: 'A cinematic landscape photograph of an organic geometric building at night time' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_63_0.png - text: 'A photograph of an abstract cake inspired off of marble and art deco' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_64_0.png - text: 'painting of a water color fart that was both silent and deadly' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_65_0.png - text: 'cleavage shot of harley quinn, fujifilm XT3 sharp focus kodak moment' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_66_0.png - text: 'a woman doing yoga, fujifilm XT3 sharp focus kodak moment' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_67_0.png - text: 'a black and white photo of a woman, dress shirt, somewhat androgenic, one model, rugged, sydney, taken with a canon eos 5d, rugged and dirty, focus on girl, boyish, brigitte, photographed, blue steel, youth, charlie immer, without makeup, uniquely beautiful, on the street, lady kima' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_68_0.png - text: 'obama with his shirt off, muscles flexing' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_69_0.png - text: 'muscle-bound obama, shirtless, flexing, fujifilm XT3 sharp focus kodak moment' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_70_0.png - text: 'donald trump as a religious icon, protestant church-goer, fujifilm XT3 sharp focus kodak moment' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_71_0.png - text: 'a stunning portrait of a shirtless, muscle-bound Justin Trudeau, Canadian Prime Minister bodybuilder, fujifilm XT3 sharp focus kodak moment' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_72_0.png - text: 'a stunning portrait of a shirtless, muscle-bound John Madden bodybuilder, fujifilm XT3 sharp focus kodak moment' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_73_0.png - text: 'a portrait of edward scissorhands looking down at his cellphone, fujifilm XT3' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_74_0.png - text: 'john cena, clown baby, fujifilm XT3, sharp focus' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_75_0.png - text: 'stunning and impossible caustics experiment, suspended liquids, amorphous liquid forms, high intensity light rays, unreal engine 5, raytracing, 4k, laser dot fields, curving light energy beams, glowing energetic caustic liquids, thousands of prismatic bubbles, quantum entangled light rays from other dimensions, negative width height, recursive dimensional portals' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_76_0.png - text: 'stunning and ((impossible)) ((caustics)) ((experiment)) suspended liquids amorphous liquid forms high intensity light rays unreal engine 5 raytracing 4k laser dot arterial flow bioluminescent ' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_77_0.png - text: 'terrified pixar child in their bedroom looking up at the ceiling as a glowing red uranium core melts through the ceiling' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_78_0.png - text: 'stunning portrait of john cusack as a twisted jester at the mardi gras carnival, epic, cinematic, 8k' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_79_0.png - text: 'stunning portrait of a beer bottle (with a label that says "LIGMA GRAVY")1.4 full of gravy, epic, cinematic, advertisement' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_80_0.png - text: 'stunning++ photographs of luchador+ wrestlers at the twisted carnival-' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_81_0.png - text: 'The unforeseen friendship: a crow and a cat share a quiet moment, upending the laws of the natural world' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_82_0.png - text: 'A breathtaking landscape of a mystical anime village surrounded by cherry blossoms at sunrise' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_83_0.png - text: 'A dramatic portrait of an anime hero poised for battle against a dystopian cityscape backdrop' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_84_0.png - text: 'A towering, battle-ready mecha robot standing amidst ruins, fujifilm XT3 sharp focus' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_85_0.png - text: 'A sumptuous anime-style feast laid out on a traditional Japanese tatami mat' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_86_0.png - text: 'A photograph capturing an epic fantasy anime scene with dragons flying over ancient castles at twilight' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_87_0.png - text: 'A neon-lit nighttime bustling anime cityscape, with vivid colors and futuristic architecture' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_88_0.png - text: 'two anime characters in a high-energy duel, swords clashing with sparks flying' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_89_0.png - text: 'A cute anime character with their adorable, mystical pet creature in a magical forest' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_90_0.png - text: 'A lively anime school scene, students in uniform bustling around in a cherry-blossom-filled courtyard' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_91_0.png - text: 'A enchanting underwater anime world, with mermaids and exotic sea creatures amidst coral reefs' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_92_0.png - text: 'A breathtaking space anime scene, with starships battling among the stars and nebulas' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_93_0.png - text: 'A photograph showcasing a cyberpunk anime street scene, neon lights reflecting off rain-slicked streets' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_94_0.png - text: 'A serene anime spirit wandering through an ethereal, mist-covered forest' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_95_0.png - text: 'A powerful lone anime samurai standing tall against a backdrop of a setting sun and ancient temples' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_96_0.png - text: 'A anime cooking showdown, chefs in a frantic battle with flames and flying ingredients' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_97_0.png - text: 'A serene anime winter landscape, a small village blanketed in snow with characters in colorful kimonos' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_98_0.png - text: 'A vibrant anime-style festival, lanterns glowing and characters in traditional attire dancing joyfully' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_99_0.png - text: 'a cute anime character named toast, holding a sign that reads SOON' parameters: negative_prompt: 'blurry, cropped, ugly' output: url: ./assets/image_100_0.png --- # pixart-900m-1024-ft-v0.7-stage2 This is a full rank finetune derived from [terminusresearch/pixart-900m-1024-ft-v0.6](https://huggingface.co/terminusresearch/pixart-900m-1024-ft-v0.6). The main validation prompt used during training was: ``` a cute anime character named toast, holding a sign that reads SOON ``` ## Validation settings - CFG: `4.0` - CFG Rescale: `0.7` - Steps: `30` - Sampler: `None` - Seed: `420420420` - Resolution: `1024x1024` Note: The validation settings are not necessarily the same as the [training settings](#training-settings). You can find some example images in the following gallery: The text encoder **was not** trained. You may reuse the base model text encoder for inference. ## Training settings - Training epochs: 9 - Training steps: 29500 - Learning rate: 1e-06 - Effective batch size: 16 - Micro-batch size: 16 - Gradient accumulation steps: 1 - Number of GPUs: 1 - Prediction type: epsilon - Rescaled betas zero SNR: False - Optimizer: AdamW, stochastic bf16 - Precision: Pure BF16 - Xformers: Enabled ## Datasets ### shutterstock - Repeats: 0 - Total number of images: 21040 - Total number of aspect buckets: 3 - Resolution: 1.0 megapixels - Cropped: True - Crop style: random - Crop aspect: random ### nijijourney - Repeats: 0 - Total number of images: 21488 - Total number of aspect buckets: 1 - Resolution: 1.0 megapixels - Cropped: True - Crop style: random - Crop aspect: square ### bg20k-1024 - Repeats: 0 - Total number of images: 89296 - Total number of aspect buckets: 1 - Resolution: 1.0 megapixels - Cropped: True - Crop style: random - Crop aspect: square ### photo-aesthetics - Repeats: 0 - Total number of images: 33120 - Total number of aspect buckets: 3 - Resolution: 1.0 megapixels - Cropped: True - Crop style: random - Crop aspect: random ### text-1mp - Repeats: 5 - Total number of images: 13184 - Total number of aspect buckets: 1 - Resolution: 1.0 megapixels - Cropped: True - Crop style: random - Crop aspect: square ### cinemamix-1mp - Repeats: 0 - Total number of images: 7376 - Total number of aspect buckets: 5 - Resolution: 1.0 megapixels - Cropped: False - Crop style: None - Crop aspect: None ## Inference ```python import torch from diffusers import DiffusionPipeline model_id = 'pixart-900m-1024-ft-v0.7-stage2' pipeline = DiffusionPipeline.from_pretrained(model_id) prompt = "a cute anime character named toast, holding a sign that reads SOON" negative_prompt = "blurry, cropped, ugly" pipeline.to('cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu') image = pipeline( prompt=prompt, negative_prompt='blurry, cropped, ugly', num_inference_steps=30, generator=torch.Generator(device='cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu').manual_seed(1641421826), width=1152, height=768, guidance_scale=4.0, guidance_rescale=0.7, ).images[0] image.save("output.png", format="PNG") ```