Prompt Expansion for Adaptive Text-to-Image Generation
Abstract
Text-to-image generation models are powerful but difficult to use. Users craft specific prompts to get better images, though the images can be repetitive. This paper proposes a Prompt Expansion framework that helps users generate high-quality, diverse images with less effort. The Prompt Expansion model takes a text query as input and outputs a set of expanded text prompts that are optimized such that when passed to a text-to-image model, generates a wider variety of appealing images. We conduct a human evaluation study that shows that images generated through Prompt Expansion are more aesthetically pleasing and diverse than those generated by baseline methods. Overall, this paper presents a novel and effective approach to improving the text-to-image generation experience.
Community
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- NeuroPrompts: An Adaptive Framework to Optimize Prompts for Text-to-Image Generation (2023)
- Prompt-Propose-Verify: A Reliable Hand-Object-Interaction Data Generation Framework using Foundational Models (2023)
- Customization Assistant for Text-to-image Generation (2023)
- InstructBooth: Instruction-following Personalized Text-to-Image Generation (2023)
- BeautifulPrompt: Towards Automatic Prompt Engineering for Text-to-Image Synthesis (2023)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper