Update README.md
Browse files
README.md
CHANGED
@@ -75,6 +75,7 @@ prompt: 1girl, solo, ball, swimsuit, bikini, mole, beachball, white bikini, brea
|
|
75 |
![image7](./000092_scribble_concat.webp)
|
76 |
|
77 |
|
|
|
78 |
## How to Get Started with the Model
|
79 |
|
80 |
Use the code below to get started with the model.
|
@@ -194,3 +195,30 @@ images = pipe(
|
|
194 |
|
195 |
images[0].save(f"your image save path, png format is usually better than jpg or webp in terms of image quality but got much bigger")
|
196 |
```
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
75 |
![image7](./000092_scribble_concat.webp)
|
76 |
|
77 |
|
78 |
+
|
79 |
## How to Get Started with the Model
|
80 |
|
81 |
Use the code below to get started with the model.
|
|
|
195 |
|
196 |
images[0].save(f"your image save path, png format is usually better than jpg or webp in terms of image quality but got much bigger")
|
197 |
```
|
198 |
+
|
199 |
+
|
200 |
+
## Evaluation Data
|
201 |
+
The test data is randomly sample from popular wallpaper anime images(pixiv, nijijourney and so on), the purpose of the project is to letting everyone can draw an anime Illustration.
|
202 |
+
We select 100 images and generate text with waifu-tagger[https://huggingface.co/spaces/SmilingWolf/wd-tagger] and generate 4 images per prompt, totally 400 images generated, the images
|
203 |
+
should be 1024 * 1024 or same bucket resolution to acheive the best performance. We caculate the Laion Aesthetic Score to measure the beauty and the PerceptualSimilarity to measure the
|
204 |
+
control ability, we find the quality of images have a good consistency with the meric values. We compare our methods with other SOTA huggingface models and list the result below. We are
|
205 |
+
the models that have highest aesthectic score, and can generate visually appealing images if you prompt it properly.
|
206 |
+
|
207 |
+
## Quantitative Result
|
208 |
+
| metric | xinsir/anime-painter | lllyasviel/control_v11p_sd15_scribble |
|
209 |
+
|-------|-------|-------|-------|
|
210 |
+
| laion_aesthetic | **5.95** | 5.86 |
|
211 |
+
| perceptual similarity | **0.5171** | 0.577 |
|
212 |
+
|
213 |
+
laion_aesthetic(the higher the better)
|
214 |
+
perceptual similarity(the lower the better)
|
215 |
+
|
216 |
+
Note: The values are caculated when save in webp format, when save in png the aesthetic values will increase 0.1-0.3, but the relative relation remains unchanged.
|
217 |
+
|
218 |
+
### Conclusion
|
219 |
+
|
220 |
+
<!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
|
221 |
+
|
222 |
+
In our evaluation, the model got better aesthetic score in anime images compared with lllyasviel/control_v11p_sd15_scribble, we want to compare with other sdxl-1.0-scribble model but find nothing, The model is better in control ability when test with perception similarity due to bigger base model and complex data augmentation.
|
223 |
+
Besides, the model has lower rate to generate abnormal images which tend to include some abnormal human structure.
|
224 |
+
|