---
license: cc-by-nc-sa-4.0
tags:
- common-canvas
- stable-diffusion
- sdxl
datasets:
- common-canvas/commoncatalog-cc-by-sa
- common-canvas/commoncatalog-cc-by
- common-canvas/commoncatalog-cc-by-nc-sa
- common-canvas/commoncatalog-cc-by-nc
language:
- en
---
# CommonCanvas-XL-NC 0.1

## Specifications
**Input:** CommonCatalog Text Captions  
**Output:** CommonCatalog Images  
**Architecture:** Stable Diffusion XL  
**Credit:** CommonCanvas, StabilityAI, mosaicML, @multimodalart, @Wauplin, @lhoestq  
**NSFW:** Yes  
**Text:** https://arxiv.org/abs/2310.16825  
**LICENSE:**
<p xmlns:cc="http://creativecommons.org/ns#" >This work is licensed under <a href="https://creativecommons.org/licenses/by-nc-sa/4.0/?ref=chooser-v1" target="_blank" rel="license noopener noreferrer" style="display:inline-block;">CC BY-NC-SA 4.0
  <img style="height:22px!important;margin-left:3px;" src="https://mirrors.creativecommons.org/presskit/icons/cc.svg?ref=chooser-v1" label="creative commons license logo" alt="creative commons license logo">
  <img style="height:22px!important;margin-left:3px;" src="https://mirrors.creativecommons.org/presskit/icons/by.svg?ref=chooser-v1" alt="terms of license logo">
  <img style="height:22px!important;margin-left:3px;vertical-align:text-bottom;" src="https://mirrors.creativecommons.org/presskit/icons/nc.svg?ref=chooser-v1" alt="non-commercial use logo">
  <img style="height:22px!important;margin-left:3px;vertical-align:text-bottom;" src="https://mirrors.creativecommons.org/presskit/icons/sa.svg?ref=chooser-v1" alt="share alike logo"></a></p>  

## Details
* training data: Flickr100M dataset
* bias: internet-connected Western countries
* limitations: text generation, complex composition, faces, non-English languages, VAE
* use: research, deployment, examination, art, education, creative use
* prohibited: commercial use
* suggested training: mosaicML, https://github.com/mosaicml/diffusion

## Citation
```
@article{gokaslan2023commoncanvas,
  title={CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images},
  author={Gokaslan, Aaron and Cooper, A Feder and Collins, Jasmine and Seguin, Landan and Jacobson, Austin and Patel, Mihir and Frankle, Jonathan and Stephenson, Cory and Kuleshov, Volodymyr},
  journal={arXiv preprint arXiv:2310.16825},
  year={2023}
}
```

## Code

```py
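import torch
from diffusers import StableDiffusionXLPipeline

# Use the GPU when available; float16 is generally only practical on CUDA.
device = "cuda" if torch.cuda.is_available() else "cpu"
dtype = torch.float16 if device == "cuda" else torch.float32

pipe = StableDiffusionXLPipeline.from_pretrained(
    "common-canvas/CommonCanvas-XL-NC",
    # Community pipeline; read more at https://huggingface.co/multimodalart/sdxl_perturbed_attention_guidance
    custom_pipeline="multimodalart/sdxl_perturbed_attention_guidance",
    torch_dtype=dtype,
).to(device)

prompt = "a cat sitting in a car seat"
image = pipe(prompt, num_inference_steps=25).images[0]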
```
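
The pipeline call above returns its results in `.images`, which for diffusers pipelines is normally a list of PIL images. A minimal sketch for writing them to disk follows; the helper name `save_images`, the `outputs` directory, and the file-name pattern are illustrative assumptions, not part of the model or of diffusers.

```py
from pathlib import Path


def save_images(images, out_dir="outputs", prefix="commoncanvas"):
    """Save each image as a numbered PNG and return the written paths.

    Hypothetical helper: `images` is any sequence of objects exposing a
    PIL-style `.save(path)` method, such as the `.images` list of a
    pipeline result.
    """
    out_path = Path(out_dir)
    out_path.mkdir(parents=True, exist_ok=True)  # create the folder if missing

    saved = []
    for index, image in enumerate(images):
        target = out_path / f"{prefix}_{index:03d}.png"
        image.save(target)
        saved.append(target)
    return saved
```

With the `pipe` and `prompt` objects from the example above, this could be called as `save_images(pipe(prompt, num_inference_steps=25).images)`.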