zR
commited on
Commit
•
84c5dee
1
Parent(s):
7cdc618
GPU memory update
Browse files- .gitignore +9 -0
- README.md +14 -12
- README_zh.md +4 -2
.gitignore
ADDED
@@ -0,0 +1,9 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
output/
|
2 |
+
*__pycache__/
|
3 |
+
samples*/
|
4 |
+
runs/
|
5 |
+
checkpoints/
|
6 |
+
master_ip
|
7 |
+
logs/
|
8 |
+
*.DS_Store
|
9 |
+
.idea
|
README.md
CHANGED
@@ -19,7 +19,7 @@ inference: false
|
|
19 |
</div>
|
20 |
<p align="center">
|
21 |
<a href="https://huggingface.co/THUDM/CogVideoX-2b/blob/main/README_zh.md">📄 中文阅读</a> |
|
22 |
-
<a href="https://github.com/THUDM/CogVideo">🌐 Github</a> |
|
23 |
<a href="#">📜 arxiv (coming soon) </a>
|
24 |
</p>
|
25 |
|
@@ -87,18 +87,20 @@ inference: false
|
|
87 |
CogVideoX is an open-source video generation model that shares the same origins as [清影](https://chatglm.cn/video).
|
88 |
The table below provides a list of the video generation models we currently offer, along with their basic information.
|
89 |
|
90 |
-
| Model Name | CogVideoX-2B (Current Repos)
|
91 |
-
|
92 |
-
| Supported Prompt Language | English
|
93 |
| GPU Memory Required for Inference | 36GB (will be optimized before the PR is merged) |
|
94 |
-
| GPU Memory Required for Fine-tuning (bs=1) |
|
95 |
-
| Prompt Length | 226 Tokens
|
96 |
-
| Video Length | 6 seconds
|
97 |
-
| Frames Per Second | 8 frames
|
98 |
-
| Resolution | 720 * 480
|
99 |
-
| Positional Embeddings | Sinusoidal
|
100 |
-
| Quantized Inference | Not Supported
|
101 |
-
| Multi-card Inference | Not Supported
|
|
|
|
|
102 |
|
103 |
## Quick Start 🤗
|
104 |
|
|
|
19 |
</div>
|
20 |
<p align="center">
|
21 |
<a href="https://huggingface.co/THUDM/CogVideoX-2b/blob/main/README_zh.md">📄 中文阅读</a> |
|
22 |
+
<a href="https://github.com/THUDM/CogVideo">🌐 Github(with PDF paper)</a> |
|
23 |
<a href="#">📜 arxiv (coming soon) </a>
|
24 |
</p>
|
25 |
|
|
|
87 |
CogVideoX is an open-source video generation model that shares the same origins as [清影](https://chatglm.cn/video).
|
88 |
The table below provides a list of the video generation models we currently offer, along with their basic information.
|
89 |
|
90 |
+
| Model Name | CogVideoX-2B (Current Repos) |
|
91 |
+
|--------------------------------------------|-----------------------------------------------|
|
92 |
+
| Supported Prompt Language | English |
|
93 |
| GPU Memory Required for Inference | 36GB (will be optimized before the PR is merged) |
|
94 |
+
| GPU Memory Required for Fine-tuning (bs=1) | 42GB |
|
95 |
+
| Prompt Length | 226 Tokens |
|
96 |
+
| Video Length | 6 seconds |
|
97 |
+
| Frames Per Second | 8 frames |
|
98 |
+
| Resolution | 720 * 480 |
|
99 |
+
| Positional Embeddings | Sinusoidal |
|
100 |
+
| Quantized Inference | Not Supported |
|
101 |
+
| Multi-card Inference | Not Supported |
|
102 |
+
|
103 |
+
**Note** Using [SAT](https://github.com/THUDM/SwissArmyTransformer) model cost 18GB for inference. Check our github.
|
104 |
|
105 |
## Quick Start 🤗
|
106 |
|
README_zh.md
CHANGED
@@ -6,7 +6,7 @@
|
|
6 |
</div>
|
7 |
<p align="center">
|
8 |
<a href="https://huggingface.co/THUDM/CogVideoX-2b/blob/main/README.md">📄 Read in English</a> |
|
9 |
-
<a href="https://github.com/THUDM/CogVideo">🌐 Github</a> |
|
10 |
<a href="#">📜 arxiv (即将发布) </a>
|
11 |
</p>
|
12 |
|
@@ -77,7 +77,7 @@ CogVideoX是 [清影](https://chatglm.cn/video) 同源的开源版本视频生
|
|
77 |
|---------------|---------------------|
|
78 |
| 提示词语言 | English |
|
79 |
| 推理显存消耗 | 36GB(会在PR合并之前优化) |
|
80 |
-
| 微调显存消耗 (bs=1) |
|
81 |
| 提示词长度上限 | 226 Tokens |
|
82 |
| 视频生成长度 | 6 seconds |
|
83 |
| 视频生成帧率 (每秒) | 8 frames |
|
@@ -86,6 +86,8 @@ CogVideoX是 [清影](https://chatglm.cn/video) 同源的开源版本视频生
|
|
86 |
| 量化 | 不支持 |
|
87 |
| 多卡推理 | 不支持 |
|
88 |
|
|
|
|
|
89 |
## 快速上手 🤗
|
90 |
|
91 |
本模型已经支持使用 huggingface 的 diffusers 库进行部署,你可以按照以下步骤进行部署。
|
|
|
6 |
</div>
|
7 |
<p align="center">
|
8 |
<a href="https://huggingface.co/THUDM/CogVideoX-2b/blob/main/README.md">📄 Read in English</a> |
|
9 |
+
<a href="https://github.com/THUDM/CogVideo">🌐 Github(包含PDF论文)</a> |
|
10 |
<a href="#">📜 arxiv (即将发布) </a>
|
11 |
</p>
|
12 |
|
|
|
77 |
|---------------|---------------------|
|
78 |
| 提示词语言 | English |
|
79 |
| 推理显存消耗 | 36GB(会在PR合并之前优化) |
|
80 |
+
| 微调显存消耗 (bs=1) | 42GB |
|
81 |
| 提示词长度上限 | 226 Tokens |
|
82 |
| 视频生成长度 | 6 seconds |
|
83 |
| 视频生成帧率 (每秒) | 8 frames |
|
|
|
86 |
| 量化 | 不支持 |
|
87 |
| 多卡推理 | 不支持 |
|
88 |
|
89 |
+
**Note** 使用 [SAT](https://github.com/THUDM/SwissArmyTransformer) 推理SAT版本模型仅需18G显存。欢迎前往我们的github查看。
|
90 |
+
|
91 |
## 快速上手 🤗
|
92 |
|
93 |
本模型已经支持使用 huggingface 的 diffusers 库进行部署,你可以按照以下步骤进行部署。
|