zR commited on
Commit
84c5dee
1 Parent(s): 7cdc618

GPU memory update

Browse files
Files changed (3) hide show
  1. .gitignore +9 -0
  2. README.md +14 -12
  3. README_zh.md +4 -2
.gitignore ADDED
@@ -0,0 +1,9 @@
 
 
 
 
 
 
 
 
 
 
1
+ output/
2
+ *__pycache__/
3
+ samples*/
4
+ runs/
5
+ checkpoints/
6
+ master_ip
7
+ logs/
8
+ *.DS_Store
9
+ .idea
README.md CHANGED
@@ -19,7 +19,7 @@ inference: false
19
  </div>
20
  <p align="center">
21
  <a href="https://huggingface.co/THUDM/CogVideoX-2b/blob/main/README_zh.md">📄 中文阅读</a> |
22
- <a href="https://github.com/THUDM/CogVideo">🌐 Github</a> |
23
  <a href="#">📜 arxiv (coming soon) </a>
24
  </p>
25
 
@@ -87,18 +87,20 @@ inference: false
87
  CogVideoX is an open-source video generation model that shares the same origins as [清影](https://chatglm.cn/video).
88
  The table below provides a list of the video generation models we currently offer, along with their basic information.
89
 
90
- | Model Name | CogVideoX-2B (Current Repos) |
91
- |--------------------------------------------|--------------------------------------------------|
92
- | Supported Prompt Language | English |
93
  | GPU Memory Required for Inference | 36GB (will be optimized before the PR is merged) |
94
- | GPU Memory Required for Fine-tuning (bs=1) | 46.2GB |
95
- | Prompt Length | 226 Tokens |
96
- | Video Length | 6 seconds |
97
- | Frames Per Second | 8 frames |
98
- | Resolution | 720 * 480 |
99
- | Positional Embeddings | Sinusoidal |
100
- | Quantized Inference | Not Supported |
101
- | Multi-card Inference | Not Supported |
 
 
102
 
103
  ## Quick Start 🤗
104
 
 
19
  </div>
20
  <p align="center">
21
  <a href="https://huggingface.co/THUDM/CogVideoX-2b/blob/main/README_zh.md">📄 中文阅读</a> |
22
+ <a href="https://github.com/THUDM/CogVideo">🌐 Github(with PDF paper)</a> |
23
  <a href="#">📜 arxiv (coming soon) </a>
24
  </p>
25
 
 
87
  CogVideoX is an open-source video generation model that shares the same origins as [清影](https://chatglm.cn/video).
88
  The table below provides a list of the video generation models we currently offer, along with their basic information.
89
 
90
+ | Model Name | CogVideoX-2B (Current Repos) |
91
+ |--------------------------------------------|-----------------------------------------------|
92
+ | Supported Prompt Language | English |
93
  | GPU Memory Required for Inference | 36GB (will be optimized before the PR is merged) |
94
+ | GPU Memory Required for Fine-tuning (bs=1) | 42GB |
95
+ | Prompt Length | 226 Tokens |
96
+ | Video Length | 6 seconds |
97
+ | Frames Per Second | 8 frames |
98
+ | Resolution | 720 * 480 |
99
+ | Positional Embeddings | Sinusoidal |
100
+ | Quantized Inference | Not Supported |
101
+ | Multi-card Inference | Not Supported |
102
+
103
+ **Note** Using [SAT](https://github.com/THUDM/SwissArmyTransformer) model cost 18GB for inference. Check our github.
104
 
105
  ## Quick Start 🤗
106
 
README_zh.md CHANGED
@@ -6,7 +6,7 @@
6
  </div>
7
  <p align="center">
8
  <a href="https://huggingface.co/THUDM/CogVideoX-2b/blob/main/README.md">📄 Read in English</a> |
9
- <a href="https://github.com/THUDM/CogVideo">🌐 Github</a> |
10
  <a href="#">📜 arxiv (即将发布) </a>
11
  </p>
12
 
@@ -77,7 +77,7 @@ CogVideoX是 [清影](https://chatglm.cn/video) 同源的开源版本视频生
77
  |---------------|---------------------|
78
  | 提示词语言 | English |
79
  | 推理显存消耗 | 36GB(会在PR合并之前优化) |
80
- | 微调显存消耗 (bs=1) | 46.2GB |
81
  | 提示词长度上限 | 226 Tokens |
82
  | 视频生成长度 | 6 seconds |
83
  | 视频生成帧率 (每秒) | 8 frames |
@@ -86,6 +86,8 @@ CogVideoX是 [清影](https://chatglm.cn/video) 同源的开源版本视频生
86
  | 量化 | 不支持 |
87
  | 多卡推理 | 不支持 |
88
 
 
 
89
  ## 快速上手 🤗
90
 
91
  本模型已经支持使用 huggingface 的 diffusers 库进行部署,你可以按照以下步骤进行部署。
 
6
  </div>
7
  <p align="center">
8
  <a href="https://huggingface.co/THUDM/CogVideoX-2b/blob/main/README.md">📄 Read in English</a> |
9
+ <a href="https://github.com/THUDM/CogVideo">🌐 Github(包含PDF论文)</a> |
10
  <a href="#">📜 arxiv (即将发布) </a>
11
  </p>
12
 
 
77
  |---------------|---------------------|
78
  | 提示词语言 | English |
79
  | 推理显存消耗 | 36GB(会在PR合并之前优化) |
80
+ | 微调显存消耗 (bs=1) | 42GB |
81
  | 提示词长度上限 | 226 Tokens |
82
  | 视频生成长度 | 6 seconds |
83
  | 视频生成帧率 (每秒) | 8 frames |
 
86
  | 量化 | 不支持 |
87
  | 多卡推理 | 不支持 |
88
 
89
+ **Note** 使用 [SAT](https://github.com/THUDM/SwissArmyTransformer) 推理SAT版本模型仅需18G显存。欢迎前往我们的github查看。
90
+
91
  ## 快速上手 🤗
92
 
93
  本模型已经支持使用 huggingface 的 diffusers 库进行部署,你可以按照以下步骤进行部署。