Skip to content

Commit 75e6a50

Browse files
committed
feedback
1 parent a5bb935 commit 75e6a50

File tree

1 file changed

+10
-0
lines changed

1 file changed

+10
-0
lines changed

docs/source/en/api/pipelines/cogvideox.md

Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -92,6 +92,8 @@ export_to_video(video, "output.mp4", fps=8)
9292

9393
Compilation is slow the first time but subsequent calls to the pipeline are faster.
9494

95+
The average inference time with torch.compile on a 80GB A100 is 76.27 seconds compared to 96.89 seconds for an uncompiled model.
96+
9597
```py
9698
import torch
9799
from diffusers import CogVideoXPipeline
@@ -170,6 +172,14 @@ export_to_video(video, "output.mp4", fps=8)
170172
- The image-to-video (I2V) checkpoints work with multiple resolutions. The width can vary from 768 to 1360, but the height must be 758. Both height and width must be divisible by 16.
171173

172174
- Both T2V and I2V checkpoints work best with 81 and 161 frames. It is recommended to export the generated video at 16fps.
175+
176+
- Refer to the table below to view memory usage when various memory-saving techniques are enabled.
177+
178+
| method | memory usage (enabled) | memory usage (disabled) |
179+
|---|---|---|
180+
| enable_model_cpu_offload | 19GB | 33GB |
181+
| enable_sequential_cpu_offload | <4GB | ~33GB (very slow inference speed) |
182+
| enable_tiling | 11GB (with enable_model_cpu_offload) | --- |
173183

174184
## CogVideoXPipeline
175185

0 commit comments

Comments
 (0)