---
license: mit
---
<div align="center">
# StableV2V: Stablizing Shape Consistency in Video-to-Video Editing
Chang Liu, Rui Li, Kaidong Zhang, Yunwei Lan, Dong Liu
[[`Paper`]](https://arxiv.org/abs/2411.11045) / [[`Project`]](https://alonzoleeeooo.github.io/StableV2V/) / [[`GitHub`]](https://github.com/AlonzoLeeeooo/StableV2V) / [[`DAVIS-Edit`]](https://huggingface.co/datasets/AlonzoLeeeooo/DAVIS-Edit)
</div>
Official pre-trained model weights for the paper "StableV2V: Stablizing Shape Consistency in Video-to-Video Editing".
# Model Weights Structure
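The model weights are organized in the structure below. Each entry is annotated with the StableV2V component that requires it: the Prompted First-frame Editor (PFE), the Iterative Shape Aligner (ISA), or the Conditional Image-to-video Generator (CIG).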
```
StableV2V
├── controlnet-depth                <----- ControlNet (depth), required by CIG
├── controlnet-scribble             <----- ControlNet (scribble), required by sketch-based editing applications
├── ctrl-adapter-i2vgenxl-depth     <----- Ctrl-Adapter (I2VGen-XL, depth), required by CIG
├── i2vgenxl                        <----- I2VGen-XL, required by CIG
├── instruct-pix2pix                <----- InstructPix2Pix, required by PFE
├── paint-by-example                <----- Paint-by-Example, required by PFE
├── stable-diffusion-v1-5-inpaint   <----- SD Inpaint, required by PFE
├── stable-diffusion-v1.5           <----- SD v1.5, required by CIG
├── 50000.ckpt                      <----- Shape-guided depth refinement network, required by ISA
├── README.md
├── dpt_swin2_large_384.pt          <----- MiDaS, required by ISA
├── raft-things.pth                 <----- RAFT, required by ISA
└── u2net.pth                       <----- U2-net, required by ISA
```
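As a convenience, the layout above can be checked after downloading. The following is a minimal sketch, not part of the official code base: the helper names `missing_entries` and `download_weights` are hypothetical, and the repository id `AlonzoLeeeooo/StableV2V` is an assumption inferred from the links at the top of this page. The download helper relies on `snapshot_download` from the `huggingface_hub` package.

```python
from pathlib import Path

# Entries copied from the directory tree above (README.md is left out).
EXPECTED_DIRS = [
    "controlnet-depth",
    "controlnet-scribble",
    "ctrl-adapter-i2vgenxl-depth",
    "i2vgenxl",
    "instruct-pix2pix",
    "paint-by-example",
    "stable-diffusion-v1-5-inpaint",
    "stable-diffusion-v1.5",
]
EXPECTED_FILES = [
    "50000.ckpt",
    "dpt_swin2_large_384.pt",
    "raft-things.pth",
    "u2net.pth",
]


def missing_entries(root):
    """Return the expected sub-directories and files that are absent under `root`."""
    root = Path(root)
    missing = [d for d in EXPECTED_DIRS if not (root / d).is_dir()]
    missing += [f for f in EXPECTED_FILES if not (root / f).is_file()]
    return missing


def download_weights(local_dir="StableV2V"):
    """Download the full weights repository (hypothetical helper, needs network access)."""
    # Imported lazily so the structure check works without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    # Assumption: the model repository id is "AlonzoLeeeooo/StableV2V".
    return snapshot_download(repo_id="AlonzoLeeeooo/StableV2V", local_dir=local_dir)


if __name__ == "__main__":
    import sys

    # Check an existing local copy; pass the folder as the first argument.
    root = sys.argv[1] if len(sys.argv) > 1 else "StableV2V"
    missing = missing_entries(root)
    print("All expected entries found." if not missing else f"Missing entries: {missing}")
```

Running the script with the path of a local copy lists any entries that are absent. Applications that do not use sketch-based editing may not need `controlnet-scribble`, so a report listing only that folder is not necessarily a problem.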