Spaces:
Running
on
Zero
Running
on
Zero
Commit
·
7d9885d
1
Parent(s):
f4a615b
UPDATE
Browse files
app.py
CHANGED
@@ -85,17 +85,17 @@ MARKDOWN = \
|
|
85 |
<a style='font-size:18px;color: #000000' href='https://cfeng16.github.io/this-and-that/'> [Project Page] </a> </div> \
|
86 |
</div>
|
87 |
|
88 |
-
This&That is a
|
89 |
|
90 |
-
This
|
91 |
-
Only VGL mode (image+language+gesture conditioned) is provided
|
92 |
|
93 |
-
### Note: The
|
94 |
-
### Note:
|
95 |
-
### Note: Click "Clear All" to
|
96 |
-
### Note: The first run may
|
97 |
|
98 |
-
If **This&That** is helpful, please
|
99 |
|
100 |
"""
|
101 |
|
|
|
85 |
<a style='font-size:18px;color: #000000' href='https://cfeng16.github.io/this-and-that/'> [Project Page] </a> </div> \
|
86 |
</div>
|
87 |
|
88 |
+
This&That is a robotics scenario (based on the Bridge dataset for this demo), a Language-Gesture-Image-conditioned Video Generation Model for Robot Planning.
|
89 |
|
90 |
+
This demo focuses on the Video Diffusion Model.
|
91 |
+
Only the VGL mode (image + language + gesture conditioned) is provided, but you can find the complete test code and all pretrained weights available.
|
92 |
|
93 |
+
### Note: The default gesture point indices are [4, 10] (5th and 11th) for two gesture points, or [4] (5th) for one gesture point.
|
94 |
+
### Note: Currently, the supported resolution is 256x384.
|
95 |
+
### Note: Click "Clear All" to reset everything, or "Undo Point" to remove the last gesture point.
|
96 |
+
### Note: The first run may take longer. Clicking "Clear All" before each run is the safest option.
|
97 |
|
98 |
+
If **This&That** is helpful, please star the [GitHub Repo](https://github.com/Kiteretsu77/This_and_That_VDM). Thank you!
|
99 |
|
100 |
"""
|
101 |
|