Spaces:
Running on Zero
Running on Zero
Commit ·
7d9885d
1
Parent(s): f4a615b
UPDATE
Browse files
app.py
CHANGED
|
@@ -85,17 +85,17 @@ MARKDOWN = \
|
|
| 85 |
<a style='font-size:18px;color: #000000' href='https://cfeng16.github.io/this-and-that/'> [Project Page] </a> </div> \
|
| 86 |
</div>
|
| 87 |
|
| 88 |
-
This&That is a
|
| 89 |
|
| 90 |
-
This
|
| 91 |
-
Only VGL mode (image+language+gesture conditioned) is provided
|
| 92 |
|
| 93 |
-
### Note: The
|
| 94 |
-
### Note:
|
| 95 |
-
### Note: Click "Clear All" to
|
| 96 |
-
### Note: The first run may
|
| 97 |
|
| 98 |
-
If **This&That** is helpful, please
|
| 99 |
|
| 100 |
"""
|
| 101 |
|
|
|
|
| 85 |
<a style='font-size:18px;color: #000000' href='https://cfeng16.github.io/this-and-that/'> [Project Page] </a> </div> \
|
| 86 |
</div>
|
| 87 |
|
| 88 |
+
This&That is a robotics scenario (based on the Bridge dataset for this demo), a Language-Gesture-Image-conditioned Video Generation Model for Robot Planning.
|
| 89 |
|
| 90 |
+
This demo focuses on the Video Diffusion Model.
|
| 91 |
+
Only the VGL mode (image + language + gesture conditioned) is provided, but you can find the complete test code and all pretrained weights available.
|
| 92 |
|
| 93 |
+
### Note: The default gesture point indices are [4, 10] (5th and 11th) for two gesture points, or [4] (5th) for one gesture point.
|
| 94 |
+
### Note: Currently, the supported resolution is 256x384.
|
| 95 |
+
### Note: Click "Clear All" to reset everything, or "Undo Point" to remove the last gesture point.
|
| 96 |
+
### Note: The first run may take longer. Clicking "Clear All" before each run is the safest option.
|
| 97 |
|
| 98 |
+
If **This&That** is helpful, please star the [GitHub Repo](https://github.com/Kiteretsu77/This_and_That_VDM). Thank you!
|
| 99 |
|
| 100 |
"""
|
| 101 |
|