Any-to-Any
Diffusion Single File
Art
Image Generation
Image Editing
Video Generation
Vision Translation
Bridge Model
Instructions to use Yuanshi/ViBT with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Diffusion Single File
How to use Yuanshi/ViBT with Diffusion Single File:
# No code snippets available yet for this library. # To use this model, check the repository files and the library's documentation. # Want to help? PRs adding snippets are welcome at: # https://github.com/huggingface/huggingface.js
- Notebooks
- Google Colab
- Kaggle
Refine model card content and add pipeline tag (#1)
Browse files- Refine model card content and add pipeline tag (3c6131e61a638f75af086b6710e350e49ab43896)
Co-authored-by: Niels Rogge <nielsr@users.noreply.huggingface.co>
README.md
CHANGED
|
@@ -7,6 +7,7 @@ tags:
|
|
| 7 |
- Video Generation
|
| 8 |
- Vision Translation
|
| 9 |
- Bridge Model
|
|
|
|
| 10 |
---
|
| 11 |
|
| 12 |
# 🎥 ViBT: Vision Bridge Transformer at Scale
|
|
@@ -17,4 +18,4 @@ tags:
|
|
| 17 |
<a href="https://github.com/Yuanshi9815/ViBT"><img src="https://img.shields.io/badge/GitHub-Code-blue.svg?logo=github&" alt="GitHub"></a>
|
| 18 |
</div>
|
| 19 |
|
| 20 |
-
|
|
|
|
| 7 |
- Video Generation
|
| 8 |
- Vision Translation
|
| 9 |
- Bridge Model
|
| 10 |
+
pipeline_tag: any-to-any
|
| 11 |
---
|
| 12 |
|
| 13 |
# 🎥 ViBT: Vision Bridge Transformer at Scale
|
|
|
|
| 18 |
<a href="https://github.com/Yuanshi9815/ViBT"><img src="https://img.shields.io/badge/GitHub-Code-blue.svg?logo=github&" alt="GitHub"></a>
|
| 19 |
</div>
|
| 20 |
|
| 21 |
+
This repository introduces **Vision Bridge Transformer (ViBT)**, a large-scale instantiation of Brownian Bridge Models designed for efficient conditional generation. ViBT directly models the trajectory between inputs and outputs, creating an efficient data-to-data translation paradigm. The models demonstrate effectiveness for various image and video translation tasks, including instruction-based image editing and complex video translation.
|