Refine model card content and add pipeline tag (#1)
Browse files- Refine model card content and add pipeline tag (3c6131e61a638f75af086b6710e350e49ab43896)
Co-authored-by: Niels Rogge <[email protected]>
README.md
CHANGED
|
@@ -7,6 +7,7 @@ tags:
|
|
| 7 |
- Video Generation
|
| 8 |
- Vision Translation
|
| 9 |
- Bridge Model
|
|
|
|
| 10 |
---
|
| 11 |
|
| 12 |
# 🎥 ViBT: Vision Bridge Transformer at Scale
|
|
@@ -17,4 +18,4 @@ tags:
|
|
| 17 |
<a href="https://github.com/Yuanshi9815/ViBT"><img src="https://img.shields.io/badge/GitHub-Code-blue.svg?logo=github&" alt="GitHub"></a>
|
| 18 |
</div>
|
| 19 |
|
| 20 |
-
|
|
|
|
| 7 |
- Video Generation
|
| 8 |
- Vision Translation
|
| 9 |
- Bridge Model
|
| 10 |
+
pipeline_tag: any-to-any
|
| 11 |
---
|
| 12 |
|
| 13 |
# 🎥 ViBT: Vision Bridge Transformer at Scale
|
|
|
|
| 18 |
<a href="https://github.com/Yuanshi9815/ViBT"><img src="https://img.shields.io/badge/GitHub-Code-blue.svg?logo=github&" alt="GitHub"></a>
|
| 19 |
</div>
|
| 20 |
|
| 21 |
+
This repository introduces **Vision Bridge Transformer (ViBT)**, a large-scale instantiation of Brownian Bridge Models designed for efficient conditional generation. ViBT directly models the trajectory between inputs and outputs, creating an efficient data-to-data translation paradigm. The models demonstrate effectiveness for various image and video translation tasks, including instruction-based image editing and complex video translation.
|