Any-to-Any
Transformers
Safetensors
English
xoron
multimodal
Mixture of Experts
text-to-image
image editing
image to video
text-to-video
video editing
text-to-speech
speech-to-text
speech-to-speech
image-to-text
video-to-text
agentic
tool-use
flow-matching
3d-rope
titok
vidtok
dual-stream-attention
zero-shot-voice-cloning
bigvgan
snake-activation
multi-receptive-field-fusion
custom_code
Update README.md
Browse files
README.md
CHANGED
|
@@ -109,7 +109,7 @@ datasets:
|
|
| 109 |
</div>
|
| 110 |
|
| 111 |
<p align="center">
|
| 112 |
-
<img src="assets/
|
| 113 |
</p>
|
| 114 |
|
| 115 |
|
|
|
|
| 109 |
</div>
|
| 110 |
|
| 111 |
<p align="center">
|
| 112 |
+
<img src="assets/IMG_2970.png" alt="Training-Stage" width="200">
|
| 113 |
</p>
|
| 114 |
|
| 115 |
|