File size: 638 Bytes
91d65b4 d5f0ecb 91d65b4 912aed1 91d65b4 b0ebb4e 91d65b4 b0ebb4e d5f0ecb b0ebb4e d5f0ecb b0ebb4e d5f0ecb b0ebb4e | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 | ---
title: Zeeb
emoji: 🎬
colorFrom: purple
colorTo: pink
sdk: gradio
sdk_version: 5.31.0
python_version: '3.11'
app_file: app.py
pinned: false
short_description: "Video-LLM - OLMo 2 + LoRA + VQ-VAE text-to-video"
---
# Zeeb — Video-LLM
Text-to-Video generation using **OLMo 2 1B Instruct** + **LoRA** + **VQ-VAE**.
## Pipeline
```
Text Prompt → LLM (constrained decoding) → Visual Tokens → VQ-VAE Decoder → Video
```
## Training Pipeline
1. Train VQ-VAE on 50K COCO images (real photos)
2. Tokenize 10K OpenVid-1M clips through VQ-VAE
3. Fine-tune OLMo 2 1B + LoRA on tokenized data
4. Push trained model to EeshaAI/zeeb
|