Spaces:

eeshaAI
/

Zeeb

Sleeping

File size: 638 Bytes

91d65b4
 
d5f0ecb
91d65b4
 
 
912aed1
 
91d65b4
 
b0ebb4e
91d65b4
 
b0ebb4e
d5f0ecb
b0ebb4e
d5f0ecb
 
 
b0ebb4e
d5f0ecb
 
b0ebb4e

---
title: Zeeb
emoji: 🎬
colorFrom: purple
colorTo: pink
sdk: gradio
sdk_version: 5.31.0
python_version: '3.11'
app_file: app.py
pinned: false
short_description: "Video-LLM - OLMo 2 + LoRA + VQ-VAE text-to-video"
---

# Zeeb — Video-LLM

Text-to-Video generation using **OLMo 2 1B Instruct** + **LoRA** + **VQ-VAE**.

## Pipeline
```
Text Prompt → LLM (constrained decoding) → Visual Tokens → VQ-VAE Decoder → Video
```

## Training Pipeline
1. Train VQ-VAE on 50K COCO images (real photos)
2. Tokenize 10K OpenVid-1M clips through VQ-VAE
3. Fine-tune OLMo 2 1B + LoRA on tokenized data
4. Push trained model to EeshaAI/zeeb