---
title: Zeeb
emoji: 🎬
colorFrom: purple
colorTo: pink
sdk: gradio
sdk_version: 5.31.0
python_version: '3.11'
app_file: app.py
pinned: false
short_description: "Video-LLM - OLMo 2 + LoRA + VQ-VAE text-to-video"
---

# Zeeb — Video-LLM

Text-to-video generation using **OLMo 2 1B Instruct** fine-tuned with **LoRA**, paired with a **VQ-VAE** whose decoder turns the generated visual tokens into video frames.

## Pipeline
```
Text Prompt → LLM (constrained decoding) → Visual Tokens → VQ-VAE Decoder → Video
```
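
The constrained decoding step can be illustrated with a minimal sketch. It assumes, purely for illustration, that visual tokens occupy a contiguous ID range appended after the text vocabulary; the sizes `TEXT_VOCAB_SIZE` and `CODEBOOK_SIZE` and both helper functions are hypothetical and not taken from this project's code. The idea is to mask every non-visual logit before choosing the next token, so the LLM can only emit IDs that the VQ-VAE decoder understands.

```python
import math

# Hypothetical sizes, chosen small for illustration only.
TEXT_VOCAB_SIZE = 8   # IDs 0..7 are ordinary text tokens
CODEBOOK_SIZE = 4     # IDs 8..11 are visual (VQ-VAE codebook) tokens


def constrain_to_visual(logits):
    """Return a copy of `logits` with every non-visual token set to -inf."""
    lo, hi = TEXT_VOCAB_SIZE, TEXT_VOCAB_SIZE + CODEBOOK_SIZE
    return [x if lo <= i < hi else -math.inf for i, x in enumerate(logits)]


def greedy_visual_token(logits):
    """Pick the highest-scoring visual token and return its codebook index."""
    masked = constrain_to_visual(logits)
    token_id = max(range(len(masked)), key=masked.__getitem__)
    return token_id - TEXT_VOCAB_SIZE


# The text token at ID 4 has the largest raw logit, but it is masked out.
logits = [5.0, 1.0, 0.0, 2.0, 9.0, 0.5, 0.1, 0.3, 0.2, 1.5, 0.7, -1.0]
print(greedy_visual_token(logits))  # 1
```

In a real generation loop the same mask would be applied at every step (for example through a logits-processing hook), with sampling in place of the greedy choice if more varied output is wanted.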

## Training Pipeline
1. Train VQ-VAE on 50K COCO images (real photos)
2. Tokenize 10K OpenVid-1M clips through VQ-VAE
3. Fine-tune OLMo 2 1B + LoRA on tokenized data
4. Push trained model to EeshaAI/zeeb
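
Step 2 relies on vector quantization: each encoder latent is replaced by the index of its nearest codebook entry, and those indices are the tokens the LLM is fine-tuned on. The sketch below shows that lookup in plain Python under simplifying assumptions; the `quantize` helper, the toy 2-D codebook, and the latents are hypothetical, and the actual VQ-VAE would produce latents with a learned convolutional encoder.

```python
def quantize(latents, codebook):
    """Map each latent vector to the index of its nearest codebook entry."""

    def sq_dist(a, b):
        # Squared Euclidean distance between two equal-length vectors.
        return sum((x - y) ** 2 for x, y in zip(a, b))

    return [
        min(range(len(codebook)), key=lambda k: sq_dist(z, codebook[k]))
        for z in latents
    ]


# Toy codebook with three 2-D entries (hypothetical values).
codebook = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
# Toy encoder outputs, one vector per spatial position.
latents = [[0.9, 0.1], [0.1, 0.8], [0.2, 0.1]]

tokens = quantize(latents, codebook)
print(tokens)  # [1, 2, 0]
```

Decoding reverses the lookup: each token index is mapped back to its codebook vector before the VQ-VAE decoder reconstructs pixels.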