Fix README short_description
Browse files
README.md
CHANGED
|
@@ -1,17 +1,35 @@
|
|
| 1 |
---
|
| 2 |
title: Zeeb
|
| 3 |
-
emoji:
|
| 4 |
colorFrom: purple
|
| 5 |
colorTo: pink
|
| 6 |
sdk: gradio
|
| 7 |
-
sdk_version:
|
| 8 |
-
python_version: '3.
|
| 9 |
app_file: app.py
|
| 10 |
pinned: false
|
| 11 |
-
short_description:
|
| 12 |
-
hf_oauth: true
|
| 13 |
-
hf_oauth_scopes:
|
| 14 |
-
- inference-api
|
| 15 |
---
|
| 16 |
|
| 17 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
title: Zeeb
|
| 3 |
+
emoji: π¬
|
| 4 |
colorFrom: purple
|
| 5 |
colorTo: pink
|
| 6 |
sdk: gradio
|
| 7 |
+
sdk_version: 5.0.0
|
| 8 |
+
python_version: '3.10'
|
| 9 |
app_file: app.py
|
| 10 |
pinned: false
|
| 11 |
+
short_description: LoRA fine-tune OLMo 2 1B for video token generation
|
|
|
|
|
|
|
|
|
|
| 12 |
---
|
| 13 |
|
| 14 |
+
# Zeeb β Video-LLM Trainer
|
| 15 |
+
|
| 16 |
+
Fine-tune **OLMo 2 1B Instruct** with **LoRA (r=4)** to generate video tokens using visual tokenization.
|
| 17 |
+
|
| 18 |
+
## Pipeline
|
| 19 |
+
```
|
| 20 |
+
Text Prompt β LLM (OLMo 2 1B + LoRA) β Visual Tokens β VQ-VAE Decoder β Video
|
| 21 |
+
```
|
| 22 |
+
|
| 23 |
+
## How It Works
|
| 24 |
+
1. Click **"Start Training"** to begin
|
| 25 |
+
2. The model downloads OLMo 2 1B Instruct from HuggingFace
|
| 26 |
+
3. Expands vocabulary with 1,024 visual tokens
|
| 27 |
+
4. Applies LoRA (r=4) for memory-efficient fine-tuning
|
| 28 |
+
5. Trains for 3 epochs on tokenized video data
|
| 29 |
+
6. Merges LoRA weights and pushes to [EeshaAI/zeeb](https://huggingface.co/EeshaAI/zeeb)
|
| 30 |
+
|
| 31 |
+
## Files
|
| 32 |
+
- `app.py` β Gradio training interface
|
| 33 |
+
- `train_on_hf_spaces.py` β Training logic (OLMo 2 1B + LoRA)
|
| 34 |
+
- `tokenized_dataset.json` β Tokenized video-text training data
|
| 35 |
+
- `requirements.txt` β Python dependencies
|