eeshaAI commited on
Commit
d5f0ecb
Β·
verified Β·
1 Parent(s): ecb6204

Fix README short_description

Browse files
Files changed (1) hide show
  1. README.md +26 -8
README.md CHANGED
@@ -1,17 +1,35 @@
1
  ---
2
  title: Zeeb
3
- emoji: πŸ‘
4
  colorFrom: purple
5
  colorTo: pink
6
  sdk: gradio
7
- sdk_version: 6.14.0
8
- python_version: '3.13'
9
  app_file: app.py
10
  pinned: false
11
- short_description: Image generation model powered stable diffusion 3.5
12
- hf_oauth: true
13
- hf_oauth_scopes:
14
- - inference-api
15
  ---
16
 
17
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  title: Zeeb
3
+ emoji: 🎬
4
  colorFrom: purple
5
  colorTo: pink
6
  sdk: gradio
7
+ sdk_version: 5.0.0
8
+ python_version: '3.10'
9
  app_file: app.py
10
  pinned: false
11
+ short_description: LoRA fine-tune OLMo 2 1B for video token generation
 
 
 
12
  ---
13
 
14
+ # Zeeb β€” Video-LLM Trainer
15
+
16
+ Fine-tune **OLMo 2 1B Instruct** with **LoRA (r=4)** to generate video tokens using visual tokenization.
17
+
18
+ ## Pipeline
19
+ ```
20
+ Text Prompt β†’ LLM (OLMo 2 1B + LoRA) β†’ Visual Tokens β†’ VQ-VAE Decoder β†’ Video
21
+ ```
22
+
23
+ ## How It Works
24
+ 1. Click **"Start Training"** to begin
25
+ 2. The model downloads OLMo 2 1B Instruct from HuggingFace
26
+ 3. Expands vocabulary with 1,024 visual tokens
27
+ 4. Applies LoRA (r=4) for memory-efficient fine-tuning
28
+ 5. Trains for 3 epochs on tokenized video data
29
+ 6. Merges LoRA weights and pushes to [EeshaAI/zeeb](https://huggingface.co/EeshaAI/zeeb)
30
+
31
+ ## Files
32
+ - `app.py` β€” Gradio training interface
33
+ - `train_on_hf_spaces.py` β€” Training logic (OLMo 2 1B + LoRA)
34
+ - `tokenized_dataset.json` β€” Tokenized video-text training data
35
+ - `requirements.txt` β€” Python dependencies