Spaces:
Sleeping
Sleeping
Update README.md
Browse files
README.md
CHANGED
|
@@ -10,5 +10,30 @@ pinned: false
|
|
| 10 |
license: apache-2.0
|
| 11 |
short_description: Smarter Models, Smaller Footprint
|
| 12 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 13 |
|
| 14 |
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
|
|
|
|
|
|
|
|
|
| 10 |
license: apache-2.0
|
| 11 |
short_description: Smarter Models, Smaller Footprint
|
| 12 |
---
|
| 13 |
+
# CurvOpt-LLM — Realtime Optimizer
|
| 14 |
+
|
| 15 |
+
**Curvature-guided mixed-precision optimization for LLMs. No retraining required.**
|
| 16 |
+
|
| 17 |
+
## What This Does
|
| 18 |
+
- Loads any HuggingFace causal LM
|
| 19 |
+
- Computes Fisher diagonal curvature per layer (real gradients)
|
| 20 |
+
- Assigns FP32 / FP16 / BF16 per layer based on sensitivity
|
| 21 |
+
- Rewrites and saves a deployable optimized model (downloadable ZIP)
|
| 22 |
+
- Reports electricity, CO₂, and water footprint savings
|
| 23 |
+
|
| 24 |
+
## How to Use
|
| 25 |
+
1. Select a model from the dropdown (or enter a custom HF model ID)
|
| 26 |
+
2. Set calibration samples (1–32) and PPL tolerance
|
| 27 |
+
3. Click **Run Optimization**
|
| 28 |
+
4. Download the optimized model ZIP when done
|
| 29 |
+
|
| 30 |
+
## Supported Models
|
| 31 |
+
OPT family · GPT-2 family · Pythia · Phi · BLOOM · Mistral · Llama-2 · Qwen · Falcon · and any `AutoModelForCausalLM` compatible model.
|
| 32 |
+
|
| 33 |
+
## Research
|
| 34 |
+
Based on Fisher Information / Optimal Brain Damage curvature analysis.
|
| 35 |
+
Novel contribution: per-request curvature-gated mixed precision with user intent feedback.
|
| 36 |
|
| 37 |
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
|
| 38 |
+
|
| 39 |
+
|