Spaces:

CiLprototype
/

CurvOpt-SmarterModels

Runtime error

syedameeng commited on Feb 26

Commit

5f31bde

verified ·

1 Parent(s): 1fa744c

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -10,5 +10,30 @@ pinned: false
 license: apache-2.0
 short_description: Smarter Models, Smaller Footprint
 ---
 Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

 license: apache-2.0
 short_description: Smarter Models, Smaller Footprint
 ---
+# CurvOpt-LLM — Realtime Optimizer
+**Curvature-guided mixed-precision optimization for LLMs. No retraining required.**
+## What This Does
+- Loads any HuggingFace causal LM
+- Computes Fisher diagonal curvature per layer (real gradients)
+- Assigns FP32 / FP16 / BF16 per layer based on sensitivity
+- Rewrites and saves a deployable optimized model (downloadable ZIP)
+- Reports electricity, CO₂, and water footprint savings
+## How to Use
+1. Select a model from the dropdown (or enter a custom HF model ID)
+2. Set calibration samples (1–32) and PPL tolerance
+3. Click **Run Optimization**
+4. Download the optimized model ZIP when done
+## Supported Models
+OPT family · GPT-2 family · Pythia · Phi · BLOOM · Mistral · Llama-2 · Qwen · Falcon · and any `AutoModelForCausalLM` compatible model.
+## Research
+Based on Fisher Information / Optimal Brain Damage curvature analysis.
+Novel contribution: per-request curvature-gated mixed precision with user intent feedback.
 Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference