syedameeng committed on
Commit
5f31bde
·
verified ·
1 Parent(s): 1fa744c

Update README.md

Files changed (1)
  1. README.md +25 -0
README.md CHANGED
@@ -10,5 +10,30 @@ pinned: false
10
  license: apache-2.0
11
  short_description: Smarter Models, Smaller Footprint
12
  ---
13
 
14
  Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
 
 
10
  license: apache-2.0
11
  short_description: Smarter Models, Smaller Footprint
12
  ---
13
+ # CurvOpt-LLM — Realtime Optimizer
14
+
15
+ **Curvature-guided mixed-precision optimization for LLMs. No retraining required.**
16
+
17
+ ## What This Does
18
+ - Loads any HuggingFace causal LM
19
+ - Computes the diagonal Fisher curvature of each layer from real gradients
20
+ - Assigns FP32 / FP16 / BF16 per layer based on sensitivity
21
+ - Rewrites and saves a deployable optimized model (downloadable ZIP)
22
+ - Reports electricity, CO₂, and water footprint savings
23
+
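The curvature and precision-assignment steps listed above can be sketched roughly as follows. This is a minimal illustration, not the Space's actual code: it assumes the diagonal Fisher score of a layer is the mean squared gradient over calibration samples (the empirical Fisher), and that layers are ranked so the most sensitive stay in FP32, the least sensitive drop to FP16, and the rest use BF16. The function names, the toy gradients, and the 25% cut-offs are all hypothetical.

```python
from statistics import fmean


def fisher_diagonal_score(per_sample_grads):
    """Empirical diagonal Fisher score for one layer.

    per_sample_grads: one flat list of gradient values per calibration sample.
    Returns the mean of the squared gradients over all samples and parameters.
    """
    squares = [g * g for sample in per_sample_grads for g in sample]
    return fmean(squares)


def assign_precision(scores, fp32_frac=0.25, fp16_frac=0.25):
    """Map each layer name to a dtype label by sensitivity rank.

    Assumption (not from the README): highest scores -> fp32,
    lowest scores -> fp16, everything in between -> bf16.
    """
    ranked = sorted(scores, key=scores.get, reverse=True)
    n = len(ranked)
    n_fp32 = max(1, round(n * fp32_frac))  # always protect at least one layer
    n_fp16 = round(n * fp16_frac)
    plan = {}
    for i, name in enumerate(ranked):
        if i < n_fp32:
            plan[name] = "fp32"
        elif i >= n - n_fp16:
            plan[name] = "fp16"
        else:
            plan[name] = "bf16"
    return plan


# Hypothetical gradients: 2 calibration samples x 2 parameters per layer.
toy_grads = {
    "layer0": [[0.9, -1.1], [1.0, 0.8]],
    "layer1": [[0.2, -0.1], [0.1, 0.3]],
    "layer2": [[0.01, 0.02], [-0.02, 0.01]],
    "layer3": [[0.4, -0.5], [0.3, 0.6]],
}
scores = {name: fisher_diagonal_score(g) for name, g in toy_grads.items()}
plan = assign_precision(scores)
print(plan)
```

In a real run the gradients would come from backpropagating the language-modeling loss on the calibration samples; the ranking step itself does not depend on how they were obtained.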
24
+ ## How to Use
25
+ 1. Select a model from the dropdown (or enter a custom HF model ID)
26
+ 2. Set calibration samples (1–32) and PPL tolerance
27
+ 3. Click **Run Optimization**
28
+ 4. Download the optimized model ZIP when done
29
+
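To make the PPL tolerance setting in step 2 concrete, here is a small sketch of how such a check could work. It assumes the tolerance is a relative bound on the perplexity increase of the optimized model over the baseline; the README does not say whether the Space uses a relative or absolute bound, so treat that, the function names, and the sample values as assumptions.

```python
import math


def perplexity(token_nlls):
    """Perplexity = exp(mean per-token negative log-likelihood, natural log)."""
    return math.exp(sum(token_nlls) / len(token_nlls))


def within_tolerance(baseline_ppl, optimized_ppl, tolerance):
    """True if the optimized model's perplexity rose by at most `tolerance`.

    Assumption: tolerance is relative, e.g. 0.10 allows a 10% increase.
    """
    return optimized_ppl <= baseline_ppl * (1.0 + tolerance)


# Hypothetical per-token losses on the same calibration text.
baseline = perplexity([2.0, 2.2, 1.8])
optimized = perplexity([2.05, 2.2, 1.9])
ok = within_tolerance(baseline, optimized, tolerance=0.10)
print(ok)
```

Under this reading, a tighter tolerance keeps more layers in higher precision, and a looser one allows more aggressive down-casting.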
30
+ ## Supported Models
31
+ OPT family · GPT-2 family · Pythia · Phi · BLOOM · Mistral · Llama-2 · Qwen · Falcon · and any `AutoModelForCausalLM`-compatible model.
32
+
33
+ ## Research
34
+ Based on Fisher Information / Optimal Brain Damage curvature analysis.
35
+ Novel contribution: per-request curvature-gated mixed precision with user intent feedback.
36
 
37
  Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
38
+
39
+