webxos committed on
Commit 25c5968 · verified · 1 Parent(s): 168dfa1

Update README.md

Files changed (1): README.md +7 -7

README.md CHANGED
@@ -33,12 +33,12 @@ This model was made with the Micro Distillery app available at:
 
 webxos.netlify.app/MICROD
 
--Model Distillation Training: Simulate GRPO optimization with VAE filtering for small LLMs (42M-345M params).
--Policy Experimentation: Test group sizes, KL penalties, cache reuse for RLHF-like training.
--VAE Filtering: Apply latent space compression to improve distillation quality.
--Sandbox Testing: Execute safe Python code with feedback masking.
--Export & Deployment: Generate deployable models for inference in various frameworks.
--Offline Usage: PWA supports offline training simulation and exports.
+-Model Distillation Training: Simulate GRPO optimization with VAE filtering for small LLMs (42M-345M params).
+-Policy Experimentation: Test group sizes, KL penalties, cache reuse for RLHF-like training.
+-VAE Filtering: Apply latent space compression to improve distillation quality.
+-Sandbox Testing: Execute safe Python code with feedback masking.
+-Export & Deployment: Generate deployable models for inference in various frameworks.
+-Offline Usage: PWA supports offline training simulation and exports.
 
 <div id="app">
 <!-- TOP BAR -->
@@ -94,7 +94,7 @@ If you use this data in research, please cite:
 }
 
 
-### Using Transformers
+### EXAMPLE: Using Transformers
 ```python
 from transformers import AutoModelForCausalLM, AutoTokenizer
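The "EXAMPLE: Using Transformers" snippet in the README is truncated after its import line. A minimal sketch of how such a snippet typically continues is below; the repo id `"webxos/microd"` is a placeholder assumption (the commit does not name the checkpoint), so substitute the model's actual Hugging Face id before running.

```python
# Minimal sketch of loading a small distilled model with the Transformers
# AutoModel/AutoTokenizer API. NOTE: "webxos/microd" is a hypothetical
# placeholder repo id, not taken from the commit itself.
from transformers import AutoModelForCausalLM, AutoTokenizer


def generate(prompt: str, model_id: str = "webxos/microd") -> str:
    """Load the model and tokenizer, then generate a short continuation."""
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)

    # Tokenize the prompt into PyTorch tensors and sample up to 32 new tokens.
    inputs = tokenizer(prompt, return_tensors="pt")
    outputs = model.generate(**inputs, max_new_tokens=32)
    return tokenizer.decode(outputs[0], skip_special_tokens=True)
```

Usage would be a single call such as `generate("Distillation improves small models by")`, which downloads the checkpoint on first use and returns the decoded completion.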