---
tags:
- vae
- pytorch
---

# Microd v1.0 by MICRO DISTILLERY

This model was made with the Micro Distillery app, available at:
webxos.netlify.app/MICROD
- Model Distillation Training: Simulate GRPO optimization with VAE filtering for small LLMs (42M-345M params).
- Policy Experimentation: Test group sizes, KL penalties, and cache reuse for RLHF-like training.
- Export & Deployment: Generate deployable models for inference in various frameworks.
- Offline Usage: The PWA supports offline training simulation and exports.
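The group-relative policy optimization the app simulates can be sketched roughly as follows. This is a minimal illustration of the general GRPO recipe, not Micro Distillery's actual code: the function names, the clip range (0.8-1.2), and the KL coefficient are all assumptions made for the example.

```python
import torch

def grpo_advantages(rewards: torch.Tensor) -> torch.Tensor:
    """Group-relative advantages: normalize each completion's reward
    against the mean/std of its sampling group (last dimension)."""
    mean = rewards.mean(dim=-1, keepdim=True)
    std = rewards.std(dim=-1, keepdim=True)
    return (rewards - mean) / (std + 1e-8)

def grpo_loss(logp_new, logp_old, advantages, kl_to_ref, kl_coef=0.04):
    """Clipped policy-gradient loss plus a KL penalty toward a frozen
    reference model (the RLHF-style penalty the app lets you tune)."""
    ratio = torch.exp(logp_new - logp_old)
    clipped = torch.clamp(ratio, 0.8, 1.2)
    pg = -torch.min(ratio * advantages, clipped * advantages).mean()
    return pg + kl_coef * kl_to_ref.mean()
```

Larger group sizes give a lower-variance baseline for the advantage estimate at the cost of more sampled completions per prompt, which is exactly the trade-off the "group sizes" setting exposes.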
## Model Description
This is a distilled language model trained using Group Relative Policy Optimization (GRPO) with VAE filtering.
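VAE filtering can be illustrated roughly like this: a small VAE scores each training sample by reconstruction error, and the worst-reconstructed (most out-of-distribution) samples are dropped before distillation. `TinyVAE` and `filter_by_reconstruction` below are hypothetical sketches under that assumption, not the app's implementation.

```python
import torch
import torch.nn as nn

class TinyVAE(nn.Module):
    """Minimal VAE over fixed-size embeddings; a stand-in for whatever
    encoder the app actually trains."""
    def __init__(self, dim=64, latent=8):
        super().__init__()
        self.enc = nn.Linear(dim, latent * 2)  # outputs mu and logvar
        self.dec = nn.Linear(latent, dim)

    def forward(self, x):
        mu, logvar = self.enc(x).chunk(2, dim=-1)
        z = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)  # reparameterize
        return self.dec(z), mu, logvar

def filter_by_reconstruction(vae, embeddings, keep_fraction=0.8):
    """Keep the fraction of samples the VAE reconstructs best
    (low error ~ in-distribution)."""
    with torch.no_grad():
        recon, _, _ = vae(embeddings)
        err = ((recon - embeddings) ** 2).mean(dim=-1)
    k = max(1, int(keep_fraction * len(embeddings)))
    return embeddings[err.argsort()[:k]]
```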