hunterbown commited on
Commit
a237691
·
verified ·
1 Parent(s): f0f16e7

docs: publish clean model card (validation + control telemetry)

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -71,10 +71,10 @@ Set your target information ratio \( S^* \), and our PI controller automatically
71
  <summary><b>Training curves (details)</b></summary>
72
 
73
  **DataBPT (bits/token)**
74
- ![DataBPT curve](assets/figures/data_bpt_curve.png)
75
 
76
  **ParamBPT (bits/token)**
77
- ![ParamBPT curve](assets/figures/param_bpt_curve.png)
78
 
79
  </details>
80
 
@@ -129,4 +129,4 @@ Optimal $S^*$ scaling laws are still being discovered. We found 1.0% works for 1
129
  * **SCU training code:** Apache-2.0 License ([GitHub repository](https://github.com/Hmbown/shannon-control-unit))
130
  * **IP status:** U.S. patent pending (provisional filed September 2025)
131
 
132
- > Repro tips: block size 1024, batch 1, grad-accum 4, gradient checkpointing on, `use_cache=False`.
 
71
  <summary><b>Training curves (details)</b></summary>
72
 
73
  **DataBPT (bits/token)**
74
+ ![DataBPT curve](https://raw.githubusercontent.com/Hmbown/shannon-control-unit/main/assets/figures/data_bpt_curve.png)
75
 
76
  **ParamBPT (bits/token)**
77
+ ![ParamBPT curve](https://raw.githubusercontent.com/Hmbown/shannon-control-unit/main/assets/figures/param_bpt_curve.png)
78
 
79
  </details>
80
 
 
129
  * **SCU training code:** Apache-2.0 License ([GitHub repository](https://github.com/Hmbown/shannon-control-unit))
130
  * **IP status:** U.S. patent pending (provisional filed September 2025)
131
 
132
+ > Repro tips: block size 1024, batch 1, grad-accum 4, gradient checkpointing on, `use_cache=False`.