docs: publish clean model card (validation + control telemetry)
Browse files
README.md
CHANGED
|
@@ -71,10 +71,10 @@ Set your target information ratio \( S^* \), and our PI controller automatically
|
|
| 71 |
<summary><b>Training curves (details)</b></summary>
|
| 72 |
|
| 73 |
**DataBPT (bits/token)**
|
| 74 |
-

|
| 75 |
|
| 76 |
**ParamBPT (bits/token)**
|
| 77 |
-

|
| 78 |
|
| 79 |
</details>
|
| 80 |
|
|
@@ -129,4 +129,4 @@ Optimal $S^*$ scaling laws are still being discovered. We found 1.0% works for 1
|
|
| 129 |
* **SCU training code:** Apache-2.0 License ([GitHub repository](https://github.com/Hmbown/shannon-control-unit))
|
| 130 |
* **IP status:** U.S. patent pending (provisional filed September 2025)
|
| 131 |
|
| 132 |
-
> Repro tips: block size 1024, batch 1, grad-accum 4, gradient checkpointing on, `use_cache=False`.
|
|
|
|
| 71 |
<summary><b>Training curves (details)</b></summary>
|
| 72 |
|
| 73 |
**DataBPT (bits/token)**
|
| 74 |
+

|
| 75 |
|
| 76 |
**ParamBPT (bits/token)**
|
| 77 |
+

|
| 78 |
|
| 79 |
</details>
|
| 80 |
|
|
|
|
| 129 |
* **SCU training code:** Apache-2.0 License ([GitHub repository](https://github.com/Hmbown/shannon-control-unit))
|
| 130 |
* **IP status:** U.S. patent pending (provisional filed September 2025)
|
| 131 |
|
| 132 |
+
> Repro tips: block size 1024, batch 1, grad-accum 4, gradient checkpointing on, `use_cache=False`.
|