ykae
/

monarch-bert-base-mnli

Text Classification

monarch-matrices

hardware-efficient

Eval Results (legacy)

text-embeddings-inference

Model card Files Files and versions

ykae commited on Jan 5

Commit

01ab50d

·

verified ·

1 Parent(s): f2aedc8

Update README.md

Files changed (1) hide show

README.md +0 -1

README.md CHANGED Viewed

@@ -56,7 +56,6 @@ Standard compression and distillation often requires massive retraining. We prov
 * **Training Time:** A few hours on **1x NVIDIA H100**.
 * **Data:** Only **MNLI** + **500k Wikipedia Samples**.
-* **Math over Brute Force:** By replacing all FFNs with **Monarch Matrices** $O(N \log N)$, we reduced the mathematical complexity (GFLOPs) by **66%**.
 * **Trade-off:** This extreme compression comes with a moderate accuracy drop (~5%). *Need higher accuracy? Check out our [Hybrid Version](https://huggingface.co/ykae/monarch-bert-base-mnli-hybrid) (<1% loss).*
 ## 🚀 Key Benchmarks

 * **Training Time:** A few hours on **1x NVIDIA H100**.
 * **Data:** Only **MNLI** + **500k Wikipedia Samples**.
 * **Trade-off:** This extreme compression comes with a moderate accuracy drop (~5%). *Need higher accuracy? Check out our [Hybrid Version](https://huggingface.co/ykae/monarch-bert-base-mnli-hybrid) (<1% loss).*
 ## 🚀 Key Benchmarks