Update README.md
README.md
CHANGED
@@ -11,4 +11,4 @@ pipeline_tag: text-generation
 This is the 3B BlockFFN model used in the paper *BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity* for acceleration tests.
 It is directly adaptable to the `inference` implementation of our [codes](https://github.com/thunlp/BlockFFN).
 
-Links: [[Paper](
+Links: [[Paper](https://arxiv.org/pdf/2507.08771)] [[Codes](https://github.com/thunlp/BlockFFN)]