SparseLLM/BlockFFN-3B-SFT-EAGLE

Text Generation

Raincleared committed on Jul 14, 2025

Commit 988214f · verified · 1 Parent(s): 6ea2993

Update README.md

Files changed (1)

README.md +1 -1

@@ -11,4 +11,4 @@ pipeline_tag: text-generation
 This is the 3B BlockFFN model used in the paper *BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity* for acceleration tests.
 It is directly adaptable to the `inference` implementation of our [codes](https://github.com/thunlp/BlockFFN).
-Links: [[Paper](TODO)] [[Codes](https://github.com/thunlp/BlockFFN)]
+Links: [[Paper](https://arxiv.org/pdf/2507.08771)] [[Codes](https://github.com/thunlp/BlockFFN)]