BlockFFN-3B-SFT / README.md
Raincleared's picture
Update README.md
3a4eab6 verified
|
raw
history blame
481 Bytes
metadata
license: apache-2.0
language:
  - en
  - zh
pipeline_tag: text-generation

BlockFFN-3B-SFT

This is the original 3B BlockFFN checkpoint used in the paper BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity for acceleration tests. You can load and use this model simply by using AutoTokenizer and AutoModelForCausalLM.

Links: [Paper] [Codes]