SparseLLM/ReluLLaMA-7B

Yixin Song committed on Dec 15, 2023

Commit 71acee4 · 1 Parent(s): f47cf05

Update README.md

Files changed (1)

README.md +5 -1

@@ -59,7 +59,11 @@ We evaluate the model on the datasets of [Open LLM Leaderboard](https://huggingf
 ### Inference Tool
-Coming soon.
+We utilize [PowerInfer](https://github.com/SJTU-IPADS/PowerInfer) for pure CPU inference, here we list the inference speed of pure CPU inference with fp16 precision.
+Dense Inference: 0.85 tokens/s
+Sparse Inference: 2.26 tokens/s
 ### License Disclaimer:
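
The two throughput figures added in this commit can be compared directly. The sketch below is only arithmetic on the numbers quoted in the diff (0.85 and 2.26 tokens/s); the variable names are illustrative and are not part of PowerInfer or the model repository. It suggests that sparse inference is roughly 2.66x faster than dense inference under the stated conditions (pure CPU, fp16).

```python
# Throughput figures copied from the README lines added in this commit.
dense_tokens_per_s = 0.85
sparse_tokens_per_s = 2.26

# Relative speedup of sparse over dense inference.
speedup = sparse_tokens_per_s / dense_tokens_per_s

# Equivalent per-token latency in seconds (reciprocal of throughput).
dense_s_per_token = 1 / dense_tokens_per_s
sparse_s_per_token = 1 / sparse_tokens_per_s

print(f"speedup: {speedup:.2f}x")
print(f"dense latency: {dense_s_per_token:.2f} s/token")
print(f"sparse latency: {sparse_s_per_token:.2f} s/token")
```

The README does not state the hardware, prompt length, or batch size, so these ratios should be read as indicative for that one setup only.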