SparseLLM
/

relu-100B

Text Generation

text-generation-inference

Model card Files Files and versions

Yixin Song commited on Feb 7, 2024

Commit

a3aef6d

·

verified ·

1 Parent(s): 9e22b0c

Update README.md

Files changed (1) hide show

README.md +3 -5

README.md CHANGED Viewed

@@ -43,13 +43,11 @@ We pretrain the model on  100 billion tokens, including:
 Please kindly cite using the following BibTeX:
 ```bibtex
-@misc{zhang2024relu2,
       title={ReLU$^2$ Wins: Discovering Efficient Activation Functions for Sparse LLMs},
       author={Zhengyan Zhang and Yixin Song and Guanghui Yu and Xu Han and Yankai Lin and Chaojun Xiao and Chenyang Song and Zhiyuan Liu and Zeyu Mi and Maosong Sun},
-      year={2024},
-      eprint={2402.03804},
-      archivePrefix={arXiv},
-      primaryClass={cs.LG}
 }
 ```

 Please kindly cite using the following BibTeX:
 ```bibtex
+@article{zhang2024relu2,
       title={ReLU$^2$ Wins: Discovering Efficient Activation Functions for Sparse LLMs},
       author={Zhengyan Zhang and Yixin Song and Guanghui Yu and Xu Han and Yankai Lin and Chaojun Xiao and Chenyang Song and Zhiyuan Liu and Zeyu Mi and Maosong Sun},
+      journal = {arXiv preprint arXiv:2402.03804},
+       year={2024},
 }
 ```