pingzhili/fairseq-moe-15b


Phando committed on Sep 23, 2023

Commit 40f056e · 1 Parent(s): fb04c8e

Create README.md

Files changed (1)

README.md +6 -0

README.md ADDED

	@@ -0,0 +1,6 @@

+---
+language: en
+---
+This is a Hugging Face transformers-style conversion of the original 15B-parameter SMoE model from the paper "[Efficient Large Scale Language Modeling with Mixtures of Experts](https://arxiv.org/abs/2112.10684)" by Artetxe et al. The original model card can be found at https://github.com/facebookresearch/fairseq/blob/main/examples/moe_lm/model_card.md.
+The usage example and modeling code can be found at https://github.com/pingzhili/light-fairseq.