Snowflake
/

snowflake-arctic-instruct

Text Generation

Mixture of Experts

Model card Files Files and versions

jeffra commited on Apr 24, 2024

Commit

e7ae5bd

·

verified ·

1 Parent(s): d32df80

Update README.md

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -8,7 +8,7 @@ tags:
 ## Model Details
-Arctic is a Dense-MoE Hybrid transformer architecture pre-trained from scratch by the Snowflake AI
 Research Team. We are releasing model checkpoints for both the base and instruct-tuned versions of
 Arctic under an Apache-2.0 license. This means you can use them freely in your own research,
 prototypes, and products. Please see our blog
@@ -37,7 +37,7 @@ For the latest details about Snowflake Arctic including tutorials, etc. please r
 Arctic combines a 10B dense transformer model with a residual 128x3.66B MoE MLP resulting in 480B
 total and 17B active parameters chosen using a top-2 gating. For more details about Arctic's model
-architecture please see our cookbook
 ## Usage
@@ -62,4 +62,4 @@ pip install "deepspeed>=0.14.2"
 The Arctic github page has several resources around running inference:
 * Example with pure-HF: https://github.com/Snowflake-Labs/snowflake-arctic/blob/main/inference
-* Tutorial using vLLM: https://github.com/Snowflake-Labs/snowflake-arctic/tree/main/inference/vllm

 ## Model Details
+Arctic is a dense-MoE Hybrid transformer architecture pre-trained from scratch by the Snowflake AI
 Research Team. We are releasing model checkpoints for both the base and instruct-tuned versions of
 Arctic under an Apache-2.0 license. This means you can use them freely in your own research,
 prototypes, and products. Please see our blog
 Arctic combines a 10B dense transformer model with a residual 128x3.66B MoE MLP resulting in 480B
 total and 17B active parameters chosen using a top-2 gating. For more details about Arctic's model
+Architecture, training process, data, etc. [see our series of cookbooks](https://www.snowflake.com/en/data-cloud/arctic/cookbook/).
 ## Usage
 The Arctic github page has several resources around running inference:
 * Example with pure-HF: https://github.com/Snowflake-Labs/snowflake-arctic/blob/main/inference
+* Tutorial using vLLM: https://github.com/Snowflake-Labs/snowflake-arctic/tree/main/inference/vllm