Cheeeeeeeeky/affine

Cheeeeeeeeky committed on Jan 22

Commit 4411328 · verified · 1 Parent(s): 0622c2a

upload model

Files changed (1)

inference/README.md +14 -0

inference/README.md ADDED

	@@ -0,0 +1,14 @@

+# DeepSeek V3.2
+First, convert the Hugging Face model weights to the format required by our inference demo. Set `MP` to match your available GPU count:
+```bash
+cd inference
+export EXPERTS=256
+python convert.py --hf-ckpt-path ${HF_CKPT_PATH} --save-path ${SAVE_PATH} --n-experts ${EXPERTS} --model-parallel ${MP}
+```
+Launch the interactive chat interface and start exploring DeepSeek's capabilities:
+```bash
+export CONFIG=config_671B_v3.2.json
+torchrun --nproc-per-node ${MP} generate.py --ckpt-path ${SAVE_PATH} --config ${CONFIG} --interactive
+```
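
The README asks you to set `MP` to the GPU count but never exports it. A minimal sketch of one way to derive it follows, assuming that `nvidia-smi` is installed and that the conversion step needs the expert count to split evenly across model-parallel ranks. The `pick_mp` helper and the `GPUS` variable are hypothetical names introduced for illustration and are not part of the repository.

```bash
#!/usr/bin/env bash
# Hypothetical helper (not part of the repository): print the largest value
# that is <= the GPU count and divides the expert count evenly.
pick_mp() {
    local gpus="$1" experts="$2" mp
    for ((mp = gpus; mp >= 1; mp--)); do
        if ((experts % mp == 0)); then
            echo "$mp"
            return 0
        fi
    done
    return 1
}

# Assumption: nvidia-smi is on PATH; otherwise fall back to a single device.
if command -v nvidia-smi >/dev/null 2>&1; then
    GPUS=$(nvidia-smi --list-gpus 2>/dev/null | wc -l)
else
    GPUS=1
fi
((GPUS >= 1)) || GPUS=1

# EXPERTS defaults to 256, the value exported in the README.
MP=$(pick_mp "$GPUS" "${EXPERTS:-256}")
export MP
echo "MP=${MP}"
```

With `MP` exported this way, the `convert.py` and `torchrun` commands in the README can be run unchanged. Rounding down to a divisor of the expert count is a design choice of this sketch; on a machine with 6 GPUs it would select `MP=4` and leave two devices idle.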