| # DeepSeek V3.2 |
|
|
| First convert huggingface model weights to the the format required by our inference demo. Set `MP` to match your available GPU count: |
| ```bash |
| cd inference |
| export EXPERTS=256 |
| python convert.py --hf-ckpt-path ${HF_CKPT_PATH} --save-path ${SAVE_PATH} --n-experts ${EXPERTS} --model-parallel ${MP} |
| ``` |
|
|
| Launch the interactive chat interface and start exploring DeepSeek's capabilities: |
| ```bash |
| export CONFIG=config_671B_v3.2.json |
| torchrun --nproc-per-node ${MP} generate.py --ckpt-path ${SAVE_PATH} --config ${CONFIG} --interactive |
| ``` |