katuni4ka
/

tiny-random-deepseek_v32

Model card Files Files and versions

tiny-random-deepseek_v32 / inference /README.md

katuni4ka's picture

Upload 23 files

a93cb08 verified about 2 months ago

|

history blame contribute delete

548 Bytes

	# DeepSeek V3.2

	First convert huggingface model weights to the the format required by our inference demo. Set `MP` to match your available GPU count:
	```bash
	cd inference
	export EXPERTS=256
	python convert.py --hf-ckpt-path ${HF_CKPT_PATH} --save-path ${SAVE_PATH} --n-experts ${EXPERTS} --model-parallel ${MP}
	```

	Launch the interactive chat interface and start exploring DeepSeek's capabilities:
	```bash
	export CONFIG=config_671B_v3.2.json
	torchrun --nproc-per-node ${MP} generate.py --ckpt-path ${SAVE_PATH} --config ${CONFIG} --interactive
	```