Moonlight-16B-A3B-Instruct-Fast / modeling_deepseek.py

Commit History

use torchtitan moe impl
fe340b5

Jackmin108 committed on

original files
1805272

Jackmin108 committed on