Update model card with benchmark results (MMLU 48.7%, HellaSwag 58%, GSM8K 55%) 4fdc85b verified rawcell committed 7 days ago
Upload configuration_deepseek.py with huggingface_hub 419d51e verified rawcell committed 7 days ago
Add custom inference handler for DeepSeekV3 architecture 6718e1a verified rawcell committed 7 days ago
Upload Moonlight-16B-A3B-Instruct abliterated with Bruno MoE gate abliteration 13117a3 verified rawcell committed 7 days ago