docs: update model card with v3 training data (58K SFT, 26K DPO), MLX benchmark, Markdown RT format 7f641b5 verified intrect committed on Feb 15
feat: update to DPO v4 merged model (SFT + DPO v4 language leak fix) c34c4f4 verified intrect committed on Feb 15
fix: change tokenizer_class to Qwen2TokenizerFast for vLLM compatibility ecf8e6b verified intrect committed on Feb 13
docs: update training data distribution with accurate numbers (SFT 36,713 + DPO 24,779) d35ad96 verified intrect committed on Feb 12
docs: update model card with GGUF formats, benchmarks, usage examples 2448e8a verified intrect committed on Feb 12
Fix tokenizer_config.json - remove extra_special_tokens list causing vLLM error f650ef7 verified intrect committed on Jan 28
Fix config.json for vLLM compatibility (remove layer_types, fix rope_parameters) 7dc1232 verified intrect committed on Jan 28
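
The vLLM compatibility fixes named in the commits above (ecf8e6b, f650ef7, 7dc1232) can be sketched roughly as follows. This is a minimal illustration, not the repository's actual script: the helper names `patch_tokenizer_config`, `patch_model_config`, and `patch_file` are hypothetical, and the `rope_parameters` change is left out because the commit message does not say what was fixed.

```python
import json
from pathlib import Path


def patch_tokenizer_config(cfg: dict) -> dict:
    """Return a copy of a tokenizer_config.json dict with the described fixes."""
    patched = dict(cfg)
    # ecf8e6b: set tokenizer_class to Qwen2TokenizerFast.
    patched["tokenizer_class"] = "Qwen2TokenizerFast"
    # f650ef7: drop extra_special_tokens when it is stored as a list.
    # Assumption: a non-list value is left alone, since the commit only
    # mentions the list form.
    if isinstance(patched.get("extra_special_tokens"), list):
        del patched["extra_special_tokens"]
    return patched


def patch_model_config(cfg: dict) -> dict:
    """Return a copy of a config.json dict without the layer_types key."""
    patched = dict(cfg)
    # 7dc1232: remove layer_types. rope_parameters is deliberately untouched
    # because the commit message does not specify the fix.
    patched.pop("layer_types", None)
    return patched


def patch_file(path: Path, patch) -> None:
    """Hypothetical helper: load a JSON file, apply a patch, write it back."""
    data = json.loads(path.read_text(encoding="utf-8"))
    path.write_text(json.dumps(patch(data), indent=2) + "\n", encoding="utf-8")
```

Applied to a local checkout, this would be called as `patch_file(Path("tokenizer_config.json"), patch_tokenizer_config)` and `patch_file(Path("config.json"), patch_model_config)`, assuming both files sit in the current directory.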