Commit History

docs: 벤치마크 섹션 한국어로 변환
1bc7a0f
verified

intrect commited on

data: add raw benchmark results JSON (KMMLU + HAE-RAE, 3-model comparison)
de3e104
verified

intrect commited on

docs: add Korean LLM benchmark results (KMMLU + HAE-RAE, 3-model comparison)
4043357
verified

intrect commited on

fix: replace Qwen default system prompt with VELA identity in chat templates
8fa0fb0
verified

intrect commited on

docs: add llama-cpp-python, vllm, ollama tags for library discovery
6eb484c
verified

intrect commited on

docs: add MLX 4-bit format, update quant links in model card
8bdb2c7
verified

intrect commited on

feat: add MLX 4-bit quantized model (Apple Silicon optimized)
d7f1985
verified

intrect commited on

docs: update README to v1.3 (DPO v6)
a8a5cbd
verified

intrect commited on

feat: update to DPO v6 merged model (BF16 safetensors)
ffbdde5
verified

intrect commited on

feat: add DPO v6 GGUF Q4_K_M model
2616a76
verified

intrect commited on

chore: remove v4 GGUF (replacing with v6)
4917e1e
verified

intrect commited on

Update README.md
07d2ac1
verified

intrect commited on

feat: vela-v5-merged 업로드 (SFT merged, rep_penalty=1.05)
a8cb4d0
verified

intrect commited on

fix: generation_config rep_penalty=1.05, top_k=20, top_p=0.8 (vela-v5-merged)
a0a3973
verified

intrect commited on

docs: add recommended inference settings and backend configuration guide
3e8a713
verified

intrect commited on

fix: match llama-cpp-python defaults (top_k=40, top_p=0.95, rep_penalty=1.0)
dea4e49
verified

intrect commited on

fix: rollback generation_config to safe Qwen2.5 defaults (rep_penalty 1.3→1.1)
bfc3043
verified

intrect commited on

fix: update generation params — repetition_penalty 1.05→1.3, top_k 20→50, top_p 0.8→0.92
d4a1479
verified

intrect commited on

docs: add GitHub framework link and badges
67d9f79
verified

intrect commited on

Update README.md
3fbe015
verified

intrect commited on

docs: add real output example (RT + Quick Assessment + Analysis Report)
c66f3a2
verified

intrect commited on

docs: update model card with v3 training data (58K SFT, 26K DPO), MLX benchmark, Markdown RT format
7f641b5
verified

intrect commited on

feat: update to DPO v4 merged model (SFT + DPO v4 language leak fix)
c34c4f4
verified

intrect commited on

Delete vela-q8_0.gguf with huggingface_hub
b4ff857
verified

intrect commited on

Delete model.safetensors with huggingface_hub
843923f
verified

intrect commited on

feat: replace SFT-only GGUF with DPO v4 merged Q4_K_M
88a0458
verified

intrect commited on

Delete vela-q4_k_m.gguf with huggingface_hub
a3cb0d9
verified

intrect commited on

fix: change tokenizer_class to Qwen2TokenizerFast for vLLM compatibility
ecf8e6b
verified

intrect commited on

docs: update training data distribution with accurate numbers (SFT 36,713 + DPO 24,779)
d35ad96
verified

intrect commited on

feat: add Q8_0 GGUF quantized model (7.6GB)
ecb3e2c
verified

intrect commited on

feat: add Q4_K_M GGUF quantized model (4.4GB)
7b906a5
verified

intrect commited on

docs: update model card with GGUF formats, benchmarks, usage examples
2448e8a
verified

intrect commited on

Fix tokenizer_config.json - remove extra_special_tokens list causing vLLM error
f650ef7
verified

intrect commited on

Fix config.json for vLLM compatibility (remove layer_types, fix rope_parameters)
7dc1232
verified

intrect commited on

Fix year to 2026
086591b
verified

intrect commited on

Update model card for DPO v4
0330d0c
verified

intrect commited on

Update to DPO v4 merged model
c5f7d77
verified

intrect commited on

Upload folder using huggingface_hub
ee82134
verified

intrect commited on

initial commit
a713b02
verified

intrect commited on