Commit History

fix: LoRA ํ”„๋กฌํ”„ํŠธ ์ •๋ ฌ + ๋„๊ตฌ ํŒŒ๋ผ๋ฏธํ„ฐ ์Šคํ‚ค๋งˆ + API ํ‚ค ์ธ์ฝ”๋”ฉ
c794814
verified

umyunsang commited on

fix: synthesize_with_lora โ†’ synthesize_final
45f5eaf
verified

umyunsang commited on

fix: ํ”„๋กฌํ”„ํŠธ ํ•™์Šตํ˜•์‹ ์ •๋ ฌ + synthesize_final ๋ฒ ์ด์Šค๋ชจ๋ธ ์ „ํ™˜
1f74f5a
verified

umyunsang commited on

fix: draft max_tokens 512โ†’2048 (thought ๋ธ”๋ก์ด ํ† ํฐ ์†Œ์ง„ํ•˜์—ฌ ๋นˆ ์ดˆ์•ˆ ๋ฌธ์ œ)
b15e72a
verified

umyunsang commited on

feat: LoRA-First ์•„ํ‚คํ…์ฒ˜ โ€” self-RAG ์ œ๊ฑฐ, ๋ณ‘๋ ฌ ์ดˆ์•ˆ+๊ฒ€์ƒ‰, LoRA ํ•ฉ์„ฑ
671d971
verified

umyunsang commited on

fix: DirectEnginePlanner tokenizer native tool calling + RegexPlanner fallback
e7c7f3c
verified

umyunsang commited on

sync: api_server.py KV_CACHE_DTYPE=fp8 ์ง€์›
0b5bcad
verified

umyunsang commited on

sync: main branch src/ with PR#561+#563 (tool calling + E2E observability)
0b04246
verified

umyunsang commited on

fix: __interrupt__ ๋…ธ๋“œ ๊ฐ์ง€ โ†’ awaiting_approval emit
ee978c4

umyunsang commited on

fix: _strip_thought_blocks์— <think>...</think> ํŒจํ„ด ์ถ”๊ฐ€ (EXAONE-4.0 ์ถ”๋ก  ๋ชจ๋“œ ์ง€์›)
3f5e9ae

umyunsang commited on

fix: append_evidence ์ปจํ…์ŠคํŠธ ๊ฐœ์„  โ€” accumulated์—์„œ ์ด์ „ ๋‹ต๋ณ€ ์ถ”์ถœ + ๋นˆ stripped ํ…์ŠคํŠธ fallback
77f0193

umyunsang commited on

fix: ํŒŒ์ดํ”„๋ผ์ธ synthesis ํ…์ŠคํŠธ ์ „๋‹ฌ ์‹คํŒจ ๋ฐ capability ํƒ€์ž„์•„์›ƒ 5๊ฐœ ์ˆ˜์ •
5a6f9bd

umyunsang commited on

fix: planner_node์—์„œ plan() PlanValidationError ๋ฏธ์บ์น˜ ์ˆ˜์ •
087a3e6

github-actions commited on

fix: chat_completions ์ฝ”๋“œ๋ฆฌ๋ทฐ ๋ฐ˜์˜ + DirectEnginePlannerAdapter ๋„์ž…
88a5069

github-actions commited on

fix: /v1/chat/completions ์—”๋“œํฌ์ธํŠธ ์ถ”๊ฐ€ โ€” LLMPlannerAdapter 404 ์ˆ˜์ •
5bafa93

github-actions commited on

fix: SentenceTransformer device=cpu โ€” vLLM VRAM ์ถฉ๋Œ ๋ฐฉ์ง€
952058d

github-actions commited on

Upload folder using huggingface_hub
9e65b56
verified

umyunsang commited on