fix: CUDA isolation + vllm import ์ ๊ฑฐ + health check ์์ b5e31d0 Deploy commited on about 6 hours ago
fix: PORT ์ถฉ๋ + _init_graph lifespan ์ด๋ + lora-modules ์์ 79159da Deploy commited on about 7 hours ago
fix: LoRA ํ๋กฌํํธ ์ ๋ ฌ + ๋๊ตฌ ํ๋ผ๋ฏธํฐ ์คํค๋ง + API ํค ์ธ์ฝ๋ฉ c794814 verified umyunsang commited on 1 day ago
fix: ํ๋กฌํํธ ํ์ตํ์ ์ ๋ ฌ + synthesize_final ๋ฒ ์ด์ค๋ชจ๋ธ ์ ํ 1f74f5a verified umyunsang commited on 1 day ago
fix: draft max_tokens 512โ2048 (thought ๋ธ๋ก์ด ํ ํฐ ์์งํ์ฌ ๋น ์ด์ ๋ฌธ์ ) b15e72a verified umyunsang commited on 1 day ago
feat: LoRA-First ์ํคํ ์ฒ โ self-RAG ์ ๊ฑฐ, ๋ณ๋ ฌ ์ด์+๊ฒ์, LoRA ํฉ์ฑ 671d971 verified umyunsang commited on 1 day ago
fix: DirectEnginePlanner tokenizer native tool calling + RegexPlanner fallback e7c7f3c verified umyunsang commited on 1 day ago
sync: main branch src/ with PR#561+#563 (tool calling + E2E observability) 0b04246 verified umyunsang commited on 1 day ago
fix: _strip_thought_blocks์ <think>...</think> ํจํด ์ถ๊ฐ (EXAONE-4.0 ์ถ๋ก ๋ชจ๋ ์ง์) 3f5e9ae umyunsang commited on 1 day ago
fix: append_evidence ์ปจํ ์คํธ ๊ฐ์ โ accumulated์์ ์ด์ ๋ต๋ณ ์ถ์ถ + ๋น stripped ํ ์คํธ fallback 77f0193 umyunsang commited on 1 day ago
fix: ํ์ดํ๋ผ์ธ synthesis ํ ์คํธ ์ ๋ฌ ์คํจ ๋ฐ capability ํ์์์ 5๊ฐ ์์ 5a6f9bd umyunsang commited on 1 day ago
fix: planner_node์์ plan() PlanValidationError ๋ฏธ์บ์น ์์ 087a3e6 github-actions commited on 1 day ago
fix: chat_completions ์ฝ๋๋ฆฌ๋ทฐ ๋ฐ์ + DirectEnginePlannerAdapter ๋์ 88a5069 github-actions commited on 1 day ago
fix: /v1/chat/completions ์๋ํฌ์ธํธ ์ถ๊ฐ โ LLMPlannerAdapter 404 ์์ 5bafa93 github-actions commited on 1 day ago
fix: SentenceTransformer device=cpu โ vLLM VRAM ์ถฉ๋ ๋ฐฉ์ง 952058d github-actions commited on 1 day ago