faz7_rag.py: --int8 dynamic quant (~2x kucuk bellek 708->325MB, ~1.2x hiz, ciktilar birebir) 99a2344 verified kdirgul commited on 6 days ago
faz7_rag.py: decode-cache (step/KV-cache) — CPU uretim O(L), ~6x hizli, full-recompute ile birebir a7490d2 verified kdirgul commited on 6 days ago
faz7_rag: --device cpu (LambaCPU saf-PyTorch yolu; fork import soft, CPU RAG yerelde calisiyor) 48d322f verified kdirgul commited on 6 days ago
RAG-SFT: faz7_rag.py (extractive QA converter + prompt hizalama) 5027d66 verified kdirgul commited on 13 days ago
faz7_rag: kisa/no-think talimati + <think> strip + --max_new 96 0a8d724 verified kdirgul commited on 13 days ago
faz7_rag: RAG prompt SFT formatina hizalandi + dil-duyarli yonerge e1f3192 verified kdirgul commited on 13 days ago
faz7_rag: --ab modu (RAG vs ham, tek kosu) + DEMO_QUERIES 6567f05 verified kdirgul commited on 13 days ago
faz7_rag: RAG pipeline (retrieve+compress+inject, demo corpus) 14ee509 verified kdirgul commited on 13 days ago