Update README.md
README.md
CHANGED
# Kumru-2B LoRA Adapter

This repository provides a **LoRA** adapter extracted from the **VNGRS Kumru-2B** model (`vngrs-ai/Kumru-2B`, the SFT/chat variant), to be applied on top of the base model `vngrs-ai/Kumru-2B-Base`. The goal is to transfer Kumru's chat/instruction behavior to `Kumru-2B-Base` deployments with a lightweight file footprint.

## Model Summary

- **Base model:** `vngrs-ai/Kumru-2B-Base`
- **Source (target behavior) model:** `vngrs-ai/Kumru-2B` (SFT/chat)
- **Technique:** Low-Rank Adaptation (LoRA)
- **LoRA rank / alpha:** 768 / 1024 _(update these if you produce a different build)_
- **Layer coverage:** All self-attention and MLP projections
- **Output artifacts:** PEFT-compatible `adapter_config.json` + `adapter_model.safetensors`
- **License:** Apache 2.0 (aligned with VNGRS Kumru model licensing)
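
For illustration, a `peft` `LoraConfig` matching the summary above might look like the sketch below. The `target_modules` list is an assumption ("all self-attention and MLP projections" on a typical decoder block), not copied from the release config:

```python
from peft import LoraConfig

# Hypothetical config mirroring the summary above; target_modules is an
# assumption for a typical decoder with separate attention/MLP projections.
config = LoraConfig(
    r=768,
    lora_alpha=1024,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
    task_type="CAUSAL_LM",
)
```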

## Usage

```python
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

# Load the base model and attach this adapter; the repo id below is a
# placeholder for this repository's id.
base = AutoModelForCausalLM.from_pretrained("vngrs-ai/Kumru-2B-Base")
model = PeftModel.from_pretrained(base, "<this-adapter-repo>")
tokenizer = AutoTokenizer.from_pretrained("vngrs-ai/Kumru-2B-Base")

inputs = tokenizer("Merhaba, kendini tanıtır mısın?", return_tensors="pt").input_ids
outputs = model.generate(inputs, do_sample=True, max_new_tokens=512, temperature=0.7, top_p=0.9)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```

> Note: This adapter must be used together with `vngrs-ai/Kumru-2B-Base`.
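
If you prefer a single checkpoint for serving, `peft`'s standard `merge_and_unload` call folds the adapter into the base weights (the output path below is illustrative):

```python
# Fold the LoRA weights into the base model and save a plain checkpoint.
merged = model.merge_and_unload()          # returns a regular transformers model
merged.save_pretrained("kumru-2b-merged")  # illustrative output path
tokenizer.save_pretrained("kumru-2b-merged")
```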

## Extraction Process

The adapter is obtained by computing the weight delta between the base and the SFT checkpoints and factorizing it with **SVD** into low-rank components (a minimal sketch follows the script reference below). In this release, the measured reconstruction error is approximately **0.409**. To better preserve quality, you may increase rank/alpha and export a new version (e.g., rank **1024** / alpha **2048**); a lower-error build will be added as soon as possible.

- Script: export_kumru.py
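
For orientation, here is a minimal sketch of the SVD step described above, not the actual export_kumru.py; the function name, default rank/alpha, and scaling convention are assumptions based on how PEFT applies LoRA (delta ≈ `(alpha / rank) * B @ A`):

```python
import torch

def lora_factors(base_w: torch.Tensor, sft_w: torch.Tensor,
                 rank: int = 768, alpha: int = 1024):
    """Factorize one layer's weight delta into LoRA A/B via truncated SVD.

    Illustrative sketch only, not the release script.
    """
    delta = (sft_w - base_w).float()            # (out_features, in_features)
    U, S, Vh = torch.linalg.svd(delta, full_matrices=False)
    root_s = torch.sqrt(S[:rank])               # split singular values across A and B
    B = U[:, :rank] * root_s                    # (out_features, rank)
    A = root_s[:, None] * Vh[:rank, :]          # (rank, in_features)
    scale = alpha / rank                        # PEFT applies delta ~= scale * B @ A
    B, A = B / scale**0.5, A / scale**0.5       # pre-compensate that scaling
    rel_err = torch.linalg.norm(delta - scale * B @ A) / torch.linalg.norm(delta)
    return A, B, rel_err.item()                 # rel_err: per-layer reconstruction error
```

Running this over every attention and MLP projection and saving the factors (e.g., with `safetensors`) yields adapter artifacts like those listed above.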

## Known Limitations

- Kumru-2B is still a ~2B-parameter model; it may struggle with very long context, rare technical terms, and complex math.
- With low ranks, SVD-based LoRA can be less stable than the original SFT checkpoint.
- Training data is based on VNGRS's public Turkish corpus-cleaning pipeline; truthfulness/hallucination issues may still occur.

### Framework versions

- PEFT 0.11.1