Update README.md
This repository provides a **LoRA adapter** fine-tuned from **Matukaze/test105** using **LoRA + Unsloth**.

This model was initialised from Qwen2.5-7B-Instruct.

This repository contains **LoRA adapter weights only**. The base model must be loaded separately.
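Because only the adapter weights ship here, loading means attaching them to the separately loaded base model. A minimal sketch using `transformers` and `peft` (hypothetical: the adapter repo id is a placeholder, and device/dtype options are omitted):

```python
# Hypothetical loading sketch: load the base model first, then attach
# the LoRA adapter weights from this repository on top of it.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_id = "Matukaze/test105"           # base model named above
adapter_id = "<this-adapter-repo-id>"  # placeholder for this repository's id

tokenizer = AutoTokenizer.from_pretrained(base_id)
base = AutoModelForCausalLM.from_pretrained(base_id)
model = PeftModel.from_pretrained(base, adapter_id)  # attach LoRA weights
```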
- Epochs: 0
- Learning rate: 5e-06
- LoRA: r=32, alpha=128
- This model was fine-tuned sequentially using LoRA:
  - First stage: fine-tuned on the DBBench SFT dataset and merged.
  - Second stage: further fine-tuned on the ALFWorld SFT dataset and merged.

The model architecture remains unchanged.
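The "fine-tuned ... and merged" steps above fold each stage's LoRA update back into the base weights as W' = W + (alpha/r)·(B·A), which is why the architecture is unchanged afterwards. A toy pure-Python sketch of that merge (illustrative only; the tiny 2×2 matrices and r=1, alpha=2 are made up, while the actual run used r=32, alpha=128, i.e. a scale of 4.0):

```python
def merge_lora(W, A, B, r, alpha):
    """Fold a LoRA update (B @ A, scaled by alpha/r) into weight matrix W."""
    scale = alpha / r
    rows, cols, inner = len(B), len(A[0]), len(A)
    out = [row[:] for row in W]  # copy so the base weights stay intact
    for i in range(rows):
        for j in range(cols):
            out[i][j] += scale * sum(B[i][k] * A[k][j] for k in range(inner))
    return out

W = [[1.0, 0.0], [0.0, 1.0]]  # toy base weight matrix
A = [[0.5, 0.25]]             # r x d_in  low-rank factor (r = 1 here)
B = [[0.1], [0.2]]            # d_out x r low-rank factor

# Stage 1: merge the first adapter into the base weights.
W1 = merge_lora(W, A, B, r=1, alpha=2)
```

Stage two repeats the same merge with its own A and B on the stage-one result, so the final checkpoint keeps the base model's shape.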
## Usage