Mani124124
/

structeval-lora

@@ -8,12 +8,12 @@ license: apache-2.0
 library_name: peft
 pipeline_tag: text-generation
 tags:
-- base_model:adapter:unsloth/Qwen3-4B-Instruct-2507
 - lora
-- transformers
 ---
-＜【課題】ここは自分で記入して下さい＞
 This repository provides a **LoRA adapter** fine-tuned from
 **unsloth/Qwen3-4B-Instruct-2507** using **QLoRA (4-bit, Unsloth)**.
@@ -33,7 +33,7 @@ while intermediate reasoning (Chain-of-Thought) is masked.
 - Base model: unsloth/Qwen3-4B-Instruct-2507
 - Method: QLoRA (4-bit)
-- Max sequence length: 256
 - Epochs: 1
 - Learning rate: 5e-05
 - LoRA: r=16, alpha=32
@@ -63,6 +63,3 @@ Training data: u-10bei/structured_data_with_cot_dataset_512_v5
 Dataset License: MIT License. This dataset is used and distributed under the terms of the MIT License.
 Compliance: Users must comply with the MIT license (including copyright notice) and the base model's original terms of use.
-### Framework versions
-- PEFT 0.18.1

 library_name: peft
 pipeline_tag: text-generation
 tags:
+- qlora
 - lora
+- structured-output
 ---
+unsloth/Qwen3-4B-Instruct-structeval-lora
 This repository provides a **LoRA adapter** fine-tuned from
 **unsloth/Qwen3-4B-Instruct-2507** using **QLoRA (4-bit, Unsloth)**.
 - Base model: unsloth/Qwen3-4B-Instruct-2507
 - Method: QLoRA (4-bit)
+- Max sequence length: 128
 - Epochs: 1
 - Learning rate: 5e-05
 - LoRA: r=16, alpha=32
 Dataset License: MIT License. This dataset is used and distributed under the terms of the MIT License.
 Compliance: Users must comply with the MIT license (including copyright notice) and the base model's original terms of use.