Upload folder using huggingface_hub
README.md
CHANGED
@@ -8,12 +8,12 @@ license: apache-2.0
 library_name: peft
 pipeline_tag: text-generation
 tags:
--
+- qlora
 - lora
--
+- structured-output
 ---
 
-
+unsloth/Qwen3-4B-Instruct-structeval-lora
 
 This repository provides a **LoRA adapter** fine-tuned from
 **unsloth/Qwen3-4B-Instruct-2507** using **QLoRA (4-bit, Unsloth)**.
@@ -33,7 +33,7 @@ while intermediate reasoning (Chain-of-Thought) is masked.
 
 - Base model: unsloth/Qwen3-4B-Instruct-2507
 - Method: QLoRA (4-bit)
-- Max sequence length:
+- Max sequence length: 128
 - Epochs: 1
 - Learning rate: 5e-05
 - LoRA: r=16, alpha=32
@@ -63,6 +63,3 @@ Training data: u-10bei/structured_data_with_cot_dataset_512_v5
 
 Dataset License: MIT License. This dataset is used and distributed under the terms of the MIT License.
 Compliance: Users must comply with the MIT license (including copyright notice) and the base model's original terms of use.
-### Framework versions
-
-- PEFT 0.18.1
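
As a reading aid for the `LoRA: r=16, alpha=32` line in the second hunk, here is a minimal sketch of what those two values imply under the standard LoRA formulation, where the low-rank update is scaled by alpha / r. The class and function names are hypothetical, and the 1024 x 1024 layer shape is an arbitrary illustration: the README does not say which modules are adapted, what their dimensions are, or whether a non-default scaling variant was used.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LoraHyperparams:
    """Hypothetical container for the two LoRA values listed in the README."""

    r: int = 16      # LoRA rank, taken from the README
    alpha: int = 32  # LoRA alpha, taken from the README

    @property
    def scaling(self) -> float:
        # Standard LoRA multiplies the low-rank update B @ A by alpha / r.
        return self.alpha / self.r


def lora_param_count(d_in: int, d_out: int, r: int) -> int:
    """Trainable parameters added to one linear layer of shape (d_out, d_in).

    A has shape (r, d_in) and B has shape (d_out, r).
    """
    return r * (d_in + d_out)


cfg = LoraHyperparams()
print(cfg.scaling)                          # 2.0
print(lora_param_count(1024, 1024, cfg.r))  # 32768 (illustrative layer size)
```

With these values the adapter update is applied at twice its raw magnitude, and each adapted layer of the illustrative size adds 32,768 trainable parameters.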