Mani124124 committed on
Commit c4d619e · verified · 1 Parent(s): c8a5bb5

Upload folder using huggingface_hub

Files changed (1)
  1. README.md +4 -7
README.md CHANGED
@@ -8,12 +8,12 @@ license: apache-2.0
 library_name: peft
 pipeline_tag: text-generation
 tags:
-- base_model:adapter:unsloth/Qwen3-4B-Instruct-2507
+- qlora
 - lora
-- transformers
+- structured-output
 ---
 
-<[Assignment] Please fill this in yourself>
+unsloth/Qwen3-4B-Instruct-structeval-lora
 
 This repository provides a **LoRA adapter** fine-tuned from
 **unsloth/Qwen3-4B-Instruct-2507** using **QLoRA (4-bit, Unsloth)**.
@@ -33,7 +33,7 @@ while intermediate reasoning (Chain-of-Thought) is masked.
 
 - Base model: unsloth/Qwen3-4B-Instruct-2507
 - Method: QLoRA (4-bit)
-- Max sequence length: 256
+- Max sequence length: 128
 - Epochs: 1
 - Learning rate: 5e-05
 - LoRA: r=16, alpha=32
@@ -63,6 +63,3 @@ Training data: u-10bei/structured_data_with_cot_dataset_512_v5
 
 Dataset License: MIT License. This dataset is used and distributed under the terms of the MIT License.
 Compliance: Users must comply with the MIT license (including copyright notice) and the base model's original terms of use.
-### Framework versions
-
-- PEFT 0.18.1
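
Note: the training settings that README.md lists after this commit can be collected into a small config sketch. The class name `AdapterTrainingConfig` and the `lora_scale` property are hypothetical and are not part of the repository. The scale assumes the standard (not rank-stabilized) LoRA convention of multiplying the low-rank update by alpha / r, which the README does not state.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AdapterTrainingConfig:
    """Hypothetical container for the values listed in README.md after this commit."""

    base_model: str = "unsloth/Qwen3-4B-Instruct-2507"
    method: str = "QLoRA (4-bit)"
    max_seq_length: int = 128  # this commit changes the documented value from 256
    epochs: int = 1
    learning_rate: float = 5e-05
    lora_r: int = 16
    lora_alpha: int = 32

    @property
    def lora_scale(self) -> float:
        # Assumption: standard LoRA scaling, where the update is multiplied by alpha / r.
        return self.lora_alpha / self.lora_r


config = AdapterTrainingConfig()
print(config.max_seq_length, config.lora_scale)
```

Under that assumption, r=16 and alpha=32 give a scale of 2.0; the sketch only records the documented values and does not load or train the adapter.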