makotonlo
/

LLM2026_SFT_final

Text Generation

structured-output

Model card Files Files and versions

makotonlo commited on Feb 7

Commit

ad2b72b

·

verified ·

1 Parent(s): 51655a6

Update README.md

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -33,9 +33,9 @@ while intermediate reasoning (Chain-of-Thought) is masked.
 - Base model: Qwen/Qwen3-4B-Instruct-2507
 - Method: QLoRA (4-bit)
-- Max sequence length: 512
-- Epochs: 1
-- Learning rate: 1e-06
 - LoRA: r=64, alpha=128
 ## Usage

 - Base model: Qwen/Qwen3-4B-Instruct-2507
 - Method: QLoRA (4-bit)
+- Max sequence length: 1024
+- Epochs: 2
+- Learning rate: 5e-5
 - LoRA: r=64, alpha=128
 ## Usage