macab
/

lora_structeval_t_qwen3_4b_1st

Text Generation

structured-output

Model card Files Files and versions

macab commited on Feb 10

Commit

715fa47

·

verified ·

1 Parent(s): 072ea96

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -33,7 +33,7 @@ while intermediate reasoning (Chain-of-Thought) is masked.
 - Base model: Qwen/Qwen3-4B-Instruct-2507
 - Method: QLoRA (4-bit)
-- Max sequence length: 512
 - Epochs: 1
 - Learning rate: 1e-06
 - LoRA: r=64, alpha=128

 - Base model: Qwen/Qwen3-4B-Instruct-2507
 - Method: QLoRA (4-bit)
+- Max sequence length: 768
 - Epochs: 1
 - Learning rate: 1e-06
 - LoRA: r=64, alpha=128