LordNeel/training-scripts

LordNeel committed on Jan 21

Commit 3cc883e (verified)

1 Parent(s): af0763b

Upload train_glm47_flash.py with huggingface_hub

Files changed (1)

train_glm47_flash.py

@@ -158,7 +158,7 @@ training_config = SFTConfig(
     per_device_eval_batch_size=1,
     gradient_accumulation_steps=16,  # Effective batch size: 16
     learning_rate=2e-4,
-    max_seq_length=1024,  # Reduced for memory
+    max_length=1024,  # Reduced for memory
     # Memory optimization
     gradient_checkpointing=True,
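
The hunk above only renames the sequence-length keyword passed to SFTConfig from max_seq_length to max_length. The commit does not say which library version prompted the rename, so the following is a minimal sketch, not part of the commit, of one way a script could pick whichever keyword the installed config class accepts. The helper length_kwarg and the two stand-in dataclasses are hypothetical names introduced for illustration; only inspect.signature is a real standard-library call.

```python
import dataclasses
import inspect


def length_kwarg(config_cls, value, candidates=("max_length", "max_seq_length")):
    """Return {name: value} for the first candidate keyword config_cls accepts.

    Hypothetical helper: it inspects the constructor signature instead of
    assuming which spelling the installed library uses.
    """
    params = inspect.signature(config_cls).parameters
    for name in candidates:
        if name in params:
            return {name: value}
    raise TypeError(
        f"{config_cls.__name__} accepts none of the keywords {candidates}"
    )


# Stand-ins for a config class before and after the rename (illustrative only).
@dataclasses.dataclass
class NewStyleConfig:
    learning_rate: float = 2e-4
    max_length: int = 1024


@dataclasses.dataclass
class OldStyleConfig:
    learning_rate: float = 2e-4
    max_seq_length: int = 1024


if __name__ == "__main__":
    for cls in (NewStyleConfig, OldStyleConfig):
        kwargs = length_kwarg(cls, 1024)
        config = cls(learning_rate=2e-4, **kwargs)
        print(cls.__name__, kwargs, config)
```

In the training script this would be used as SFTConfig(**length_kwarg(SFTConfig, 1024), ...), assuming SFTConfig exposes its fields as constructor parameters. Pinning the library version and using the single matching keyword, as the commit does, is the simpler choice when the environment is fixed.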