ceperaltab
/

diamond-vision-training-code

ceperaltab commited on 17 days ago

Commit

d0c87cc

verified ·

1 Parent(s): 81c1e1b

Upload train.py with huggingface_hub

Files changed (1) hide show

train.py CHANGED Viewed

@@ -60,7 +60,7 @@ def main():
         per_device_train_batch_size=1,
         gradient_accumulation_steps=8,
         learning_rate=2e-4,
-        max_seq_length=2048,           # CV code is verbose — larger than default
         # Logging & checkpointing
         logging_steps=10,

         per_device_train_batch_size=1,
         gradient_accumulation_steps=8,
         learning_rate=2e-4,
+        # NOTE: max_seq_length is NOT supported in SFTConfig (trl>=0.12.0) — removed
         # Logging & checkpointing
         logging_steps=10,