Spaces:

hajimemat
/

glaive-7b-training

Runtime error

Hajime MATSUMOTO commited on 18 days ago

Commit

702c22f

1 Parent(s): ce5bcf8

Reduce batch size to avoid OOM on L40S 48GB

Files changed (1) hide show

train.py CHANGED Viewed

@@ -234,10 +234,10 @@ training_args = TrainingArguments(
     num_train_epochs=2,
     max_steps=-1,  # -1 = エポックベース
-    # バッチサイズ (L40S 48GBなら大きく取れる)
-    per_device_train_batch_size=8,
-    per_device_eval_batch_size=8,
-    gradient_accumulation_steps=4,  # 有効バッチサイズ: 8*4=32
     # 学習率
     learning_rate=1e-4,

     num_train_epochs=2,
     max_steps=-1,  # -1 = エポックベース
+    # バッチサイズ (L40S 48GB + 7B QLoRA)
+    per_device_train_batch_size=2,
+    per_device_eval_batch_size=2,
+    gradient_accumulation_steps=16,  # 有効バッチサイズ: 2*16=32
     # 学習率
     learning_rate=1e-4,