Hajime MATSUMOTO committed
Commit 113833d
Parent(s): c179929
L40S optimization: batch 8, disable gradient checkpointing, parallel dataloader
train.py
CHANGED
@@ -237,10 +237,10 @@ training_args = TrainingArguments(
     num_train_epochs=1,
     max_steps=-1,  # -1 = epoch-based
 
-    # Batch size (
-    per_device_train_batch_size=
-    per_device_eval_batch_size=
-    gradient_accumulation_steps=
+    # Batch size (L40S 48GB - aggressive settings)
+    per_device_train_batch_size=8,
+    per_device_eval_batch_size=8,
+    gradient_accumulation_steps=2,  # effective batch size: 8*2=16
 
     # Learning rate (set high to converge within 1 epoch)
     learning_rate=2e-4,
@@ -264,7 +264,10 @@ training_args = TrainingArguments(
     # Other
     report_to="none",
     group_by_length=True,
-    gradient_checkpointing=
+    gradient_checkpointing=False,  # L40S has 48GB, so disable it for speed
+    torch_compile=False,  # avoid the initial compile time
+    dataloader_num_workers=4,
+    dataloader_pin_memory=True,
 
     # For resuming
     save_safetensors=True,
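
The batch-size comment in this change relies on the arithmetic 8*2=16. A minimal sketch of that calculation follows, assuming a single GPU (the commit does not state the device count). The helper names `effective_batch_size` and `estimated_steps_per_epoch` and the 10,000-sample dataset size are hypothetical and not part of train.py, and the step count is only a rough estimate, since the Trainer's own rounding may differ.

```python
import math


def effective_batch_size(per_device_batch: int, grad_accum_steps: int,
                         num_devices: int = 1) -> int:
    """Number of samples that contribute to one optimizer step."""
    return per_device_batch * grad_accum_steps * num_devices


def estimated_steps_per_epoch(num_samples: int, per_device_batch: int,
                              grad_accum_steps: int,
                              num_devices: int = 1) -> int:
    """Rough estimate of optimizer steps in one epoch (rounded up)."""
    batch = effective_batch_size(per_device_batch, grad_accum_steps, num_devices)
    return math.ceil(num_samples / batch)


if __name__ == "__main__":
    # Values from the diff: per-device batch 8, gradient accumulation 2.
    print(effective_batch_size(8, 2))
    # Hypothetical dataset of 10,000 samples, single GPU assumed.
    print(estimated_steps_per_epoch(10_000, 8, 2))
```

With gradient accumulation, the learning rate schedule advances once per optimizer step rather than once per forward pass, so the effective batch size is the number to keep in mind when judging whether learning_rate=2e-4 is appropriate.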