diff --git "a/debug.log" "b/debug.log" --- "a/debug.log" +++ "b/debug.log" @@ -1,7 +1,7 @@ - Loading dataset from disk: 0%| | 0/205 [00:00:39] [PID:12849] Skipping import of cpp extensions due to incompatible torch version 2.9.1+cu128 for torchao version 0.13.0 -[2026-02-10 14:55:09,162] [WARNING] [accelerate.utils.dataclasses.__post_init__:1962] [PID:12849] sharding_strategy is deprecated in favor of reshard_after_forward. This will be removed in a future version of Accelerate. -[2026-02-10 15:32:29,946] [WARNING] [py.warnings._showwarnmsg:110] [PID:12849] /root/miniconda3/envs/py3.11/lib/python3.11/site-packages/torch/distributed/fsdp/fully_sharded_data_parallel.py:675: FutureWarning: FSDP.state_dict_type() and FSDP.set_state_dict_type() are being deprecated. Please use APIs, get_state_dict() and set_state_dict(), which can support different parallelisms, FSDP1, FSDP2, DDP. API doc: https://pytorch.org/docs/stable/distributed.checkpoint.html#torch.distributed.checkpoint.state_dict.get_state_dict .Tutorial: https://pytorch.org/tutorials/recipes/distributed_checkpoint_recipe.html . + Loading dataset from disk: 0%| | 0/208 [00:00:39] [PID:14756] Skipping import of cpp extensions due to incompatible torch version 2.9.1+cu128 for torchao version 0.13.0 +[2026-02-10 15:44:41,826] [WARNING] [accelerate.utils.dataclasses.__post_init__:1962] [PID:14756] sharding_strategy is deprecated in favor of reshard_after_forward. This will be removed in a future version of Accelerate. +[2026-02-10 16:12:36,776] [WARNING] [py.warnings._showwarnmsg:110] [PID:14756] /root/miniconda3/envs/py3.11/lib/python3.11/site-packages/torch/distributed/fsdp/fully_sharded_data_parallel.py:675: FutureWarning: FSDP.state_dict_type() and FSDP.set_state_dict_type() are being deprecated. Please use APIs, get_state_dict() and set_state_dict(), which can support different parallelisms, FSDP1, FSDP2, DDP. API doc: https://pytorch.org/docs/stable/distributed.checkpoint.html#torch.distributed.checkpoint.state_dict.get_state_dict .Tutorial: https://pytorch.org/tutorials/recipes/distributed_checkpoint_recipe.html . warnings.warn(