runtime error
Exit code: 1. Reason: 00<?, ?B/s][A model-00004-of-00004.safetensors: 1%| | 18.1M/3.56G [00:01<03:29, 16.9MB/s][A model-00004-of-00004.safetensors: 2%|▏ | 62.5M/3.56G [00:02<01:49, 31.9MB/s][A model-00004-of-00004.safetensors: 3%|▎ | 112M/3.56G [00:03<01:39, 34.8MB/s] [A model-00004-of-00004.safetensors: 9%|▊ | 303M/3.56G [00:04<00:37, 87.1MB/s][A model-00004-of-00004.safetensors: 18%|█▊ | 648M/3.56G [00:05<00:19, 153MB/s] [A model-00004-of-00004.safetensors: 39%|███▉ | 1.40G/3.56G [00:06<00:06, 327MB/s][A model-00004-of-00004.safetensors: 57%|█████▋ | 2.02G/3.56G [00:08<00:03, 407MB/s][A model-00004-of-00004.safetensors: 81%|████████ | 2.89G/3.56G [00:09<00:01, 537MB/s][A model-00004-of-00004.safetensors: 100%|██████████| 3.56G/3.56G [00:09<00:00, 379MB/s] Loading checkpoint shards: 0%| | 0/4 [00:00<?, ?it/s][A Loading checkpoint shards: 100%|██████████| 4/4 [00:00<00:00, 61455.00it/s] generation_config.json: 0%| | 0.00/243 [00:00<?, ?B/s][A generation_config.json: 100%|██████████| 243/243 [00:00<00:00, 1.47MB/s] Traceback (most recent call last): File "/app/app.py", line 19, in <module> llm_model = AutoModelForCausalLM.from_pretrained( File "/usr/local/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py", line 604, in from_pretrained return model_class.from_pretrained( File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 277, in _wrapper return func(*args, **kwargs) File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 5140, in from_pretrained dispatch_model(model, **device_map_kwargs) File "/usr/local/lib/python3.10/site-packages/accelerate/big_modeling.py", line 504, in dispatch_model raise ValueError( ValueError: You are trying to offload the whole model to the disk. Please use the `disk_offload` function instead.
Container logs:
Fetching error logs...