runtime error

Exit code: 1. Reason: G/3.97G [00:10<00:00, 533MB/s] model-00001-of-00002.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 3.97G/3.97G [00:10<00:00, 363MB/s] model-00002-of-00002.safetensors: 0%| | 0.00/2.20G [00:00<?, ?B/s] model-00002-of-00002.safetensors: 2%|▏ | 50.6M/2.20G [00:01<00:46, 46.8MB/s] model-00002-of-00002.safetensors: 12%|β–ˆβ– | 275M/2.20G [00:02<00:13, 146MB/s]  model-00002-of-00002.safetensors: 30%|β–ˆβ–ˆβ–‰ | 660M/2.20G [00:03<00:06, 245MB/s] model-00002-of-00002.safetensors: 54%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 1.19G/2.20G [00:04<00:02, 353MB/s] model-00002-of-00002.safetensors: 88%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 1.94G/2.20G [00:05<00:00, 485MB/s] model-00002-of-00002.safetensors: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2.20G/2.20G [00:05<00:00, 405MB/s] Loading checkpoint shards: 0%| | 0/2 [00:00<?, ?it/s] Loading checkpoint shards: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2/2 [00:00<00:00, 30840.47it/s] generation_config.json: 0%| | 0.00/242 [00:00<?, ?B/s] generation_config.json: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 242/242 [00:00<00:00, 2.62MB/s] Traceback (most recent call last): File "/home/user/app/app.py", line 560, in <module> chat_model_state = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype="auto", device_map="auto") File "/usr/local/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py", line 600, in from_pretrained return model_class.from_pretrained( File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 311, in _wrapper return func(*args, **kwargs) File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 4939, in from_pretrained dispatch_model(model, **device_map_kwargs) File "/usr/local/lib/python3.10/site-packages/accelerate/big_modeling.py", line 504, in dispatch_model raise ValueError( ValueError: You are trying to offload the whole model to the disk. Please use the `disk_offload` function instead.

Container logs:

Fetching error logs...