runtime error

Exit code: 1. Reason: tensors: 100%|██████████| 4.54G/4.54G [00:04<00:00, 910MB/s] Downloading shards: 100%|██████████| 3/3 [00:18<00:00, 6.00s/it] Downloading shards: 100%|██████████| 3/3 [00:18<00:00, 6.25s/it] Loading checkpoint shards: 0%| | 0/3 [00:00<?, ?it/s] Loading checkpoint shards: 33%|███▎ | 1/3 [01:01<02:03, 61.98s/it] Loading checkpoint shards: 67%|██████▋ | 2/3 [02:05<01:03, 63.03s/it] Loading checkpoint shards: 100%|██████████| 3/3 [03:03<00:00, 60.43s/it] Loading checkpoint shards: 100%|██████████| 3/3 [03:03<00:00, 61.03s/it] /usr/local/lib/python3.10/site-packages/huggingface_hub/file_download.py:942: FutureWarning: `resume_download` is deprecated and will be removed in version 1.0.0. Downloads always resume when possible. If you want to force a new download, use `force_download=True`. warnings.warn( generation_config.json: 0%| | 0.00/116 [00:00<?, ?B/s] generation_config.json: 100%|██████████| 116/116 [00:00<00:00, 732kB/s] Traceback (most recent call last): File "/home/user/app/app.py", line 75, in <module> model = AutoModelForCausalLM.from_pretrained( File "/usr/local/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py", line 561, in from_pretrained return model_class.from_pretrained( File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 3558, in from_pretrained dispatch_model(model, **device_map_kwargs) File "/usr/local/lib/python3.10/site-packages/accelerate/big_modeling.py", line 502, in dispatch_model model.to(device) File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 2534, in to raise ValueError( ValueError: `.to` is not supported for `4-bit` or `8-bit` bitsandbytes models. Please use the model as it is, since the model has already been set to the correct devices and casted to the correct `dtype`.

Container logs:

Fetching error logs...