runtime error
Exit code: 3. Reason: INFO: Started server process [1] INFO: Waiting for application startup. `torch_dtype` is deprecated! Use `dtype` instead! Fetching 5 files: 0%| | 0/5 [00:00<?, ?it/s] Fetching 5 files: 20%|██ | 1/5 [01:05<04:23, 65.79s/it] Fetching 5 files: 80%|████████ | 4/5 [01:05<00:12, 12.48s/it] Fetching 5 files: 100%|██████████| 5/5 [01:05<00:00, 13.18s/it] Loading checkpoint shards: 0%| | 0/5 [00:00<?, ?it/s] Loading checkpoint shards: 100%|██████████| 5/5 [00:00<00:00, 102801.57it/s] ERROR: Traceback (most recent call last): File "/usr/local/lib/python3.9/site-packages/starlette/routing.py", line 694, in lifespan async with self.lifespan_context(app) as maybe_state: File "/usr/local/lib/python3.9/site-packages/starlette/routing.py", line 571, in __aenter__ await self._router.startup() File "/usr/local/lib/python3.9/site-packages/starlette/routing.py", line 673, in startup handler() File "/app/main.py", line 64, in startup_event model = AutoModelForCausalLM.from_pretrained( File "/usr/local/lib/python3.9/site-packages/transformers/models/auto/auto_factory.py", line 604, in from_pretrained return model_class.from_pretrained( File "/usr/local/lib/python3.9/site-packages/transformers/modeling_utils.py", line 277, in _wrapper return func(*args, **kwargs) File "/usr/local/lib/python3.9/site-packages/transformers/modeling_utils.py", line 5140, in from_pretrained dispatch_model(model, **device_map_kwargs) File "/usr/local/lib/python3.9/site-packages/accelerate/big_modeling.py", line 504, in dispatch_model raise ValueError( ValueError: You are trying to offload the whole model to the disk. Please use the `disk_offload` function instead. ERROR: Application startup failed. Exiting.
Container logs:
Fetching error logs...