runtime error

Exit code: 1. Reason: [00:00<00:00, 1.24MB/s]
tokenizer_config.json: 0%| | 0.00/7.30k [00:00<?, ?B/s]
tokenizer_config.json: 100%|██████████| 7.30k/7.30k [00:00<00:00, 30.9MB/s]
vocab.json: 0%| | 0.00/2.78M [00:00<?, ?B/s]
vocab.json: 100%|██████████| 2.78M/2.78M [00:00<00:00, 36.7MB/s]
merges.txt: 0%| | 0.00/1.67M [00:00<?, ?B/s]
merges.txt: 100%|██████████| 1.67M/1.67M [00:00<00:00, 56.1MB/s]
tokenizer.json: 0%| | 0.00/7.03M [00:00<?, ?B/s]
tokenizer.json: 100%|██████████| 7.03M/7.03M [00:00<00:00, 52.9MB/s]
config.json: 0%| | 0.00/660 [00:00<?, ?B/s]
config.json: 100%|██████████| 660/660 [00:00<00:00, 4.49MB/s]
`torch_dtype` is deprecated! Use `dtype` instead!
Traceback (most recent call last):
  File "/app/app.py", line 79, in <module>
    rag_chain = build_chain()
  File "/app/app.py", line 39, in build_chain
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        device_map="auto",  # change to "auto" if you have a GPU Space
        torch_dtype=torch.float16
    )
  File "/usr/local/lib/python3.13/site-packages/transformers/models/auto/auto_factory.py", line 604, in from_pretrained
    return model_class.from_pretrained(
           ~~~~~~~~~~~~~~~~~~~~~~~~~~~^
        pretrained_model_name_or_path, *model_args, config=config, **hub_kwargs, **kwargs
        ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
    )
    ^
  File "/usr/local/lib/python3.13/site-packages/transformers/modeling_utils.py", line 277, in _wrapper
    return func(*args, **kwargs)
  File "/usr/local/lib/python3.13/site-packages/transformers/modeling_utils.py", line 4806, in from_pretrained
    raise ValueError(
    ...<2 lines>...
    )
ValueError: Using a `device_map`, `tp_plan`, `torch.device` context manager or setting `torch.set_default_device(device)` requires `accelerate`. You can install it with `pip install accelerate`
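
The traceback points at two things in `build_chain`: `device_map="auto"` is passed while `accelerate` is not installed, and `torch_dtype` is deprecated in favor of `dtype`. The most direct fix is the one the message itself gives, installing `accelerate` in the image. As a defensive complement, the sketch below only passes `device_map` when `accelerate` can be imported. The helper names `accelerate_available` and `build_load_kwargs` are hypothetical and do not come from `app.py`; this is one possible arrangement, not the app's actual code.

```python
import importlib.util


def accelerate_available() -> bool:
    """Return True if the `accelerate` package can be found on the import path."""
    return importlib.util.find_spec("accelerate") is not None


def build_load_kwargs(dtype=None, device_map="auto", has_accelerate=None):
    """Assemble keyword arguments for `from_pretrained` (hypothetical helper).

    `device_map` is included only when `accelerate` is available, because the
    ValueError in the log is raised when it is passed without that package.
    """
    if has_accelerate is None:
        has_accelerate = accelerate_available()

    kwargs = {}
    if dtype is not None:
        # The log warns that `torch_dtype` is deprecated; `dtype` replaces it.
        kwargs["dtype"] = dtype
    if has_accelerate:
        kwargs["device_map"] = device_map
    return kwargs


# Intended use inside build_chain(), assuming torch and transformers are imported:
#   model = AutoModelForCausalLM.from_pretrained(
#       model_id, **build_load_kwargs(dtype=torch.float16)
#   )
```

Without `accelerate`, the call falls back to the library's default placement rather than failing at startup. Whether that fallback is acceptable depends on the hardware the Space runs on.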
