runtime error
Exit code: 1. Reason: .20G [00:02<00:10, 185MB/s]
model-00002-of-00002.safetensors: 29%|██▊ | 630M/2.20G [00:03<00:06, 234MB/s]
model-00002-of-00002.safetensors: 45%|████▌ | 998M/2.20G [00:04<00:04, 286MB/s]
model-00002-of-00002.safetensors: 100%|██████████| 2.20G/2.20G [00:04<00:00, 445MB/s]
Loading checkpoint shards: 0%| | 0/2 [00:00<?, ?it/s]
Loading checkpoint shards: 100%|██████████| 2/2 [00:00<00:00, 44384.17it/s]
generation_config.json: 0%| | 0.00/242 [00:00<?, ?B/s]
generation_config.json: 100%|██████████| 242/242 [00:00<00:00, 1.55MB/s]
Traceback (most recent call last):
  File "/usr/local/bin/f5-tts_infer-gradio", line 3, in <module>
    from f5_tts.infer.infer_gradio import main
  File "/app/src/f5-tts/src/f5_tts/infer/infer_gradio.py", line 584, in <module>
    chat_model_state = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype="auto", device_map="auto")
  File "/usr/local/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py", line 604, in from_pretrained
    return model_class.from_pretrained(
  File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 277, in _wrapper
    return func(*args, **kwargs)
  File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 5140, in from_pretrained
    dispatch_model(model, **device_map_kwargs)
  File "/usr/local/lib/python3.10/site-packages/accelerate/big_modeling.py", line 504, in dispatch_model
    raise ValueError(
ValueError: You are trying to offload the whole model to the disk. Please use the `disk_offload` function instead.
Traceback (most recent call last):
  File "/app/f5_tts_app.py", line 11, in <module>
    subprocess.run(command, shell=True, check=True)
  File "/usr/local/lib/python3.10/subprocess.py", line 526, in run
    raise CalledProcessError(retcode, process.args,
subprocess.CalledProcessError: Command 'f5-tts_infer-gradio' returned non-zero exit status 1.