runtime error
Exit code: 1. Reason: u accept the terms and conditions of this license:
https://developer.nvidia.com/ngc/nvidia-deep-learning-container-license

A copy of this license is made available in this container at /NGC-DL-CONTAINER-LICENSE for your convenience.

WARNING: The NVIDIA Driver was not detected. GPU functionality will not be available.
   Use the NVIDIA Container Toolkit to start this container with GPU support; see
   https://docs.nvidia.com/datacenter/cloud-native/ .

Health check server on :7860
Note: Environment variable `HF_TOKEN` is set and is the current active token independently from the token you've just configured.
Logged in to HF Hub
Downloading step-200 checkpoint from Hub...
Fetching 12 files:   0%|          | 0/12 [00:00<?, ?it/s]
Fetching 12 files:  25%|██▌       | 3/12 [00:01<00:04, 2.02it/s]
Fetching 12 files: 100%|██████████| 12/12 [00:01<00:00, 8.08it/s]
Checkpoint downloaded to /tmp/dm-lora/checkpoint-200
Files: ['README.md', 'adapter_config.json', 'tokenizer_config.json', 'chat_template.jinja', 'processor_config.json', 'trainer_state.json', 'scheduler.pt', 'tokenizer.json', 'training_args.bin', 'rng_state.pth', 'optimizer.pt', 'adapter_model.safetensors']
Loading unsloth/Qwen3.5-9B via Unsloth (bf16)...
Traceback (most recent call last):
  File "/app/train_dm_lora.py", line 86, in <module>
    from unsloth import FastLanguageModel
  File "/usr/local/lib/python3.10/dist-packages/unsloth/__init__.py", line 105, in <module>
    import unsloth_zoo
  File "/usr/local/lib/python3.10/dist-packages/unsloth_zoo/__init__.py", line 220, in <module>
    from .device_type import (
  File "/usr/local/lib/python3.10/dist-packages/unsloth_zoo/device_type.py", line 231, in <module>
    DEVICE_TYPE : str = get_device_type()
  File "/usr/local/lib/python3.10/dist-packages/unsloth_zoo/device_type.py", line 218, in get_device_type
    raise NotImplementedError("Unsloth cannot find any torch accelerator? You need a GPU.")
NotImplementedError: Unsloth cannot find any torch accelerator? You need a GPU.
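The traceback shows the script failing at `from unsloth import FastLanguageModel` because no accelerator is visible, which matches the earlier NVIDIA driver warning. One possible way to fail earlier, with a shorter message and before the checkpoint download, is a small pre-flight check. The sketch below is an assumption rather than part of `train_dm_lora.py`: the helper names `torch_accelerator_available` and `ensure_accelerator` are hypothetical, and it checks only CUDA via `torch.cuda.is_available()`, whereas Unsloth may accept other accelerator types.

```python
import importlib.util
import sys


def torch_accelerator_available() -> bool:
    """Return True if PyTorch is importable and reports a CUDA device.

    Assumption: only CUDA is checked; other accelerator backends are ignored.
    """
    if importlib.util.find_spec("torch") is None:
        return False  # PyTorch is not installed in this environment
    try:
        import torch
    except ImportError:
        return False  # PyTorch is present but cannot be imported
    return bool(torch.cuda.is_available())


def ensure_accelerator(available: bool) -> None:
    """Exit with a one-line message instead of a deep import-time traceback."""
    if not available:
        # sys.exit with a string prints it to stderr and exits with status 1.
        sys.exit("No GPU visible to PyTorch; start the container with GPU support.")


# Hypothetical usage at the top of a training script:
#   ensure_accelerator(torch_accelerator_available())
#   from unsloth import FastLanguageModel  # imported only after the check passes
```

Running the check before any download or model load would keep the failure cheap. It does not fix the underlying cause, which according to the log is that the container was started without GPU support.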