Spaces:

alex4cip
/

simple-chat

Sleeping

alex4cip Claude commited on Oct 30, 2025

Commit

2c96300

1 Parent(s): 51c066f

feat: Add RTX 5080 support and remove requirements-local.txt

- Add CUDA compatibility testing for unsupported GPUs (RTX 5080/Blackwell)
- Detect compute capability and fall back to CPU for sm_120+ GPUs
- Update setup.py with PyTorch nightly builds for Blackwell GPUs
- Add comprehensive GPU troubleshooting guide in INSTALL.md
- Remove requirements-local.txt (deprecated in favor of setup.py)
- Enhance hardware detection with cuda_compatible flag

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>

Files changed (5) hide show

INSTALL.md +136 -17
RTX_5080_README.md +94 -0
app.py +68 -11
requirements-local.txt +0 -25
setup.py +167 -17

INSTALL.md CHANGED Viewed

@@ -45,23 +45,44 @@ requirements-local.txt    # 로컬용 (PyTorch >=2.2.0)
 ---
-## 방법 2: setup.py (자동 감지)
 ### 설치
 ```bash
 python setup.py
 ```
-### 동작 방식
-1. `SPACE_ID` 환경 변수 확인
-2. HF Spaces → PyTorch 2.2.0 설치
-3. 로컬 → PyTorch 최신 버전 설치
-4. Apple Silicon 감지 시 MPS 지원
 ### 장점
-- 자동 환경 감지
-- 한 명령으로 설치
-- 플랫폼별 최적화
 ---
@@ -157,6 +178,92 @@ A: PyTorch 버전만 다르고 나머지는 동일하게 유지하세요.
 ## 문제 해결
 ### ImportError: spaces
 **로컬 환경**:
 ```
@@ -165,15 +272,27 @@ A: PyTorch 버전만 다르고 나머지는 동일하게 유지하세요.
 ### PyTorch 버전 충돌
 ```bash
-pip uninstall torch -y
-pip install -r requirements-local.txt
 ```
-### CUDA 버전 불일치
-```bash
-# CUDA 버전 확인
-nvcc --version
-# 적절한 PyTorch 설치
-pip install torch --index-url https://download.pytorch.org/whl/cu121
 ```

 ---
+## 방법 2: setup.py (자동 감지) ⭐ 새로운 CUDA 지원!
 ### 설치
 ```bash
+# 가상환경 생성 및 활성화
+python -m venv venv
+source venv/bin/activate  # Windows: venv\Scripts\activate
+# 스마트 설치 실행
 python setup.py
 ```
+### 동작 방식 (NEW! CUDA 자동 감지)
+1. **환경 감지**: `SPACE_ID` 환경 변수 확인
+2. **HF Spaces**: PyTorch 2.2.0 설치 (ZeroGPU 호환)
+3. **로컬 환경**:
+   - 🔍 **NVIDIA GPU 감지**: `nvidia-smi` 실행
+   - 🔍 **CUDA 버전 감지**: `nvcc --version` 또는 nvidia-smi에서 추출
+   - ✅ **CUDA별 PyTorch 설치**:
+     - CUDA 11.8 → PyTorch cu118
+     - CUDA 12.1-12.3 → PyTorch cu121
+     - CUDA 12.4-12.8 → PyTorch cu124
+   - 🍎 **Apple Silicon**: MPS 지원
+   - 💻 **GPU 없음**: CPU 전용 PyTorch
+### 지원하는 CUDA 버전
+| CUDA 버전 | PyTorch 변형 | Index URL |
+|-----------|--------------|-----------|
+| 11.8 | cu118 | https://download.pytorch.org/whl/cu118 |
+| 12.1-12.3 | cu121 | https://download.pytorch.org/whl/cu121 |
+| 12.4-12.8 | cu124 | https://download.pytorch.org/whl/cu124 |
 ### 장점
+- ✅ **완전 자동 CUDA 감지** (NEW!)
+- ✅ 한 명령으로 설치
+- ✅ 플랫폼별 최적화
+- ✅ 설치 후 자동 검증
+- ✅ 실패 시 CPU로 폴백
 ---
 ## 문제 해결
+### 🔥 GPU가 감지되지 않음 (torch.cuda.is_available() = False)
+#### 증상 1: Driver/library version mismatch
+```bash
+nvidia-smi
+# 출력: Failed to initialize NVML: Driver/library version mismatch
+```
+**원인**: NVIDIA 드라이버 업데이트 후 재부팅하지 않음
+**해결책**:
+```bash
+# 시스템 재부팅 (가장 간단하고 효과적)
+sudo reboot
+```
+재부팅 후 다시 확인:
+```bash
+nvidia-smi
+python setup.py  # PyTorch 재설치
+```
+#### 증상 2: PyTorch가 CPU 버전으로 설치됨
+```python
+import torch
+print(torch.__version__)  # 출력: 2.9.0+cpu (CUDA 없음)
+```
+**원인**: pip install torch가 기본 CPU 버전을 설치함
+**해결책**:
+```bash
+# 현재 PyTorch 제거
+pip uninstall torch torchvision torchaudio -y
+# setup.py로 재설치 (자동 CUDA 감지)
+python setup.py
+```
+#### 증상 3: CUDA 버전 불일치
+```python
+# 시스템 CUDA: 12.8
+# PyTorch CUDA: 12.4
+# 오류: forward compatibility was attempted on non supported HW
+```
+**원인**: PyTorch CUDA 버전과 드라이버 CUDA 버전 불일치
+**해결책**:
+```bash
+# 1. 드라이버 재부팅 (우선 시도)
+sudo reboot
+# 2. 드라이버 재설치 (재부팅으로 안 되면)
+sudo ubuntu-drivers autoinstall
+sudo reboot
+# 3. PyTorch 재설치
+python setup.py
+```
+#### 증상 4: nvidia-smi는 작동하지만 PyTorch에서 GPU 인식 안 됨
+```bash
+nvidia-smi  # ✅ 작동
+python -c "import torch; print(torch.cuda.is_available())"  # ❌ False
+```
+**해결책**:
+```bash
+# PyTorch를 CUDA 버전으로 강제 재설치
+pip uninstall torch torchvision torchaudio -y
+# CUDA 버전 확인
+nvidia-smi | grep "CUDA Version"  # 예: CUDA Version: 12.1
+# 해당 CUDA 버전의 PyTorch 설치
+# CUDA 12.1-12.3
+pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121
+# CUDA 12.4+
+pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu124
+# 검증
+python -c "import torch; print(f'CUDA available: {torch.cuda.is_available()}')"
+```
 ### ImportError: spaces
 **로컬 환경**:
 ```
 ### PyTorch 버전 충돌
 ```bash
+pip uninstall torch torchvision torchaudio -y
+python setup.py  # 자동 CUDA 감지 및 설치
 ```
+### 설치 검증
+```python
+# 완전한 환경 검증
+import torch
+print(f"PyTorch: {torch.__version__}")
+print(f"CUDA available: {torch.cuda.is_available()}")
+print(f"CUDA compiled: {torch.version.cuda}")
+if torch.cuda.is_available():
+    print(f"GPU name: {torch.cuda.get_device_name(0)}")
+    print(f"GPU count: {torch.cuda.device_count()}")
+```
+**예상 출력 (GPU 환경)**:
+```
+PyTorch: 2.5.1+cu124
+CUDA available: True
+CUDA compiled: 12.4
+GPU name: NVIDIA GeForce RTX 4090
+GPU count: 1
 ```

RTX_5080_README.md ADDED Viewed

	@@ -0,0 +1,94 @@

+# RTX 5080 (Blackwell) Compatibility Notice
+## Issue
+The NVIDIA GeForce RTX 5080 uses the Blackwell architecture with compute capability **sm_120** (12.0). As of January 2025, PyTorch does not yet support this compute capability, even in nightly builds.
+### Error Message
+```
+CUDA error: no kernel image is available for execution on the device
+```
+This occurs because PyTorch binaries are not compiled with kernels for sm_120.
+## Current Status
+- **GPU Model**: NVIDIA GeForce RTX 5080
+- **Compute Capability**: sm_120 (12.0)
+- **Driver Version**: 580.95.05 (supports CUDA 13.0)
+- **PyTorch Version**: 2.7.0.dev20250310+cu124 (nightly)
+- **PyTorch Supported Architectures**: sm_50, sm_60, sm_70, sm_75, sm_80, sm_86, sm_90
+- **Support Status**: ❌ Not supported
+## Solution Implemented
+The application now automatically detects Blackwell GPUs (compute capability ≥ 12.0) and falls back to CPU mode:
+1. **Hardware Detection**: `test_cuda_compatibility()` checks compute capability
+2. **Automatic Fallback**: Falls back to CPU if sm_120 is detected
+3. **Clear Messaging**: Displays warnings about unsupported GPU
+## Running the Application
+The app will automatically run in CPU mode:
+```bash
+source venv/bin/activate
+python app.py
+```
+You'll see messages like:
+```
+⚠️  Detected compute capability 12.0 (sm_120)
+   This GPU architecture is not yet supported by PyTorch
+⚠️  Local - CPU fallback (NVIDIA GeForce RTX 5080 not supported by PyTorch)
+```
+## Future Support
+PyTorch support for Blackwell GPUs is expected in future releases. Monitor:
+- https://github.com/pytorch/pytorch/issues
+- https://pytorch.org/get-started/locally/
+When support is added:
+1. Update PyTorch: `pip install --upgrade torch`
+2. The app will automatically detect and use GPU
+## Alternative Solutions
+### 1. Build PyTorch from Source (Advanced)
+```bash
+# Clone PyTorch
+git clone --recursive https://github.com/pytorch/pytorch
+cd pytorch
+# Set CUDA architecture flags
+export TORCH_CUDA_ARCH_LIST="12.0"
+export CUDA_HOME=/usr/local/cuda
+# Build (takes 1-2 hours)
+python setup.py develop
+```
+**Note**: This is time-consuming and may not work until PyTorch officially adds sm_120 support.
+### 2. Use Older GPU (Temporary)
+If available, use an older GPU (RTX 40xx, 30xx, etc.) that has compute capability ≤ 9.0.
+### 3. Wait for Official Support
+The most practical approach is to use CPU mode until PyTorch adds official support.
+## Performance Notes
+**CPU Mode Performance**:
+- Inference is slower but functional
+- Small models (< 1B parameters): Acceptable
+- Large models (> 7B parameters): Very slow
+- Consider using smaller models for now
+## Questions?
+Check PyTorch compatibility:
+```bash
+python -c "import torch; print(f'CUDA available: {torch.cuda.is_available()}'); print(f'Compute capability: {torch.cuda.get_device_capability(0) if torch.cuda.is_available() else \"N/A\"}')"
+```

app.py CHANGED Viewed

@@ -14,6 +14,38 @@ import torch
 # Hardware Environment Detection
 # ============================================================================
 def detect_hardware_environment():
     """
     Comprehensive hardware environment detection
@@ -26,7 +58,8 @@ def detect_hardware_environment():
             'gpu_name': str or None,
             'cpu_count': int,
             'os': 'Darwin' | 'Linux' | 'Windows',
-            'description': str
         }
     """
     env_info = {
@@ -36,7 +69,8 @@ def detect_hardware_environment():
         'gpu_name': None,
         'cpu_count': os.cpu_count() or 1,
         'os': platform.system(),
-        'description': ''
     }
     # Check if running on HF Spaces
@@ -53,6 +87,7 @@ def detect_hardware_environment():
             env_info['gpu_available'] = True
             env_info['gpu_name'] = 'NVIDIA H200 (ZeroGPU)'
             env_info['description'] = f"🚀 HF Spaces - ZeroGPU ({space_id})"
         except ImportError:
             # Check CPU tier by memory/CPU count
             cpu_count = env_info['cpu_count']
@@ -65,21 +100,37 @@ def detect_hardware_environment():
     else:
         # Local environment detection
         if torch.cuda.is_available():
-            env_info['hardware'] = 'local_gpu'
-            env_info['gpu_available'] = True
             try:
-                env_info['gpu_name'] = torch.cuda.get_device_name(0)
             except:
-                env_info['gpu_name'] = 'CUDA GPU'
-            env_info['description'] = f"🖥️  Local - GPU ({env_info['gpu_name']})"
         elif torch.backends.mps.is_available():
             env_info['hardware'] = 'local_gpu'
             env_info['gpu_available'] = True
             env_info['gpu_name'] = 'Apple Silicon GPU (MPS)'
             env_info['description'] = f"🍎 Local - Apple Silicon GPU"
         else:
             env_info['hardware'] = 'local_cpu'
             env_info['description'] = f"💻 Local - CPU ({env_info['os']}, {env_info['cpu_count']} cores)"
     return env_info
@@ -270,7 +321,7 @@ def load_model_once(model_index=None):
             print(f"   🗑️  Unloading previous model from memory...")
             del model
             del tokenizer
-            if torch.cuda.is_available():
                 torch.cuda.empty_cache()
         # Load tokenizer
@@ -284,17 +335,23 @@ def load_model_once(model_index=None):
         if tokenizer.pad_token is None:
             tokenizer.pad_token = tokenizer.eos_token
-        # Detect device
-        device = "cuda" if torch.cuda.is_available() else "cpu"
         print(f"📍 Using device: {device}")
         # Load model with appropriate settings
         if is_cached:
             print(f"   📀 Loading model from disk cache (15-30 seconds)...")
         else:
             print(f"   🌐 Downloading model from network (5-20 minutes, first time only)...")
         if device == "cuda":
-            # GPU available (CPU Upgrade with GPU or ZeroGPU)
             model = AutoModelForCausalLM.from_pretrained(
                 model_name,
                 token=HF_TOKEN,

 # Hardware Environment Detection
 # ============================================================================
+def test_cuda_compatibility():
+    """
+    Test if CUDA actually works on this GPU.
+    RTX 5080 and other Blackwell GPUs (sm_120) are not yet supported by PyTorch.
+    Returns: True if CUDA works, False otherwise
+    """
+    if not torch.cuda.is_available():
+        return False
+    try:
+        # Check compute capability first
+        compute_cap = torch.cuda.get_device_capability(0)
+        compute_cap_major = compute_cap[0]
+        compute_cap_minor = compute_cap[1]
+        # sm_120 (compute capability 12.0) is Blackwell and not yet supported
+        if compute_cap_major >= 12:
+            print(f"⚠️  Detected compute capability {compute_cap_major}.{compute_cap_minor} (sm_{compute_cap_major}{compute_cap_minor})")
+            print(f"   This GPU architecture is not yet supported by PyTorch")
+            return False
+        # Try a simple tensor operation for other cases
+        x = torch.randn(10, 10).cuda()
+        y = torch.randn(10, 10).cuda()
+        z = torch.matmul(x, y)
+        z.cpu()
+        return True
+    except Exception as e:
+        print(f"⚠️  CUDA test failed: {e}")
+        print(f"   Will fall back to CPU mode")
+        return False
 def detect_hardware_environment():
     """
     Comprehensive hardware environment detection
             'gpu_name': str or None,
             'cpu_count': int,
             'os': 'Darwin' | 'Linux' | 'Windows',
+            'description': str,
+            'cuda_compatible': bool
         }
     """
     env_info = {
         'gpu_name': None,
         'cpu_count': os.cpu_count() or 1,
         'os': platform.system(),
+        'description': '',
+        'cuda_compatible': False
     }
     # Check if running on HF Spaces
             env_info['gpu_available'] = True
             env_info['gpu_name'] = 'NVIDIA H200 (ZeroGPU)'
             env_info['description'] = f"🚀 HF Spaces - ZeroGPU ({space_id})"
+            env_info['cuda_compatible'] = True
         except ImportError:
             # Check CPU tier by memory/CPU count
             cpu_count = env_info['cpu_count']
     else:
         # Local environment detection
         if torch.cuda.is_available():
+            # CUDA is available, but test if it actually works
+            cuda_works = test_cuda_compatibility()
             try:
+                gpu_name = torch.cuda.get_device_name(0)
             except:
+                gpu_name = 'CUDA GPU'
+            if cuda_works:
+                env_info['hardware'] = 'local_gpu'
+                env_info['gpu_available'] = True
+                env_info['gpu_name'] = gpu_name
+                env_info['description'] = f"🖥️  Local - GPU ({gpu_name})"
+                env_info['cuda_compatible'] = True
+            else:
+                # CUDA detected but not working (e.g., unsupported compute capability)
+                env_info['hardware'] = 'local_cpu'
+                env_info['gpu_available'] = False
+                env_info['gpu_name'] = gpu_name + " (Unsupported - using CPU)"
+                env_info['description'] = f"⚠️  Local - CPU fallback ({gpu_name} not supported by PyTorch)"
+                env_info['cuda_compatible'] = False
         elif torch.backends.mps.is_available():
             env_info['hardware'] = 'local_gpu'
             env_info['gpu_available'] = True
             env_info['gpu_name'] = 'Apple Silicon GPU (MPS)'
             env_info['description'] = f"🍎 Local - Apple Silicon GPU"
+            env_info['cuda_compatible'] = False
         else:
             env_info['hardware'] = 'local_cpu'
             env_info['description'] = f"💻 Local - CPU ({env_info['os']}, {env_info['cpu_count']} cores)"
+            env_info['cuda_compatible'] = False
     return env_info
             print(f"   🗑️  Unloading previous model from memory...")
             del model
             del tokenizer
+            if HW_ENV['cuda_compatible']:
                 torch.cuda.empty_cache()
         # Load tokenizer
         if tokenizer.pad_token is None:
             tokenizer.pad_token = tokenizer.eos_token
+        # Detect device - use hardware environment detection
+        use_gpu = HW_ENV['gpu_available'] and HW_ENV['cuda_compatible']
+        device = "cuda" if use_gpu else "cpu"
         print(f"📍 Using device: {device}")
+        if not use_gpu and torch.cuda.is_available():
+            print(f"   ⚠️  GPU detected but not compatible with PyTorch")
+            print(f"   ℹ️  RTX 5080 (Blackwell/sm_120) requires PyTorch with sm_120 support")
+            print(f"   ℹ️  Falling back to CPU mode")
         # Load model with appropriate settings
         if is_cached:
             print(f"   📀 Loading model from disk cache (15-30 seconds)...")
         else:
             print(f"   🌐 Downloading model from network (5-20 minutes, first time only)...")
         if device == "cuda":
+            # GPU available and compatible
             model = AutoModelForCausalLM.from_pretrained(
                 model_name,
                 token=HF_TOKEN,

requirements-local.txt DELETED Viewed

@@ -1,25 +0,0 @@
-# Local Development Requirements
-# Use this for Mac/Linux/Windows local development
-# Install: pip install -r requirements-local.txt
-# Gradio Framework
-gradio==5.49.1
-# ML Core Libraries (Latest versions for local)
-transformers==4.57.1
-torch>=2.2.0  # No ZeroGPU restriction - use latest
-safetensors==0.6.2
-accelerate==0.26.1
-# Tokenizers & Serialization
-sentencepiece==0.2.0
-protobuf==4.25.1
-# HF Hub & Authentication
-huggingface-hub>=0.19.0
-# Environment Management
-python-dotenv==1.0.0
-# Note: 'spaces' package not needed for local development
-# It will be imported conditionally and gracefully fail

setup.py CHANGED Viewed

@@ -14,25 +14,142 @@ def detect_environment():
     is_hf_spaces = os.environ.get('SPACE_ID') is not None
     return 'hf_spaces' if is_hf_spaces else 'local'
-def get_pytorch_version(env):
-    """Get appropriate PyTorch version for environment"""
     if env == 'hf_spaces':
         # ZeroGPU compatible version
-        return 'torch==2.2.0'
     else:
-        # Latest version for local
         # Check if Apple Silicon
-        if platform.system() == 'Darwin' and platform.machine() == 'arm64':
-            return 'torch>=2.2.0'  # MPS support
         else:
-            return 'torch>=2.2.0'  # CUDA/CPU
 def install_dependencies():
     """Install dependencies based on environment"""
     env = detect_environment()
-    print(f"Detected environment: {env}")
-    # Base dependencies
     base_deps = [
         'gradio==5.49.1',
         'transformers==4.57.1',
@@ -44,25 +161,58 @@ def install_dependencies():
         'python-dotenv==1.0.0',
     ]
-    # Add PyTorch with appropriate version
-    pytorch = get_pytorch_version(env)
-    base_deps.insert(2, pytorch)
     # Add spaces for HF Spaces only
     if env == 'hf_spaces':
         base_deps.append('spaces')
-    print(f"Installing PyTorch: {pytorch}")
-    print(f"Installing {len(base_deps)} packages...")
-    # Install all dependencies
     subprocess.check_call([
         sys.executable, '-m', 'pip', 'install', '--upgrade'
     ] + base_deps)
     print("✅ Installation complete!")
     print(f"Environment: {env}")
-    print(f"PyTorch: {pytorch}")
 if __name__ == '__main__':
     install_dependencies()

     is_hf_spaces = os.environ.get('SPACE_ID') is not None
     return 'hf_spaces' if is_hf_spaces else 'local'
+def detect_gpu_info():
+    """Detect GPU model and CUDA version"""
+    gpu_model = None
+    cuda_version = None
+    try:
+        # Try nvidia-smi first
+        result = subprocess.run(
+            ['nvidia-smi', '--query-gpu=gpu_name', '--format=csv,noheader'],
+            capture_output=True,
+            text=True,
+            timeout=5
+        )
+        if result.returncode == 0:
+            gpu_model = result.stdout.strip()
+            print(f"   Detected GPU: {gpu_model}")
+            # Try to get CUDA version from nvcc
+            try:
+                nvcc_result = subprocess.run(
+                    ['nvcc', '--version'],
+                    capture_output=True,
+                    text=True,
+                    timeout=5
+                )
+                if nvcc_result.returncode == 0:
+                    output = nvcc_result.stdout
+                    # Parse CUDA version (e.g., "release 12.1")
+                    if 'release' in output:
+                        version = output.split('release')[1].strip().split(',')[0].strip()
+                        major_minor = '.'.join(version.split('.')[:2])
+                        print(f"   Detected CUDA version: {major_minor}")
+                        cuda_version = major_minor
+            except (FileNotFoundError, subprocess.TimeoutExpired):
+                pass
+            # If nvcc not found, try to get CUDA version from nvidia-smi output
+            if not cuda_version:
+                result = subprocess.run(
+                    ['nvidia-smi'],
+                    capture_output=True,
+                    text=True,
+                    timeout=5
+                )
+                for line in result.stdout.split('\n'):
+                    if 'CUDA Version:' in line:
+                        version = line.split('CUDA Version:')[1].strip().split()[0]
+                        major_minor = '.'.join(version.split('.')[:2])
+                        print(f"   Detected CUDA version from nvidia-smi: {major_minor}")
+                        cuda_version = major_minor
+                        break
+            # GPU detected but CUDA version unknown, use latest
+            if not cuda_version:
+                print("   NVIDIA GPU detected but CUDA version unknown, using CUDA 12.4")
+                cuda_version = '12.4'
+    except (FileNotFoundError, subprocess.TimeoutExpired):
+        pass
+    return gpu_model, cuda_version
+def requires_pytorch_2_6(gpu_model):
+    """Check if GPU requires PyTorch 2.6.0+ (for Blackwell/compute capability 12.0+)"""
+    if not gpu_model:
+        return False
+    # Blackwell GPUs (RTX 50xx series) require PyTorch 2.6.0+
+    blackwell_gpus = ['rtx 50', 'rtx50', '5080', '5090', '5070']
+    gpu_lower = gpu_model.lower()
+    return any(model in gpu_lower for model in blackwell_gpus)
+def get_pytorch_install_command(env):
+    """Get appropriate PyTorch install command for environment"""
     if env == 'hf_spaces':
         # ZeroGPU compatible version
+        return (['torch==2.2.0'], None)
     else:
+        # Local environment
+        system = platform.system()
         # Check if Apple Silicon
+        if system == 'Darwin' and platform.machine() == 'arm64':
+            print("   Detected Apple Silicon, installing PyTorch with MPS support")
+            return (['torch>=2.2.0'], None)
+        # Check for CUDA on Linux/Windows
+        elif system in ['Linux', 'Windows']:
+            gpu_model, cuda_version = detect_gpu_info()
+            if cuda_version:
+                # Check if GPU requires PyTorch 2.6.0+
+                needs_pytorch_2_6 = requires_pytorch_2_6(gpu_model)
+                if needs_pytorch_2_6:
+                    print(f"   ⚠️  Detected Blackwell GPU ({gpu_model})")
+                    print(f"   Installing PyTorch nightly with CUDA 12.4+ support (required for compute capability 12.0)")
+                    print(f"   Note: Stable PyTorch releases do not yet fully support sm_120")
+                    # Use nightly build for Blackwell GPU support
+                    return (['torch', 'torchvision', 'torchaudio'], 'https://download.pytorch.org/whl/nightly/cu124')
+                # Map CUDA version to PyTorch index URL
+                cuda_map = {
+                    '11.8': ('cu118', 'https://download.pytorch.org/whl/cu118'),
+                    '12.1': ('cu121', 'https://download.pytorch.org/whl/cu121'),
+                    '12.2': ('cu121', 'https://download.pytorch.org/whl/cu121'),  # Use 12.1 for 12.2
+                    '12.3': ('cu121', 'https://download.pytorch.org/whl/cu121'),  # Use 12.1 for 12.3
+                    '12.4': ('cu124', 'https://download.pytorch.org/whl/cu124'),
+                    '12.5': ('cu124', 'https://download.pytorch.org/whl/cu124'),  # Use 12.4 for 12.5
+                    '12.6': ('cu124', 'https://download.pytorch.org/whl/cu124'),  # Use 12.4 for 12.6
+                    '12.7': ('cu124', 'https://download.pytorch.org/whl/cu124'),  # Use 12.4 for 12.7
+                    '12.8': ('cu124', 'https://download.pytorch.org/whl/cu124'),  # Use 12.4 for 12.8
+                    '13.0': ('cu124', 'https://download.pytorch.org/whl/cu124'),  # Use 12.4 for 13.0
+                }
+                cuda_suffix, index_url = cuda_map.get(cuda_version, ('cu124', 'https://download.pytorch.org/whl/cu124'))
+                print(f"   Installing PyTorch with CUDA {cuda_version} support ({cuda_suffix})")
+                return (['torch', 'torchvision', 'torchaudio'], index_url)
+            else:
+                print("   No CUDA detected, installing CPU-only PyTorch")
+                return (['torch>=2.2.0'], None)
         else:
+            # Other systems, default to CPU
+            return (['torch>=2.2.0'], None)
 def install_dependencies():
     """Install dependencies based on environment"""
     env = detect_environment()
+    print("=" * 60)
+    print(f"🔍 Detected environment: {env}")
+    print("=" * 60)
+    # Get PyTorch installation command
+    pytorch_packages, index_url = get_pytorch_install_command(env)
+    # Base dependencies (excluding PyTorch)
     base_deps = [
         'gradio==5.49.1',
         'transformers==4.57.1',
         'python-dotenv==1.0.0',
     ]
     # Add spaces for HF Spaces only
     if env == 'hf_spaces':
         base_deps.append('spaces')
+    print("=" * 60)
+    print(f"📦 Installing PyTorch...")
+    print("=" * 60)
+    # Install PyTorch (with optional index URL for CUDA)
+    pytorch_cmd = [sys.executable, '-m', 'pip', 'install', '--upgrade'] + pytorch_packages
+    if index_url:
+        pytorch_cmd.extend(['--index-url', index_url])
+    try:
+        subprocess.check_call(pytorch_cmd)
+        print("✅ PyTorch installed successfully!")
+    except subprocess.CalledProcessError as e:
+        print(f"❌ PyTorch installation failed: {e}")
+        print("   Falling back to CPU-only PyTorch...")
+        subprocess.check_call([
+            sys.executable, '-m', 'pip', 'install', '--upgrade', 'torch>=2.2.0'
+        ])
+    print("=" * 60)
+    print(f"📦 Installing remaining dependencies ({len(base_deps)} packages)...")
+    print("=" * 60)
+    # Install remaining dependencies
     subprocess.check_call([
         sys.executable, '-m', 'pip', 'install', '--upgrade'
     ] + base_deps)
+    # Verify PyTorch installation
+    print("=" * 60)
+    print("🔍 Verifying PyTorch installation...")
+    print("=" * 60)
+    try:
+        result = subprocess.run([
+            sys.executable, '-c',
+            'import torch; print(f"PyTorch: {torch.__version__}"); print(f"CUDA available: {torch.cuda.is_available()}"); print(f"CUDA version: {torch.version.cuda if torch.version.cuda else \"N/A\"}")'
+        ], capture_output=True, text=True, timeout=10)
+        print(result.stdout)
+    except Exception as e:
+        print(f"⚠️  Could not verify PyTorch: {e}")
+    print("=" * 60)
     print("✅ Installation complete!")
+    print("=" * 60)
     print(f"Environment: {env}")
+    print(f"PyTorch packages: {', '.join(pytorch_packages)}")
+    if index_url:
+        print(f"Index URL: {index_url}")
 if __name__ == '__main__':
     install_dependencies()