gary-boon Claude Opus 4.5 committed
Commit 65c6e2e · 1 Parent(s): 688efad

docs: Mark GPU HF Space Devstral deployment complete


- A100 (80GB VRAM) configured
- DEFAULT_MODEL=devstral-small set
- Phase 2c fully complete

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Files changed (1):
  docs/devstral-spark-plan-phased.md (+2 -2)
```diff
@@ -1829,12 +1829,12 @@ Before marking each phase complete, verify:
  - Dynamic vocab size from modelInfo
  - Dynamic head_dim derived from actual matrix data
  - Removed hardcoded "64 dimensions" in tutorial
- - [x] **Phase 2c**: API route conversion ✅ COMPLETE (partial)
+ - [x] **Phase 2c**: API route conversion + GPU HF Space ✅ COMPLETE
  - All 8 API routes converted to use backendFetch helper
  - Server-side auth with HF token for private Spaces
  - Per-user backend routing working
+ - GPU HF Space configured: A100 (80GB), DEFAULT_MODEL=devstral-small
  - ⏸️ Spark toggle deferred (no benefit until PyTorch supports GB10)
- - ⏸️ GPU HF Space Devstral deployment pending (requires VRAM upgrade)
  - [ ] **Phase 3**: Deploy Devstral to DGX Spark ⏸️ BLOCKED (PyTorch sm_121 support)
  - [ ] **Phase 4**: Future enhancements (optional)
 
```
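The configuration this commit records (A100 hardware tier plus a `DEFAULT_MODEL` Space variable) can also be scripted with the `huggingface_hub` client. This is a hedged sketch, not the method used in the commit: the Space ID below is a placeholder, and `a100-large` is assumed to be Hugging Face's A100 (80GB) hardware flavor.

```python
"""Sketch: scripting the GPU Space setup described in this commit.

Assumptions (not stated in the commit): the Space ID, and that the
"a100-large" flavor is the A100 80GB tier. Requires
`pip install huggingface_hub` and a write-scoped HF token
(HF_TOKEN env var or `huggingface-cli login`).
"""

SPACE_ID = "your-username/devstral-space"  # hypothetical placeholder
HARDWARE = "a100-large"                    # assumed A100 (80GB) flavor


def configure_space(space_id: str = SPACE_ID) -> None:
    # Imported lazily so the sketch loads even without the package installed.
    from huggingface_hub import HfApi

    api = HfApi()
    # Request the GPU hardware tier for the Space.
    api.request_space_hardware(repo_id=space_id, hardware=HARDWARE)
    # Set DEFAULT_MODEL as a Space variable, visible to the app at runtime.
    api.add_space_variable(
        repo_id=space_id, key="DEFAULT_MODEL", value="devstral-small"
    )


if __name__ == "__main__":
    configure_space()  # needs network access and a write-scoped token
```

Both `request_space_hardware` and `add_space_variable` are real `HfApi` methods; only the Space ID and the exact hardware flavor string are guesses here.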