Commit ·
aacbb01
1
Parent(s): 25eeaf2
fix: Correct hardware configuration to 4x-l4 (4x NVIDIA L4 GPUs)
Browse filesPrevious: hardware: a10g-large (WRONG - single A10G)
Corrected: hardware: 4x-l4 (CORRECT - 4x L4 GPUs)
Hardware specs:
- 4x NVIDIA L4 GPUs
- 24GB VRAM per GPU (96GB total)
- Ampere architecture with bf16 support
- Multi-GPU training enabled
The multi-GPU code will now properly utilize all 4 GPUs:
- Per-device batch: 2
- Gradient accumulation: 1
- Effective batch size: 8 (2 × 1 × 4 GPUs)
- Expected: ~25% utilization per GPU
README.md
CHANGED
|
@@ -8,7 +8,7 @@ sdk_version: 5.0.0
|
|
| 8 |
app_file: app.py
|
| 9 |
pinned: false
|
| 10 |
license: mit
|
| 11 |
-
hardware:
|
| 12 |
python_version: "3.10"
|
| 13 |
disable_embedding: false
|
| 14 |
---
|
|
|
|
| 8 |
app_file: app.py
|
| 9 |
pinned: false
|
| 10 |
license: mit
|
| 11 |
+
hardware: 4x-l4
|
| 12 |
python_version: "3.10"
|
| 13 |
disable_embedding: false
|
| 14 |
---
|