Aravindhan11's picture
Deploy Intelligent Distributed LLaMA Framework
52510e8 verified
The examples can run on both Nvidia GPU (using NCCL backend) and CPU (using GLOO backend).
```bash
uv run torchrun --nproc_per_node=3 examples/row_parallel_linear_demo.py
```
```bash
uv run torchrun --nproc_per_node=4 examples/row_parallel_linear_demo.py
```