Update README.md
README.md
CHANGED
|
@@ -142,6 +142,7 @@ if use_fp8:
|
|
| 142 |
| Llama-3.1-8B-Instruct-w16a8-rw_with_gw_hp_4nodes | 64 | 3.4517 | 0.7481 | 1.1091 | 0.7481 | 0.7021 | 1.3393 | 0.8660 | 0.9641 | 0.8666 | 0.1732 | 35 |
|
| 143 |
| Llama-3.1-8B-Instruct-w16a8-rw_with_gw_hp_8nodes | 64 | 3.9289 | 0.9702 | 1.6514 | 0.9702 | 1.0127 | 1.5537 | 1.0377 | 1.2222 | 1.0377 | 0.2877 | 17 |
|
| 144 |
|
|
|
|
| 145 |
## Implementation
|
| 146 |
### GPU and Memory Usage Profiling
|
| 147 |
To visualize memory usage, GPU VRAM, utilization, and other metrics, you can profile runs with the **PyTorch Profiler** or **NVIDIA Nsight Systems**.
|
|
|
|
| 142 |
| Llama-3.1-8B-Instruct-w16a8-rw_with_gw_hp_4nodes | 64 | 3.4517 | 0.7481 | 1.1091 | 0.7481 | 0.7021 | 1.3393 | 0.8660 | 0.9641 | 0.8666 | 0.1732 | 35 |
|
| 143 |
| Llama-3.1-8B-Instruct-w16a8-rw_with_gw_hp_8nodes | 64 | 3.9289 | 0.9702 | 1.6514 | 0.9702 | 1.0127 | 1.5537 | 1.0377 | 1.2222 | 1.0377 | 0.2877 | 17 |
|
| 144 |
|
| 145 |
+
|
| 146 |
## Implementation
|
| 147 |
### GPU and Memory Usage Profiling
|
| 148 |
To visualize memory usage, GPU VRAM, utilization, and other metrics, you can profile runs with the **PyTorch Profiler** or **NVIDIA Nsight Systems**.
|
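As a rough illustration of the PyTorch Profiler route, the sketch below profiles one forward/backward pass with memory tracking enabled. The helper name `profile_step` and the toy `Linear` model are assumptions made for this example and are not part of this repository; swap in your own model and batch.

```python
import torch
from torch.profiler import profile, ProfilerActivity


def profile_step(model, inputs):
    """Run one forward/backward pass under torch.profiler (hypothetical helper)."""
    # Always record CPU activity; add CUDA only when a GPU is present.
    activities = [ProfilerActivity.CPU]
    if torch.cuda.is_available():
        activities.append(ProfilerActivity.CUDA)

    # profile_memory=True records tensor allocations and frees per operator.
    with profile(
        activities=activities,
        profile_memory=True,
        record_shapes=True,
    ) as prof:
        loss = model(inputs).sum()
        loss.backward()
    return prof


# Toy stand-in for a real model and batch (assumption for illustration).
model = torch.nn.Linear(64, 64)
inputs = torch.randn(8, 64)
if torch.cuda.is_available():
    model, inputs = model.cuda(), inputs.cuda()

prof = profile_step(model, inputs)

# Summarize the operators that allocated the most memory.
table = prof.key_averages().table(sort_by="self_cpu_memory_usage", row_limit=10)
print(table)
```

For a timeline view, `prof.export_chrome_trace("trace.json")` writes a trace that can be opened in a Chrome-trace-compatible viewer. With Nsight Systems, a comparable starting point is `nsys profile -o report python train.py`, where `train.py` is a placeholder for your own entry-point script.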