Update README.md
Browse files
README.md
CHANGED
|
@@ -7,7 +7,7 @@ tags: []
|
|
| 7 |
|
| 8 |
Nemotron-Hymba2 is a new hybrid SLM model family that outperforms Qwen models in accuracy (math, coding, and commonsense), batch-size-1 latency, and throughput. More details are in our NeurIPS 2025 [paper](https://drive.google.com/drive/folders/17vOGktwUfUpRAJPGJUV6oX8XwLSczZtv?usp=sharing).
|
| 9 |
|
| 10 |
-
Instruct version:
|
| 11 |
|
| 12 |
Docker path: `/lustre/fsw/portfolios/nvr/users/yongganf/docker/megatron_py25_fast_slm.sqsh` on NRT.
|
| 13 |
|
|
|
|
| 7 |
|
| 8 |
Nemotron-Hymba2 is a new hybrid SLM model family that outperforms Qwen models in accuracy (math, coding, and commonsense), batch-size-1 latency, and throughput. More details are in our NeurIPS 2025 [paper](https://drive.google.com/drive/folders/17vOGktwUfUpRAJPGJUV6oX8XwLSczZtv?usp=sharing).
|
| 9 |
|
| 10 |
+
Instruct version: [https://huggingface.co/nvidia/Nemotron-Hymba2-3B-Instruct](https://huggingface.co/nvidia/Nemotron-Hymba2-3B-Instruct).
|
| 11 |
|
| 12 |
Docker path: `/lustre/fsw/portfolios/nvr/users/yongganf/docker/megatron_py25_fast_slm.sqsh` on NRT.
|
| 13 |
|