Update README.md
Browse files
README.md
CHANGED
|
@@ -18,13 +18,13 @@ tags:
|
|
| 18 |
|
| 19 |
# Nemotron-Terminal Model Family
|
| 20 |
|
| 21 |
-
**Nemotron-Terminal** is a family of models specialized for autonomous terminal interaction, fine-tuned from the Qwen3
|
| 22 |
|
| 23 |
|
| 24 |
## Model Variants
|
| 25 |
We release the following variants of the Nemotron-Terminal family:
|
| 26 |
|
| 27 |
-
- Nemotron-Terminal-8B
|
| 28 |
- Nemotron-Terminal-14B
|
| 29 |
- Nemotron-Terminal-32B
|
| 30 |
|
|
@@ -34,9 +34,9 @@ The Nemotron-Terminal family demonstrates profound leaps in capability compared
|
|
| 34 |
|
| 35 |
| Model | Size | Base Accuracy | **Nemotron-Terminal Accuracy** |
|
| 36 |
| :--- | :---: | :---: | :---: |
|
| 37 |
-
| Nemotron-Terminal-8B | 8B | 2.47% | **13.0%** |
|
| 38 |
| Nemotron-Terminal-14B | 14B | 4.04% | **20.2%** |
|
| 39 |
-
|
|
| 40 |
|
| 41 |
## Usage
|
| 42 |
The models are trained using the **Terminus 2** scaffolding and output a structured JSON format.
|
|
|
|
| 18 |
|
| 19 |
# Nemotron-Terminal Model Family
|
| 20 |
|
| 21 |
+
**Nemotron-Terminal** is a family of models specialized for autonomous terminal interaction, fine-tuned from the Qwen3 (8B, 14B, and 32B). Developed by NVIDIA, these models utilize [Terminal-Corpus](https://huggingface.co/datasets/nvidia/Terminal-Corpus), a large-scale open-source dataset for terminal tasks, to achieve performance that rivals frontier models many times their size.
|
| 22 |
|
| 23 |
|
| 24 |
## Model Variants
|
| 25 |
We release the following variants of the Nemotron-Terminal family:
|
| 26 |
|
| 27 |
+
- **Nemotron-Terminal-8B**
|
| 28 |
- Nemotron-Terminal-14B
|
| 29 |
- Nemotron-Terminal-32B
|
| 30 |
|
|
|
|
| 34 |
|
| 35 |
| Model | Size | Base Accuracy | **Nemotron-Terminal Accuracy** |
|
| 36 |
| :--- | :---: | :---: | :---: |
|
| 37 |
+
| **Nemotron-Terminal-8B** | 8B | 2.47% | **13.0%** |
|
| 38 |
| Nemotron-Terminal-14B | 14B | 4.04% | **20.2%** |
|
| 39 |
+
| Nemotron-Terminal-32B | 32B | 3.37% | **27.4%** |
|
| 40 |
|
| 41 |
## Usage
|
| 42 |
The models are trained using the **Terminus 2** scaffolding and output a structured JSON format.
|