Changelog for update to Rnj-1.
Browse files
README.md
CHANGED
|
@@ -1,6 +1,8 @@
|
|
| 1 |
---
|
| 2 |
license: apache-2.0
|
| 3 |
library_name: transformers
|
|
|
|
|
|
|
| 4 |
---
|
| 5 |
# Rnj-1
|
| 6 |
|
|
@@ -66,6 +68,16 @@ library_name: transformers
|
|
| 66 |
|
| 67 |
Rnj-1 is a family of 8B parameter open-weight, dense models trained from scratch by Essential AI, optimized for code and STEM with capabilities on par with SOTA open-weight models. These models perform well across a range of programming languages and boast strong agentic capabilities (e.g., inside agentic frameworks like mini-SWE-agent), while also excelling at tool-calling. They additionally exhibit strong capabilities in math and science. Herein, `rnj-1` refers to the base model, while `rnj-1-instruct` refers to the post-trained instruction tuned model.
|
| 68 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 69 |
# Capabilities
|
| 70 |
|
| 71 |
We evaluate Rnj-1 models against models of comparable size. In addition to accuracy, we also show the FLOPs used in pre-training for each model.
|
|
@@ -159,9 +171,13 @@ The global batch sizes used were:
|
|
| 159 |
|
| 160 |
# Recommendations
|
| 161 |
|
| 162 |
-
### Temperature
|
|
|
|
|
|
|
|
|
|
|
|
|
| 163 |
|
| 164 |
-
|
| 165 |
|
| 166 |
### Propensity to write code
|
| 167 |
|
|
|
|
| 1 |
---
|
| 2 |
license: apache-2.0
|
| 3 |
library_name: transformers
|
| 4 |
+
base_model:
|
| 5 |
+
- EssentialAI/rnj-1
|
| 6 |
---
|
| 7 |
# Rnj-1
|
| 8 |
|
|
|
|
| 68 |
|
| 69 |
Rnj-1 is a family of 8B parameter open-weight, dense models trained from scratch by Essential AI, optimized for code and STEM with capabilities on par with SOTA open-weight models. These models perform well across a range of programming languages and boast strong agentic capabilities (e.g., inside agentic frameworks like mini-SWE-agent), while also excelling at tool-calling. They additionally exhibit strong capabilities in math and science. Herein, `rnj-1` refers to the base model, while `rnj-1-instruct` refers to the post-trained instruction tuned model.
|
| 70 |
|
| 71 |
+
# Changelog
|
| 72 |
+
|
| 73 |
+
* Update December 18, 2025:
|
| 74 |
+
- System prompt and temperature recommendations: Resolve premature truncations and mitigate unprompted code outputs.
|
| 75 |
+
- Updates to default chat template.
|
| 76 |
+
- Updated STEM and comparables tables.
|
| 77 |
+
- Links to model generations for evals.
|
| 78 |
+
|
| 79 |
+
* Initial version: December 8, 2025
|
| 80 |
+
|
| 81 |
# Capabilities
|
| 82 |
|
| 83 |
We evaluate Rnj-1 models against models of comparable size. In addition to accuracy, we also show the FLOPs used in pre-training for each model.
|
|
|
|
| 171 |
|
| 172 |
# Recommendations
|
| 173 |
|
| 174 |
+
### System Prompt & Temperature
|
| 175 |
+
|
| 176 |
+
We recommend _always_ adding a system prompt. `You are a helpful assistant.` is a good default prompt to use.
|
| 177 |
+
|
| 178 |
+
We recommend using temperatures in the range [0, 0.2] for `rnj-1-instruct`.
|
| 179 |
|
| 180 |
+
Failure to follow these recommendations can result in a) truncated outputs, b) code outputs even for non-code prompts.
|
| 181 |
|
| 182 |
### Propensity to write code
|
| 183 |
|