Update README.md
Browse files
README.md
CHANGED
|
@@ -8,4 +8,12 @@ base_model:
|
|
| 8 |
pipeline_tag: text-generation
|
| 9 |
---
|
| 10 |
|
| 11 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 8 |
pipeline_tag: text-generation
|
| 9 |
---
|
| 10 |
|
| 11 |
+
# HermesCoder-14B
|
| 12 |
+
|
| 13 |
+
[](https://x.com/NousResearch)
|
| 14 |
+
[](https://www.apache.org/licenses/LICENSE-2.0)
|
| 15 |
+
|
| 16 |
+
We introduce *HermesCoder-14B*, a code reasoning model post-trained on [Qwen3-14B](https://huggingface.co/Qwen/Qwen3-14B) via reinforcement learning with verifiable rewards (RLVR).
|
| 17 |
+
On LiveCodeBench v5 (08/01/2024 - 05/01/2025), we achieve a Pass@1 accuracy of 67.87\%, up 7.08\% from the baseline Pass@1 accuracy of 60.79\%
|
| 18 |
+
of Qwen3-14B. To the best of our knowledge, this is the highest-performing 14B model to date.
|
| 19 |
+
We trained on 24k verifiable coding problems using 48 B200s over the course of four days.
|