jli505 commited on
Commit
0deb257
·
verified ·
1 Parent(s): 0287376

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -1
README.md CHANGED
@@ -8,4 +8,12 @@ base_model:
8
  pipeline_tag: text-generation
9
  ---
10
 
11
- We introduce HermesCoder-14B, a code reasoning model post-trained on Qwen3-14B via reinforcement learning with verifiable rewards (RLVR). On LiveCodeBench v5 (08/01/2024 - 05/01/2025), we achieve a Pass@1 accuracy of 67.87\%, up 7.08\% from the baseline Pass@1 accuracy of 60.79\% of Qwen3-14B. To the best of our knowledge, this is the highest-performing 14B model to date. We trained on 24K verifiable coding problems using 48 B200s over the course of four days.
 
 
 
 
 
 
 
 
 
8
  pipeline_tag: text-generation
9
  ---
10
 
11
+ # HermesCoder-14B
12
+
13
+ [![](https://img.shields.io/badge/X-NousResearch-000000?logo=x&logoColor=white)](https://x.com/NousResearch)
14
+ [![apache 2.0](https://img.shields.io/badge/License-Apache%202.0-orange?logoColor=white&logoUrl=https://pbs.twimg.com/profile_images/1816254738234761216/TX7TW-Mp_400x400.jpg)](https://www.apache.org/licenses/LICENSE-2.0)
15
+
16
+ We introduce *HermesCoder-14B*, a code reasoning model post-trained on [Qwen3-14B](https://huggingface.co/Qwen/Qwen3-14B) via reinforcement learning with verifiable rewards (RLVR).
17
+ On LiveCodeBench v5 (08/01/2024 - 05/01/2025), we achieve a Pass@1 accuracy of 67.87\%, up 7.08\% from the baseline Pass@1 accuracy of 60.79\%
18
+ of Qwen3-14B. To the best of our knowledge, this is the highest-performing 14B model to date.
19
+ We trained on 24k verifiable coding problems using 48 B200s over the course of four days.