Upload README.md with huggingface_hub
README.md
CHANGED
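The diff below rewrites a wide markdown benchmark table, and hand-edited tables easily end up with a delimiter row whose column count no longer matches the header. A minimal stdlib sketch of such a sanity check (`table_column_mismatch` is a hypothetical helper written for this illustration, not from any library; it assumes no escaped or backticked pipes inside cells):

```python
def table_column_mismatch(markdown_table: str) -> bool:
    """Return True if any row of a pipe-delimited markdown table has a
    column count different from the header row.
    Hypothetical helper for illustration; assumes no escaped or
    backticked '|' characters inside cells."""
    def ncols(line: str) -> int:
        # Strip the outer pipes, then count the remaining cells.
        return len(line.strip().strip("|").split("|"))

    rows = [ln for ln in markdown_table.strip().splitlines() if ln.strip()]
    header_cols = ncols(rows[0])
    return any(ncols(row) != header_cols for row in rows[1:])

table = """\
| Benchmark | **OmniCoder-9B** | Qwen3.5-9B |
|:---|:---:|:---:|
| **GPQA Diamond** (pass@1) | **83.8** | 81.7 |
"""
print(table_column_mismatch(table))  # False: every row has 3 columns
```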
```diff
@@ -23,7 +23,7 @@ model-index:
     metrics:
     - name: pass@5
       type: accuracy
-      value: 90
+      value: 90.0
   - task:
       type: text-generation
     dataset:
@@ -44,7 +44,7 @@ model-index:
     metrics:
     - name: Pass Rate
       type: accuracy
-      value: 28
+      value: 28.0
 ---
 
 <div align="center">
@@ -86,12 +86,12 @@ The model shows strong agentic behavior: it recovers from errors (read-before-wr
 
 <div align="center">
 
-| Benchmark | **OmniCoder-9B** | Qwen3.5-9B | Qwen3-Next-80B | GPT-OSS-120B | GPT-OSS-20B | GLM 4.7 |
-|:---|:---:|:---:|:---:|:---:|:---:|:---:|
-| **AIME 2025** (pass@5) | 90 | | | | | |
-| **GPQA Diamond** (pass@1) | **83.8** | 81.7 | 77.2 | 80.1 | 71.5 | |
-| **GPQA Diamond** (pass@3) | **86.4** | | | | | |
-| **Terminal-Bench 2.0** | **28** | 20 | | | | 33.4 |
+| Benchmark | **OmniCoder-9B** | Qwen3.5-9B | Qwen3-Next-80B | GPT-OSS-120B | GPT-OSS-20B | GLM-4.7-Flash | GLM 4.7 | Claude Haiku 4.5 |
+|:---|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|
+| **AIME 2025** (pass@5) | 90 | | | | 91.7 | 91.6 | | |
+| **GPQA Diamond** (pass@1) | **83.8** | 81.7 | 77.2 | 80.1 | 71.5 | | | 73 |
+| **GPQA Diamond** (pass@3) | **86.4** | | | | | | | |
+| **Terminal-Bench 2.0** | **28** | 20 | | | | | 33.4 | 27 |
 
 </div>
 
@@ -164,16 +164,6 @@ See all quantizations: [Tesslate/OmniCoder-9B-GGUF](https://huggingface.co/Tessl
 | **Precision** | bf16 |
 | **Optimizer** | AdamW (lr=2e-4, cosine schedule) |
 
-### Training Data Sources
-
-| Source | Samples | Description |
-|:---|---:|:---|
-| NVIDIA Nemotron-Terminal-Corpus | 226K | Terminal agent trajectories |
-| CoderForge-Preview (reward >= 0.5) | 155K | SWE-bench style coding trajectories |
-| Nemotron Skill-Based | 24K | Skill-based coding tasks |
-| Scale-SWE | 20K | Real GitHub issue patches (synthesized trajectories) |
-| Opus Reasoning | 2.3K | Chain-of-thought reasoning |
-
 ---
 
 ## Architecture
@@ -181,7 +171,7 @@ See all quantizations: [Tesslate/OmniCoder-9B-GGUF](https://huggingface.co/Tessl
 OmniCoder inherits Qwen3.5-9B's hybrid architecture:
 
 - **Gated Delta Networks**: Linear attention layers interleaved with standard attention for efficient long-range dependencies
-- **VLM Backbone**: Built on `Qwen3_5ForConditionalGeneration`
+- **VLM Backbone**: Built on `Qwen3_5ForConditionalGeneration`
```
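The benchmark table above reports pass@1, pass@3, and pass@5. The card does not say how these numbers were computed; assuming the standard unbiased pass@k estimator from the HumanEval/Codex evaluation literature (Chen et al.), a minimal sketch:

```python
from math import comb

def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimator: the probability that at least one of k
    samples drawn without replacement from n generations is correct,
    given that c of the n generations are correct.
    pass@k = 1 - C(n - c, k) / C(n, k)."""
    if n - c < k:
        return 1.0  # too few incorrect samples: every size-k draw hits a correct one
    return 1.0 - comb(n - c, k) / comb(n, k)

# Illustration: 10 generations per problem, 4 of them correct.
print(round(pass_at_k(10, 4, 1), 3))  # 0.4 — equals the naive fraction c/n when k=1
print(round(pass_at_k(10, 4, 5), 3))  # 0.976
```

Evaluating over many problems and averaging the per-problem estimates gives the headline pass@k score; n must be at least k for the estimator to be defined.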