Update README.md
Browse files
README.md
CHANGED
|
@@ -254,10 +254,9 @@ Evaluation results in reasoning mode for SmolLM3 and Qwen3 models:
|
|
| 254 |
|
| 255 |
|
| 256 |
### Base Pre-Trained Model
|
| 257 |
-
For Ruler 64k evaluation, we apply YaRN to the Qwen models with 32k context to extrapolate the context length.
|
| 258 |
|
| 259 |
#### English benchmarks
|
| 260 |
-
Note: All evaluations are zero-shot unless stated otherwise.
|
| 261 |
|
| 262 |
| Category | Metric | SmolLM3-3B | Qwen2.5-3B | Llama3-3.2B | Qwen3-1.7B-Base | Qwen3-4B-Base |
|
| 263 |
|---------|--------|---------------------|------------|--------------|------------------|---------------|
|
|
|
|
| 254 |
|
| 255 |
|
| 256 |
### Base Pre-Trained Model
|
|
|
|
| 257 |
|
| 258 |
#### English benchmarks
|
| 259 |
+
Note: All evaluations are zero-shot unless stated otherwise. For Ruler 64k evaluation, we apply YaRN to the Qwen models with 32k context to extrapolate the context length.
|
| 260 |
|
| 261 |
| Category | Metric | SmolLM3-3B | Qwen2.5-3B | Llama3-3.2B | Qwen3-1.7B-Base | Qwen3-4B-Base |
|
| 262 |
|---------|--------|---------------------|------------|--------------|------------------|---------------|
|