Sweaterdog commited on
Commit
0161c23
·
verified ·
1 Parent(s): 8ae5369

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +0 -18
README.md CHANGED
@@ -103,24 +103,6 @@ Scores sourced from official technical reports (Qwen3 Technical Report, May 2025
103
 
104
  > **Note:** *Benchmarks are Underway for GRaPE 2.1 Flash, they will be empty and set as "TBD" for the time being*
105
 
106
- ### Benchmarks
107
-
108
- | Models | Params | GPQA Diamond | MMLU-Pro | LiveCodeBench v6 | HMMT Nov 25 | TAU2-Bench | MultiChallenge |
109
- |----------------------|-------------------|--------------|----------|------------------|-------------|------------|----------------|
110
- | GRaPE 2.1 Flash | 9B | TBD | TBD | TBD | TBD | TBD | TBD |
111
- | GRM-2.5-Plus | 9B | 82.7 | 84.2 | 67.2 | 83.2 | 80.5 | 56.5 |
112
- | Qwen3.5-9B | 9B | 81.7 | 82.5 | 65.6 | 82.9 | 79.1 | 54.5 |
113
- | google/gemma-4-E4B-it| E4B (4.5B eff.) | 58.6 | 69.4 | 52.0 | -- | 42.2 | -- |
114
-
115
-
116
- ***
117
-
118
- ### Real World Example
119
-
120
- I know benchmarks seem cool on paper, but some people like to demo models by themselves. I asked GRaPE 2.1 Flash to make a webpage for Aurora Beats, and you can find that [here](https://huggingface.co/SL-AI/GRaPE-2.1-Flash_GGUF/raw/main/Aurora_Beats_2.1_flash.html)
121
-
122
- ***
123
-
124
  # Recommended Inference Settings
125
 
126
  Tested in **LM Studio**. These sampling parameters are a good starting point:
 
103
 
104
  > **Note:** *Benchmarks are Underway for GRaPE 2.1 Flash, they will be empty and set as "TBD" for the time being*
105
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
106
  # Recommended Inference Settings
107
 
108
  Tested in **LM Studio**. These sampling parameters are a good starting point: