Update README.md
README.md
CHANGED
@@ -63,7 +63,7 @@ We benchmarked our model against GPT-2, SmolLM-135M, and Qwen3-0.6B-Base on a qu
 | SmolLM2-135M | 135M | 0.5193 | 36 | 98 | 40 | 111 |
 | Qwen3-0.6B-Base | 600M | 0.7359 | 165 | 79 | 16 | 40 |
 
-Each model generated three hundred continuations of the prefix `Question:`. [Qwen3-32B](https://huggingface.co/Qwen/Qwen3-32B) scored each one using a decimal grading system (0.0 to 1.0).
+Each model generated two to three hundred continuations of the prefix `Question:`. [Qwen3-32B](https://huggingface.co/Qwen/Qwen3-32B) scored each one using a decimal grading system (0.0 to 1.0).
 Our model generated the second highest number of coherent questions with less parameters than most character level RNNs.
 
 ## Use Cases