Update README.md
README.md CHANGED
@@ -56,6 +56,24 @@ A demo of a website it had generated for itself can be found [here](test_website

***

+***
+
+## 🚀 Benchmarks
+
+GRaPE Mini Beta is not the final model and will improve. These are the benchmarks run for GRaPE Mini Beta.
+
+*(The benchmarks below were run with the F16 weights of the model.)*
+
+|  Tasks   |Version|     Filter     |n-shot|  Metric   |   | Value|
+|----------|------:|----------------|-----:|-----------|---|-----:|
+|gsm8k*    |      3|flexible-extract|     5|exact_match|↑  |28.51%|
+|          |       |strict-match    |     5|exact_match|↑  |14.48%|
+|humaneval*|      1|create_test     |     0|pass@1     |   |20.73%|
+
+\* These benchmarks were accidentally run with the GPT-2 tokenizer; updated results are coming soon.
+
## 🧠 Model Philosophy: The Art of the Finetune

While GRaPE Mini is not trained "from-scratch" (i.e., from random weights), it represents an extensive and highly curated instruction-tuning process. A base model possesses linguistic structure but lacks the ability to follow instructions, reason, or converse. The true "creation" of an assistant like GRaPE lies in the meticulous selection, blending, and application of high-quality datasets. This finetuning process is what transforms a raw linguistic engine into a capable and helpful agent.
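As a purely illustrative sketch of what one pass of such instruction-tuning can look like, the snippet below uses Hugging Face's TRL `SFTTrainer`. The base model, dataset, and hyperparameters here are placeholders, not GRaPE's actual recipe.

```python
# Minimal supervised finetuning (SFT) sketch using Hugging Face TRL.
# Every name below is a placeholder -- not GRaPE's actual data or settings.
from datasets import load_dataset
from trl import SFTConfig, SFTTrainer

# In practice, several instruction datasets would be cleaned, converted to a
# shared format, and blended before this step; one public set stands in here.
train_ds = load_dataset("tatsu-lab/alpaca", split="train")  # has a "text" field

trainer = SFTTrainer(
    model="gpt2",  # placeholder base model with only linguistic structure
    train_dataset=train_ds,
    args=SFTConfig(output_dir="sft-out", num_train_epochs=1),
)
trainer.train()
```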
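The Benchmarks table above matches the output format of EleutherAI's lm-evaluation-harness, so a comparable run can likely be reproduced along the following lines. The model and tokenizer paths are placeholders; this is a sketch of the setup, not the exact command used for the reported numbers.

```python
# Hypothetical reproduction sketch with EleutherAI's lm-evaluation-harness
# (pip install lm-eval). Paths below are placeholders.
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",
    # dtype=float16 mirrors the F16 weights noted above; pinning the model's own
    # tokenizer avoids the GPT-2 tokenizer mix-up described in the footnote.
    model_args="pretrained=path/to/grape-mini,tokenizer=path/to/grape-mini,dtype=float16",
    tasks=["gsm8k"],  # 5-shot gsm8k, as reported in the table
    num_fewshot=5,
)
print(results["results"]["gsm8k"])
```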