Spestly
/

Athena-R3-1.5B

Text Generation

text-generation-inference

Model card Files Files and versions

Spestly commited on Jan 27, 2025

Commit

bcf3d95

·

verified ·

1 Parent(s): 4fce245

Update README.md

Files changed (1) hide show

README.md +19 -0

README.md CHANGED Viewed

@@ -57,6 +57,25 @@ library_name: transformers
 - **Reasoning Improvement:** In this version of Atlas, I have enhanced it's reasoning via synthetic data from models such as Gemini-2.0 Flash Thinking so that it can improve on reasoning.
 ---
 ### **Intended Use Cases**
 Atlas Pro works best for:
 - **Technical Professionals:** Helping developers, engineers, and scientists solve complex problems.

 - **Reasoning Improvement:** In this version of Atlas, I have enhanced it's reasoning via synthetic data from models such as Gemini-2.0 Flash Thinking so that it can improve on reasoning.
 ---
+# **Evaluation**
+Below are the evaluations of the Atlas-Pro models and Deepseek's R1 Qwen Distills (The model that started the whole Atlas family):
+| **Metric**              | **Spestly Atlas Pro (7B)** | **Spestly Atlas Pro (1.5B)** | DeepSeek-R1-Distill-Qwen (7B) | DeepSeek-R1-Distill-Qwen (1.5B) |
+|-------------------------|---------------------------|------------------------------|-----------------------------------|-------------------------------------|
+| **Average**             | **22.65%**               | 12.93%                       | 11.73%                            | 7.53%                              |
+| **IFEval**              | 31.54%                   | 24.30%                       | **40.38%**                        | 34.63%                             |
+| **BBH**                 | **25.27%**               | 9.08%                        | 7.88%                             | 4.73%                              |
+| **MATH**                | **38.90%**               | 25.83%                       | 0.00%                             | 0.00%                              |
+| **GPQA**                | **11.63%**               | 6.26%                        | 3.91%                             | 2.97%                              |
+| **MUSR**                | **6.65%**                | 1.86%                        | 3.55%                             | 2.08%                              |
+| **MMLU-Pro**            | **21.89%**               | 10.28%                       | 14.68%                            | 0.78%                              |
+| **Carbon Emissions (kg)** | 0.69 kg                  | **0.59 kg**                  | 0.68 kg                           | 0.62 kg                             |
+---
 ### **Intended Use Cases**
 Atlas Pro works best for:
 - **Technical Professionals:** Helping developers, engineers, and scientists solve complex problems.