Update README.md
Browse files
README.md
CHANGED
|
@@ -37,7 +37,34 @@ This model is intended for researchers and practitioners looking for a powerful
|
|
| 37 |
The `Locutusque/Hyperion-2.0-Mistral-7B` model was fine-tuned on the Hyperion-v2.0 dataset, which amalgamates various datasets rich in diversity and complexity, including programming, medical texts, mathematical problems, and reasoning tasks.
|
| 38 |
|
| 39 |
## Evaluation Results
|
| 40 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 41 |
|
| 42 |
## How to Use
|
| 43 |
```python
|
|
|
|
| 37 |
The `Locutusque/Hyperion-2.0-Mistral-7B` model was fine-tuned on the Hyperion-v2.0 dataset, which amalgamates various datasets rich in diversity and complexity, including programming, medical texts, mathematical problems, and reasoning tasks.
|
| 38 |
|
| 39 |
## Evaluation Results
|
| 40 |
+
0-shot AGIEval
|
| 41 |
+
| Tasks |Version|Filter|n-shot| Metric |Value | |Stderr|
|
| 42 |
+
|---------------------------------|-------|------|-----:|--------|-----:|---|-----:|
|
| 43 |
+
|agieval_nous |N/A |none | 0|acc |0.3602|± |0.0929|
|
| 44 |
+
| | |none | 0|acc_norm|0.3342|± |0.0764|
|
| 45 |
+
| - agieval_aqua_rat | 1|none | 0|acc |0.2402|± |0.0269|
|
| 46 |
+
| | |none | 0|acc_norm|0.2441|± |0.0270|
|
| 47 |
+
| - agieval_logiqa_en | 1|none | 0|acc |0.2965|± |0.0179|
|
| 48 |
+
| | |none | 0|acc_norm|0.3226|± |0.0183|
|
| 49 |
+
| - agieval_lsat_ar | 1|none | 0|acc |0.2348|± |0.0280|
|
| 50 |
+
| | |none | 0|acc_norm|0.2000|± |0.0264|
|
| 51 |
+
| - agieval_lsat_lr | 1|none | 0|acc |0.3667|± |0.0214|
|
| 52 |
+
| | |none | 0|acc_norm|0.3373|± |0.0210|
|
| 53 |
+
| - agieval_lsat_rc | 1|none | 0|acc |0.4981|± |0.0305|
|
| 54 |
+
| | |none | 0|acc_norm|0.4089|± |0.0300|
|
| 55 |
+
| - agieval_sat_en | 1|none | 0|acc |0.6359|± |0.0336|
|
| 56 |
+
| | |none | 0|acc_norm|0.5777|± |0.0345|
|
| 57 |
+
| - agieval_sat_en_without_passage| 1|none | 0|acc |0.3883|± |0.0340|
|
| 58 |
+
| | |none | 0|acc_norm|0.3544|± |0.0334|
|
| 59 |
+
| - agieval_sat_math | 1|none | 0|acc |0.3500|± |0.0322|
|
| 60 |
+
| | |none | 0|acc_norm|0.2682|± |0.0299|
|
| 61 |
+
|
| 62 |
+
| Groups |Version|Filter|n-shot| Metric |Value | |Stderr|
|
| 63 |
+
|------------|-------|------|-----:|--------|-----:|---|-----:|
|
| 64 |
+
|agieval_nous|N/A |none | 0|acc |0.3602|± |0.0929|
|
| 65 |
+
| | |none | 0|acc_norm|0.3342|± |0.0764|
|
| 66 |
+
|
| 67 |
+
5-shot AGIEval coming soon.
|
| 68 |
|
| 69 |
## How to Use
|
| 70 |
```python
|