Update README.md
Browse files
README.md
CHANGED
|
@@ -22,7 +22,7 @@ To evaluate both the general reasoning abilities and the domain-specific perform
|
|
| 22 |
|------------------------------|:-------------:|:---------:|:----------:|
|
| 23 |
| Llama-3-8B (base) | 51.4% | 59.9% | 73.1% |
|
| 24 |
| Hobby_Recommendation model | 53.7% | 59.9% | 73.2% |
|
| 25 |
-
| Falcon-7B-Instruct |
|
| 26 |
| Mistral-7B-Instruct | --.-% | --.-% | --.-% |
|
| 27 |
|
| 28 |
**Model Performance Summary**
|
|
|
|
| 22 |
|------------------------------|:-------------:|:---------:|:----------:|
|
| 23 |
| Llama-3-8B (base) | 51.4% | 59.9% | 73.1% |
|
| 24 |
| Hobby_Recommendation model | 53.7% | 59.9% | 73.2% |
|
| 25 |
+
| Falcon-7B-Instruct | 40.2% | 57.7% | 67.6% |
|
| 26 |
| Mistral-7B-Instruct | --.-% | --.-% | --.-% |
|
| 27 |
|
| 28 |
**Model Performance Summary**
|