Update README.md
Browse files
README.md
CHANGED
|
@@ -57,7 +57,7 @@ Code: [https://github.com/jincan333/MAS-TTS](https://github.com/jincan333/MAS-TT
|
|
| 57 |
| | **GPQA** | **Commongen** | **AIME2024** | **MATH-500** | **HumanEval** | **MBPP-S**|
|
| 58 |
| **Non-Reasoning Models** | | | | | | |
|
| 59 |
| Qwen2.5 | 50.2 | 96.7 | 21.1 | 84.4 | 89.0 | 80.2 |
|
| 60 |
-
| DeepSeek-V3 | 58.6
|
| 61 |
| GPT-4o | 49.2 | 97.8 | 7.8 | 81.3 | **90.9** | **85.4** |
|
| 62 |
| **Reasoning Models** | | | | | | |
|
| 63 |
| s1.1-32B | 58.3 | 94.1 | 53.3 | 90.6 | 82.3 | 77.4 |
|
|
@@ -79,7 +79,7 @@ M1-32B is intended for research on Multi-agent reasoning and collaboration in MA
|
|
| 79 |
|
| 80 |
## Citation
|
| 81 |
|
| 82 |
-
If you use this
|
| 83 |
|
| 84 |
```bibtex
|
| 85 |
@article{jin2025two,
|
|
|
|
| 57 |
| | **GPQA** | **Commongen** | **AIME2024** | **MATH-500** | **HumanEval** | **MBPP-S**|
|
| 58 |
| **Non-Reasoning Models** | | | | | | |
|
| 59 |
| Qwen2.5 | 50.2 | 96.7 | 21.1 | 84.4 | 89.0 | 80.2 |
|
| 60 |
+
| DeepSeek-V3 | **58.6** | **98.6** | **33.3** | **88.6** | 89.6 | 83.9 |
|
| 61 |
| GPT-4o | 49.2 | 97.8 | 7.8 | 81.3 | **90.9** | **85.4** |
|
| 62 |
| **Reasoning Models** | | | | | | |
|
| 63 |
| s1.1-32B | 58.3 | 94.1 | 53.3 | 90.6 | 82.3 | 77.4 |
|
|
|
|
| 79 |
|
| 80 |
## Citation
|
| 81 |
|
| 82 |
+
If you use this model, please cite the relevant papers:
|
| 83 |
|
| 84 |
```bibtex
|
| 85 |
@article{jin2025two,
|