TeleAI-AI-Flow
/

AI-Flow-Ruyi-7B-0725

Model card Files Files and versions

Coder-AN commited on Jul 25, 2025

Commit

63c25cf

·

1 Parent(s): dccf4a7

update

Files changed (2) hide show

README.md +4 -4
README_en.md +4 -4

README.md CHANGED Viewed

@@ -80,8 +80,8 @@ tasks:
 |模型名称|HumanEval|MBPP|LiveCodeBench|均分|
 |:-:|:-:|:-:|:-:|:-:|
 |Qwen3-8B(think)|84.76|78.60|63.10|75.49|
-|Qwen2.5-7B-Instruct|63.41|68.48|8.15|46.68|
-|Llama3.1-8B-Instruct|84.15|70.82|34.55|63.17|
 |AI-Flow-Ruyi-7B-E7B-0725<b>(ours)</b>|76.83|77.04|28.44|60.77|
 </details>
@@ -92,8 +92,8 @@ tasks:
 |模型名称|GPQA|Math|GSM-8K|均分|
 |:-:|:-:|:-:|:-:|:-:|
 |Qwen3-8B(think)|38.38|83.84|93.03|71.75|
-|Qwen2.5-7B-Instruct|25.25|49.22|85.82|53.43|
-|Llama3.1-8B-Instruct|35.35|73.66|88.48|65.83|
 |AI-Flow-Ruyi-7B-E7B-0725<b>(ours)</b>|30.30|72.18|91.36|64.61|
 </details>

 |模型名称|HumanEval|MBPP|LiveCodeBench|均分|
 |:-:|:-:|:-:|:-:|:-:|
 |Qwen3-8B(think)|84.76|78.60|63.10|75.49|
+|Llama3.1-8B-Instruct|63.41|68.48|8.15|46.68|
+|Qwen2.5-7B-Instruct|84.15|70.82|34.55|63.17|
 |AI-Flow-Ruyi-7B-E7B-0725<b>(ours)</b>|76.83|77.04|28.44|60.77|
 </details>
 |模型名称|GPQA|Math|GSM-8K|均分|
 |:-:|:-:|:-:|:-:|:-:|
 |Qwen3-8B(think)|38.38|83.84|93.03|71.75|
+|Llama3.1-8B-Instruct|25.25|49.22|85.82|53.43|
+|Qwen2.5-7B-Instruct|35.35|73.66|88.48|65.83|
 |AI-Flow-Ruyi-7B-E7B-0725<b>(ours)</b>|30.30|72.18|91.36|64.61|
 </details>

README_en.md CHANGED Viewed

@@ -78,8 +78,8 @@ We conduct a review based on [OpenCompass](https://github.com/open-compass/openc
 |Model|HumanEval|MBPP|LiveCodeBench|Mean|
 |:-:|:-:|:-:|:-:|:-:|
 |Qwen3-8B(think)|84.76|78.60|63.10|75.49|
-|Qwen2.5-7B-Instruct|63.41|68.48|8.15|46.68|
-|Llama3.1-8B-Instruct|84.15|70.82|34.55|63.17|
 |AI-Flow-Ruyi-7B-E7B-0725<b>(ours)</b>|76.83|77.04|28.44|60.77|
 </details>
@@ -90,8 +90,8 @@ We conduct a review based on [OpenCompass](https://github.com/open-compass/openc
 |Model|GPQA|Math|GSM-8K|Mean|
 |:-:|:-:|:-:|:-:|:-:|
 |Qwen3-8B(think)|38.38|83.84|93.03|71.75|
-|Qwen2.5-7B-Instruct|25.25|49.22|85.82|53.43|
-|Llama3.1-8B-Instruct|35.35|73.66|88.48|65.83|
 |AI-Flow-Ruyi-7B-E7B-0725<b>(ours)</b>|30.30|72.18|91.36|64.61|
 </details>

 |Model|HumanEval|MBPP|LiveCodeBench|Mean|
 |:-:|:-:|:-:|:-:|:-:|
 |Qwen3-8B(think)|84.76|78.60|63.10|75.49|
+|Llama3.1-8B-Instruct|63.41|68.48|8.15|46.68|
+|Qwen2.5-7B-Instruct|84.15|70.82|34.55|63.17|
 |AI-Flow-Ruyi-7B-E7B-0725<b>(ours)</b>|76.83|77.04|28.44|60.77|
 </details>
 |Model|GPQA|Math|GSM-8K|Mean|
 |:-:|:-:|:-:|:-:|:-:|
 |Qwen3-8B(think)|38.38|83.84|93.03|71.75|
+|Llama3.1-8B-Instruct|25.25|49.22|85.82|53.43|
+|Qwen2.5-7B-Instruct|35.35|73.66|88.48|65.83|
 |AI-Flow-Ruyi-7B-E7B-0725<b>(ours)</b>|30.30|72.18|91.36|64.61|
 </details>