Update README.md
### Powerful Complex Reasoning Abilities
We conducted a comprehensive evaluation of Ling-flash-2.0’s reasoning capabilities; it achieves strong results on representative benchmarks:
● __Multi-disciplinary knowledge reasoning__: GPQA-Diamond, MMLU-Pro
● __Advanced mathematical reasoning__: AIME 2025, Omni-MATH, OptMATH (advanced mathematical optimization tasks)
● __Challenging code generation__: LiveCodeBench v6, CodeForces-Elo
● __Logical reasoning__: KOR-Bench, ARC-Prize
● __Key regulated industries (Finance, Healthcare)__: FinanceReasoning, HealthBench
Compared with __dense models under 40B__ (e.g., Qwen3-32B-Non-Thinking, Seed-OSS-36B-Instruct (think budget=0)) and __MoE models with larger activated/total parameter counts__ (e.g., Hunyuan-A13B-Instruct, GPT-OSS-120B/low), __Ling-flash-2.0__ demonstrates stronger complex reasoning ability. It is also highly competitive on __creative tasks__ (Creative Writing v3).
<p align="center">
<img src="https://mdn.alipayobjects.com/huamei_fi95qp/afts/img/zxAvQ7QtrAwAAAAAQqAAAAgADkZ7AQFr/fmt.webp"/>