Zhouqi Hua

ZhouqiHUA

1 24 9

AI & ML interests

reasoning LLM

Recent Activity

upvoted a paper 28 days ago

COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation

upvoted a paper 3 months ago

MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale

authored a paper 3 months ago

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

View all activity

Organizations

None yet

upvoted a paper 28 days ago

COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation

Paper • 2605.31264 • Published May 29 • 120

upvoted a paper 3 months ago

MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale

Paper • 2604.04771 • Published Apr 6 • 124

authored a paper 3 months ago

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Paper • 2603.25040 • Published Mar 26 • 134

upvoted a paper 3 months ago

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Paper • 2603.25040 • Published Mar 26 • 134

upvoted a paper 4 months ago

AI Can Learn Scientific Taste

Paper • 2603.14473 • Published Mar 15 • 431

upvoted a paper 5 months ago

DataChef: Cooking Up Optimal Data Recipes for LLM Adaptation via Reinforcement Learning

Paper • 2602.11089 • Published Feb 11 • 18

upvoted a collection 5 months ago

RL and Agents

Collection

19 items • Updated Jul 21, 2025 • 2

liked a model 5 months ago

internlm/Intern-S1-Pro

Image-Text-to-Text • Updated Mar 30 • 280k • 280

upvoted a paper 5 months ago

TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative Optimization

Paper • 2601.16480 • Published Jan 23 • 50

liked a dataset 6 months ago

openai/gsm8k

Benchmark • Updated Mar 23 • 17.6k • 909k • 1.41k

upvoted 4 papers 7 months ago

Achieving Olympia-Level Geometry Large Language Model Agent via Complexity Boosting Reinforcement Learning

Paper • 2512.10534 • Published Dec 11, 2025 • 33

OPV: Outcome-based Process Verifier for Efficient Long Chain-of-Thought Verification

Paper • 2512.10756 • Published Dec 11, 2025 • 35

Long-horizon Reasoning Agent for Olympiad-Level Mathematical Problem Solving

Paper • 2512.10739 • Published Dec 11, 2025 • 47

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models

Paper • 2511.08577 • Published Nov 11, 2025 • 110

upvoted a paper 9 months ago

Deep Think with Confidence

Paper • 2508.15260 • Published Aug 21, 2025 • 92

upvoted a paper 10 months ago

CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics

Paper • 2508.18124 • Published Aug 25, 2025 • 49

authored a paper 10 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21, 2025 • 274

upvoted a paper 10 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21, 2025 • 274

liked a model 10 months ago

a-m-team/AM-Thinking-v1

Text Generation • 33B • Updated May 14, 2025 • 65 • • 205

upvoted a paper 11 months ago

Mol-R1: Towards Explicit Long-CoT Reasoning in Molecule Discovery

Paper • 2508.08401 • Published Aug 11, 2025 • 42

Zhouqi Hua

AI & ML interests

Recent Activity

Organizations

ZhouqiHUA's activity