yushun zhang

yushun0410

https://zyushun.github.io/

zyushun

AI & ML interests

LLMs

Organizations

None yet

upvoted a paper 10 months ago

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Paper • 2509.02479 • Published Sep 2, 2025 • 84

upvoted a paper over 1 year ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 380

upvoted 2 collections over 1 year ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 43 items • Updated Mar 2 • 728

Qwen2.5-Math

Math-specific model series based on Qwen2.5 • 10 items • Updated Mar 2 • 92

upvoted a paper about 2 years ago

Adam-mini: Use Fewer Learning Rates To Gain More

Paper • 2406.16793 • Published Jun 24, 2024 • 69