Sky

dandingsky

AI & ML interests

None yet

Recent Activity

commented on a paper about 2 months ago

Progressive Residual Warmup for Language Model Pretraining

submitted a paper about 2 months ago

Progressive Residual Warmup for Language Model Pretraining

authored a paper 2 months ago

Thinking-Free Policy Initialization Makes Distilled Reasoning Models More Effective and Efficient Reasoners

submitted a paper to Daily Papers about 2 months ago

Progressive Residual Warmup for Language Model Pretraining

Paper • 2603.05369 • Published Mar 5 • 36

authored 8 papers 2 months ago

Thinking-Free Policy Initialization Makes Distilled Reasoning Models More Effective and Efficient Reasoners

Paper • 2509.26226 • Published Sep 30, 2025 • 34

Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models

Paper • 2602.12036 • Published Feb 12 • 93

Progressive Residual Warmup for Language Model Pretraining

Paper • 2603.05369 • Published Mar 5 • 36

UGPhysics: A Comprehensive Benchmark for Undergraduate Physics Reasoning with Large Language Models

Paper • 2502.00334 • Published Feb 1, 2025

UGMathBench: A Diverse and Dynamic Benchmark for Undergraduate-Level Mathematical Reasoning with Large Language Models

Paper • 2501.13766 • Published Jan 23, 2025

GPAS: Accelerating Convergence of LLM Pretraining via Gradient-Preserving Activation Scaling

Paper • 2506.22049 • Published Jun 27, 2025 • 2

Double-Checker: Enhancing Reasoning of Slow-Thinking LLMs via Self-Critical Fine-Tuning

Paper • 2506.21285 • Published Jun 26, 2025

Teaching LLMs According to Their Aptitude: Adaptive Reasoning for Mathematical Problem Solving

Paper • 2502.12022 • Published Feb 17, 2025