11 17 43

Zhouliang Yu

zhouliang

https://zhouliang-yu.github.io

zhouliang-yu

AI & ML interests

Model-Based AI, Reinforcement Learning, Autoformalization

Recent Activity

liked a dataset 7 days ago

AI-Math-TCS/tcs_proof_strategy

authored a paper 17 days ago

Sample-Efficient Post-Training for LEGO Spatial-Physics Reasoning

commentedon a paper 17 days ago

Sample-Efficient Post-Training for LEGO Spatial-Physics Reasoning

View all activity

Organizations

liked a dataset 7 days ago

AI-Math-TCS/tcs_proof_strategy

Viewer • Updated May 17 • 19k • 135 • 1

authored a paper 17 days ago

Sample-Efficient Post-Training for LEGO Spatial-Physics Reasoning

Paper • 2606.07602 • Published 29 days ago • 6

commented a paper 17 days ago

Sample-Efficient Post-Training for LEGO Spatial-Physics Reasoning

Paper • 2606.07602 • Published 29 days ago • 6 •

upvoted a paper 17 days ago

Sample-Efficient Post-Training for LEGO Spatial-Physics Reasoning

Paper • 2606.07602 • Published 29 days ago • 6

liked a dataset 26 days ago

rootacess/Lean-SFT-dataset

Viewer • Updated Feb 11 • 25.8k • 63 • 1

liked 3 datasets 2 months ago

liked a model 2 months ago

OpenDataArena/Qwen3-8B-ODA-Math-460k

Text Generation • 308k • Updated Jan 21 • 60 • 2

authored a paper 3 months ago

Stabilizing Rubric Integration Training via Decoupled Advantage Normalization

Paper • 2603.26535 • Published Mar 27 • 3

liked a dataset 3 months ago

nohurry/Opus-4.6-Reasoning-3000x-filtered

Viewer • Updated Mar 31 • 2.33k • 1.79k • 623

liked a model 3 months ago

Jackrong/Qwopus3.5-4B-v3

Image-Text-to-Text • 5B • Updated Apr 6 • 1.35k • 14

upvoted a paper 4 months ago

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

Paper • 2602.24286 • Published Feb 27 • 99

liked 6 datasets 4 months ago

BytedTsinghua-SIA/CUDA-Agent-Ops-6K

Viewer • Updated Feb 27 • 6k • 197 • 66

Goedel-LM/SFT_dataset_v2

Viewer • Updated Mar 2 • 1.75M • 1.03k • 31

lm-provers/ProofBench

Viewer • Updated Jan 9 • 290 • 56 • 3

AI-MO/NuminaMath-1.5

Viewer • Updated Jan 29 • 896k • 5.24k • 188

AI-MO/aops

Viewer • Updated Mar 31 • 80.7k • 672 • 6

lm-provers/FineProofs-SFT

Viewer • Updated Feb 14 • 12.1k • 230 • 42

upvoted a paper 4 months ago

Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL

Paper • 2602.03773 • Published Feb 3 • 14

Zhouliang Yu

AI & ML interests

Recent Activity

Organizations

zhouliang's activity