4 16 1

Yujun Zhou

yujunzhou

AI & ML interests

None yet

Recent Activity

upvoted a paper 15 days ago

Getting Better at Working With You: Compiling User Corrections into Runtime Enforcement for Coding Agents

submitted a paper 15 days ago

Getting Better at Working With You: Compiling User Corrections into Runtime Enforcement for Coding Agents

upvoted a paper 3 months ago

Emergent Social Intelligence Risks in Generative Multi-Agent Systems

View all activity

Organizations

None yet

upvoted a paper 15 days ago

Getting Better at Working With You: Compiling User Corrections into Runtime Enforcement for Coding Agents

Paper • 2606.13174 • Published 17 days ago • 4

submitted a paper to Daily Papers 15 days ago

Getting Better at Working With You: Compiling User Corrections into Runtime Enforcement for Coding Agents

Paper • 2606.13174 • Published 17 days ago • 4

upvoted a paper 3 months ago

Emergent Social Intelligence Risks in Generative Multi-Agent Systems

Paper • 2603.27771 • Published Mar 29 • 52

updated a model 3 months ago

yujunzhou/MATH-TTT-Qwen3-4B-Base-Semantic-ClipHigh-Ent0.003-RandomNovelty

4B • Updated Mar 29 • 1

published a model 3 months ago

yujunzhou/MATH-TTT-Qwen3-4B-Base-Semantic-ClipHigh-Ent0.003-RandomNovelty

4B • Updated Mar 29 • 1

updated a model 3 months ago

yujunzhou/MATH-TTT-Qwen3-4B-Base-Semantic-ClipHigh-Ent0.003-OpenAI

4B • Updated Mar 28 • 2

published a model 3 months ago

yujunzhou/MATH-TTT-Qwen3-4B-Base-Semantic-ClipHigh-Ent0.003-OpenAI

4B • Updated Mar 28 • 2

New activity in yujunzhou/AIME-TTT-OctoThinker-8B-Hybrid-Base-TTRL 4 months ago

Running in MSTY Studio

#1 opened 4 months ago by

Bogoo10191

upvoted a paper 6 months ago

Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning

Paper • 2512.15687 • Published Dec 17, 2025 • 22

submitted a paper to Daily Papers 6 months ago

Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning

Paper • 2512.15687 • Published Dec 17, 2025 • 22

updated 2 models 6 months ago

yujunzhou/SFT_Advanced_Risk_Self_Grading_Qwen3-4B

Text Generation • 4B • Updated Dec 17, 2025 • 4

yujunzhou/SFT_Advanced_Risk_Self_Grading_llama

Text Generation • 8B • Updated Dec 17, 2025 • 4

published a model 6 months ago

yujunzhou/SFT_Advanced_Risk_Self_Grading_Qwen3-4B

Text Generation • 4B • Updated Dec 17, 2025 • 4

updated a model 6 months ago

yujunzhou/SFT_Advanced_Risk_Self_Grading_Qwen3-4B-Base

Text Generation • 4B • Updated Dec 17, 2025 • 4

published a model 6 months ago

yujunzhou/SFT_Advanced_Risk_Self_Grading_Qwen3-4B-Base

Text Generation • 4B • Updated Dec 17, 2025 • 4

updated 2 models 6 months ago

yujunzhou/SFT_Advanced_Risk_Reward_Tampering_Qwen3-4B

Text Generation • 4B • Updated Dec 17, 2025 • 4

yujunzhou/Advanced_Risk_Self_Grading_llama

8B • Updated Dec 17, 2025 • 1

published a model 6 months ago

yujunzhou/SFT_Advanced_Risk_Reward_Tampering_Qwen3-4B

Text Generation • 4B • Updated Dec 17, 2025 • 4

updated a model 6 months ago

yujunzhou/SFT_Advanced_Risk_Reward_Tampering_Qwen3-4B-Base

Text Generation • 4B • Updated Dec 16, 2025 • 6

published a model 6 months ago

yujunzhou/SFT_Advanced_Risk_Reward_Tampering_Qwen3-4B-Base

Text Generation • 4B • Updated Dec 16, 2025 • 6

Yujun Zhou

AI & ML interests

Recent Activity

Organizations

yujunzhou's activity

Running in MSTY Studio