Sining Zhoubian's picture

3

Sining Zhoubian

SiningZhou

https://scholar.google.com/citations?view_op=search_authors&mauthors=Sining+Zhoubian&hl=zh-CN&oi=ao

zhoubiansining

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Deeper is Not Always Better: Mitigating the Alignment Tax via Confident Layer Decoding

published a model 9 months ago

SiningZhou/Qwen3-8B-ReST-RL

published a model 9 months ago

SiningZhou/Qwen3-8B-VM

View all activity

Organizations

None yet

upvoted a paper 1 day ago

Deeper is Not Always Better: Mitigating the Alignment Tax via Confident Layer Decoding

Paper • 2606.21906 • Published 4 days ago • 18

published 2 models 9 months ago

SiningZhou/Qwen3-8B-ReST-RL

Text Generation • 8B • Updated Sep 12, 2025 • 4

SiningZhou/Qwen3-8B-VM

Text Classification • 8B • Updated Sep 12, 2025 • 5

updated 2 models 9 months ago

SiningZhou/Qwen3-8B-VM

Text Classification • 8B • Updated Sep 12, 2025 • 5

SiningZhou/Qwen3-8B-ReST-RL

Text Generation • 8B • Updated Sep 12, 2025 • 4

upvoted a paper 10 months ago

ReST-RL: Achieving Accurate Code Reasoning of LLMs with Optimized Self-Training and Decoding

Paper • 2508.19576 • Published Aug 27, 2025 • 2

authored 3 papers 10 months ago

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search

Paper • 2406.03816 • Published Jun 6, 2024 • 1

SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning

Paper • 2401.07950 • Published Jan 15, 2024 • 4

ReST-RL: Achieving Accurate Code Reasoning of LLMs with Optimized Self-Training and Decoding

Paper • 2508.19576 • Published Aug 27, 2025 • 2

upvoted a paper 11 months ago

MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems

Paper • 2505.18943 • Published May 25, 2025 • 25