Zhangyue Yin

yinzhangyue

·

https://yinzhangyue.github.io/

AI & ML interests

Reasoning and Planning

Recent Activity

upvoted a paper about 2 months ago

LLMEval-Logic: A Solver-Verified Chinese Benchmark for Logical Reasoning of LLMs with Adversarial Hardening

upvoted a paper 2 months ago

From Skills to Talent: Organising Heterogeneous Agents as a Real-World Company

liked a model 3 months ago

OpenMOSS-Team/MOSS-VL-Instruct-0408

View all activity

Organizations

upvoted a paper about 2 months ago

LLMEval-Logic: A Solver-Verified Chinese Benchmark for Logical Reasoning of LLMs with Adversarial Hardening

Paper • 2605.19597 • Published May 19 • 21

upvoted a paper 2 months ago

From Skills to Talent: Organising Heterogeneous Agents as a Real-World Company

Paper • 2604.22446 • Published Apr 24 • 125

liked a model 3 months ago

OpenMOSS-Team/MOSS-VL-Instruct-0408

Video-Text-to-Text • 11B • Updated Apr 22 • 381 • 97

liked a dataset 4 months ago

OpenMOSS-Team/OmniAction

Updated Mar 27 • 33k • 283

upvoted 7 papers 4 months ago

Attention Residuals

Paper • 2603.15031 • Published Mar 16 • 189

AI Can Learn Scientific Taste

Paper • 2603.14473 • Published Mar 15 • 431

BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning

Paper • 2603.04918 • Published Mar 5 • 56

Utonia: Toward One Encoder for All Point Clouds

Paper • 2603.03283 • Published Mar 3 • 187

OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens

Paper • 2603.02138 • Published Mar 2 • 151

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

Paper • 2602.08354 • Published Feb 9 • 266

CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

Paper • 2602.17684 • Published Feb 4 • 22

liked 3 models 5 months ago

OpenMOSS-Team/MOVA-720p

Any-to-Any • Updated Feb 11 • 163 • 129

OpenMOSS-Team/MOSS-TTS

Text-to-Speech • 8B • Updated Mar 20 • 1.08M • 405

OpenMOSS-Team/MOVA-360p

Image-to-Video • Updated Feb 15 • 98.4k • 215

upvoted 6 papers 5 months ago

MOSS-Audio-Tokenizer: Scaling Audio Tokenizers for Future Audio Foundation Models

Paper • 2602.10934 • Published Feb 11 • 50

Can Deep Research Agents Find and Organize? Evaluating the Synthesis Gap with Expert Taxonomies

Paper • 2601.12369 • Published Jan 18 • 4

Prism: Spectral-Aware Block-Sparse Attention

Paper • 2602.08426 • Published Feb 9 • 38

MOVA: Towards Scalable and Synchronized Video-Audio Generation

Paper • 2602.08794 • Published Feb 9 • 159

TIDE: Trajectory-based Diagnostic Evaluation of Test-Time Improvement in LLM Agents

Paper • 2602.02196 • Published Feb 2 • 35

CL-bench: A Benchmark for Context Learning

Paper • 2602.03587 • Published Feb 3 • 23