floyed shen

floyed

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

LiveBrowseComp: Are Search Agents Searching, or Just Verifying What They Already Know?

submitted a paper about 2 months ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

upvoted a paper about 2 months ago

From Generic Correlation to Input-Specific Credit in On-Policy Self Distillation

View all activity

Organizations

upvoted a paper about 1 month ago

LiveBrowseComp: Are Search Agents Searching, or Just Verifying What They Already Know?

Paper • 2605.28721 • Published May 27 • 18

upvoted 2 papers about 2 months ago

From Generic Correlation to Input-Specific Credit in On-Policy Self Distillation

Paper • 2605.11613 • Published May 12 • 3

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published May 12 • 196

upvoted a paper 2 months ago

Safety Instincts: LLMs Learn to Trust Their Internal Compass for Self-Defense

Paper • 2510.01088 • Published Oct 1, 2025 • 1

upvoted a paper 3 months ago

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published Mar 29 • 150

upvoted 3 papers 4 months ago

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Paper • 2603.04597 • Published Mar 4 • 211

dLLM: Simple Diffusion Language Modeling

Paper • 2602.22661 • Published Feb 26 • 154

Endless Terminals: Scaling RL Environments for Terminal Agents

Paper • 2601.16443 • Published Jan 23 • 19

upvoted a collection 5 months ago

Qwen3.5

21 items • Updated Mar 9 • 1.72k

upvoted 3 papers 5 months ago

DeepEyesV2: Toward Agentic Multimodal Model

Paper • 2511.05271 • Published Nov 7, 2025 • 47

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published Feb 11 • 221

Covo-Audio Technical Report

Paper • 2602.09823 • Published Feb 10 • 15