Kwbw27zcco's picture

Kwbw27zcco

kwbw27zcco

·

AI & ML interests

None yet

Recent Activity

liked a model about 21 hours ago

Tuna12345/qwen2p5-7b-sft-lora-nycu-dl2026

liked a model 4 days ago

tencent/Hy-MT2-1.8B

liked a model 5 days ago

tencent/Hy-MT2-30B-A3B

View all activity

Organizations

None yet

upvoted a paper 6 days ago

TIDE: Efficient and Lossless MoE Diffusion LLM Inference with I/O-aware Expert Offload

Paper • 2605.20179 • Published 10 days ago • 4

upvoted a paper 17 days ago

Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction

Paper • 2605.05242 • Published 26 days ago • 116

upvoted a paper 20 days ago

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

Paper • 2605.06130 • Published 22 days ago • 111

upvoted a paper 21 days ago

Rethinking Reasoning-Intensive Retrieval: Evaluating and Advancing Retrievers in Agentic Search Systems

Paper • 2605.04018 • Published 24 days ago • 40

upvoted a paper about 1 month ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published Apr 9 • 247

upvoted 3 papers about 2 months ago

Communicating about Space: Language-Mediated Spatial Integration Across Partial Views

Paper • 2603.27183 • Published Mar 28 • 20

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 630

Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

Paper • 2603.25716 • Published Mar 26 • 156

upvoted a paper 2 months ago

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 372

upvoted 5 papers 3 months ago

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Paper • 2603.04597 • Published Mar 4 • 211

From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models

Paper • 2602.22859 • Published Feb 26 • 150

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 524

Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs

Paper • 2602.10388 • Published Feb 11 • 245

TermiGen: High-Fidelity Environment and Robust Trajectory Synthesis for Terminal Agents

Paper • 2602.07274 • Published Feb 6 • 210