Yichen Dong

elizabethscott2

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

models4world/mdl-4242276719f9

liked a dataset 6 days ago

m-a-p/FineFineWeb

Organizations

None yet

upvoted a paper 3 days ago

Illuminating Unified Multimodal Model for Free-form Interleaved Text-Image Generation

Paper • 2606.30054 • Published 11 days ago • 4

upvoted a paper 22 days ago

Embodied-R1.5: Evolving Physical Intelligence via Embodied Foundation Models

Paper • 2606.11324 • Published about 1 month ago • 170

upvoted 2 papers about 1 month ago

YoCausal: How Far is Video Generation from World Model? A Causality Perspective

Paper • 2605.30346 • Published May 28 • 55

Seeing Isn't Knowing: Do VLMs Know When Not to Answer Spatial Questions (and Why)?

Paper • 2605.30557 • Published May 28 • 12

upvoted a paper about 2 months ago

CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence

Paper • 2605.12882 • Published May 13 • 274

upvoted 4 papers 2 months ago

OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents

Paper • 2605.05185 • Published May 6 • 106

From Context to Skills: Can Language Models Learn from Context Skillfully?

Paper • 2604.27660 • Published May 3 • 171

Heterogeneous Scientific Foundation Model Collaboration

Paper • 2604.27351 • Published Apr 30 • 222

Learning Evidence Highlighting for Frozen LLMs

Paper • 2604.22565 • Published Apr 24 • 6

upvoted 7 papers 3 months ago

SWE-chat: Coding Agent Interactions From Real Users in the Wild

Paper • 2604.20779 • Published Apr 22 • 18

GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents

Paper • 2604.07429 • Published Apr 8 • 123

RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time

Paper • 2604.11626 • Published Apr 13 • 103

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Paper • 2604.08523 • Published Apr 9 • 265

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published Apr 8 • 329

ClawKeeper: Comprehensive Safety Protection for OpenClaw Agents Through Skills, Plugins, and Watchers

Paper • 2603.24414 • Published Mar 25 • 183

ImagenWorld: Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks

Paper • 2603.27862 • Published Mar 29 • 33

upvoted 3 papers 4 months ago

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 373

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Paper • 2603.04597 • Published Mar 4 • 211

Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published Mar 3 • 198