雨田梁's picture

雨田梁

victoriawhite7

AI & ML interests

None yet

Recent Activity

liked a dataset 19 minutes ago

fosters/knihi-be-arlou_sny_impieratara_output

liked a dataset 1 day ago

aoliverg/MTUOC-recipes

liked a dataset 1 day ago

KarlQuant/k1rl-checkpoints

View all activity

Organizations

None yet

upvoted a paper 1 day ago

Crafter: A Multi-Agent Harness for Editable Scientific Figure Generation from Diverse Inputs

Paper • 2605.30611 • Published 7 days ago • 168

upvoted a paper 7 days ago

SimuWoB: Simulating Real-World Mobile Apps for Fast and Faithful GUI Agent Benchmarking

Paper • 2605.25160 • Published 11 days ago • 8

upvoted a paper 11 days ago

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

Paper • 2605.21467 • Published 15 days ago • 204

upvoted a paper 12 days ago

MOCHA: Multi-Objective Chebyshev Annealing for Agent Skill Optimization

Paper • 2605.19330 • Published 16 days ago • 8

upvoted 2 papers 13 days ago

Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining

Paper • 2605.14747 • Published 21 days ago • 145

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published 23 days ago • 195

upvoted a paper 20 days ago

Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers

Paper • 2605.06169 • Published 28 days ago • 233

upvoted a paper about 1 month ago

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Paper • 2604.20796 • Published Apr 22 • 243

upvoted 2 papers about 2 months ago

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Paper • 2604.08523 • Published Apr 9 • 263

Structured Distillation of Web Agent Capabilities Enables Generalization

Paper • 2604.07776 • Published Apr 9 • 23

upvoted 4 papers 2 months ago

VideoZeroBench: Probing the Limits of Video MLLMs with Spatio-Temporal Evidence Verification

Paper • 2604.01569 • Published Apr 2 • 14

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published Mar 30 • 343

AVControl: Efficient Framework for Training Audio-Visual Controls

Paper • 2603.24793 • Published Mar 25 • 28

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 352