小川健太's picture

小川健太

evelyndavis

AI & ML interests

None yet

Recent Activity

upvoted a paper about 7 hours ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

upvoted a paper 2 days ago

Efficient Image Synthesis with Sphere Latent Encoder

upvoted a paper 4 days ago

CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence

View all activity

Organizations

None yet

upvoted a paper about 7 hours ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published 9 days ago • 60

upvoted a paper 2 days ago

Efficient Image Synthesis with Sphere Latent Encoder

Paper • 2605.15592 • Published 6 days ago • 6

upvoted a paper 4 days ago

CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence

Paper • 2605.12882 • Published 8 days ago • 257

upvoted 2 papers 6 days ago

Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers

Paper • 2605.06169 • Published 14 days ago • 186

MuSS: A Large-Scale Dataset and Cinematic Narrative Benchmark for Multi-Shot Subject-to-Video Generation

Paper • 2604.23789 • Published 12 days ago • 5

upvoted 2 papers 27 days ago

The Illusion of Certainty: Decoupling Capability and Calibration in On-Policy Distillation

Paper • 2604.16830 • Published Apr 18 • 15

Universal statistical signatures of evolution in artificial intelligence architectures

Paper • 2604.10571 • Published Apr 12 • 4

upvoted 3 papers about 1 month ago

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Paper • 2604.08523 • Published Apr 9 • 262

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 629

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 503

upvoted 3 papers about 2 months ago

Towards a Medical AI Scientist

Paper • 2603.28589 • Published Mar 30 • 90

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published Mar 30 • 342

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 351

upvoted a paper 2 months ago

SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models

Paper • 2603.16859 • Published Mar 17 • 248