山下颯太

thomas-taylor

AI & ML interests

None yet

Recent Activity

upvoted a paper about 7 hours ago

ShortOPD: Recovering Pruned LLMs with Short-to-Long On-Policy Distillation

liked a model 9 days ago

liked a dataset 10 days ago

augustander/mist-her2-sample

View all activity

Organizations

None yet

upvoted a paper about 7 hours ago

ShortOPD: Recovering Pruned LLMs with Short-to-Long On-Policy Distillation

Paper • 2607.13124 • Published 3 days ago • 10

upvoted 2 papers about 1 month ago

ABot-Earth 0.5: Generative 3D Earth Model

Paper • 2606.09967 • Published Jun 8 • 486

Kwai Keye-VL-2.0 Technical Report

Paper • 2606.10651 • Published Jun 9 • 192

upvoted 6 papers about 2 months ago

Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players

Paper • 2605.28816 • Published May 27 • 431

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

Paper • 2605.21467 • Published May 20 • 207

AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration

Paper • 2605.20025 • Published May 19 • 191

Segment Anything with Motion, Geometry, and Semantic Adaptation for Complex Nonlinear Visual Object Tracking

Paper • 2605.22538 • Published May 21 • 6

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published May 12 • 196

Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation

Paper • 2605.11739 • Published May 13 • 60

upvoted a paper 2 months ago

Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers

Paper • 2605.06169 • Published May 7 • 238

upvoted a paper 4 months ago

Marco DeepResearch: Unlocking Efficient Deep Research Agents via Verification-Centric Design

Paper • 2603.28376 • Published Mar 30 • 24