Mila ALLEN's picture

Mila ALLEN

owenf2023

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 hour ago

ChildVox: A Speech, Audio, and Large Audio-Language Model Benchmark in Understanding and Characterizing Sound across Childhood

upvoted a paper 1 day ago

Brain-IT-VQA: From Brain Signals to Answers

liked a dataset 1 day ago

ankile/real01c-marker-insert-d1-baseline-uniform-r0b

View all activity

Organizations

None yet

upvoted a paper about 1 hour ago

ChildVox: A Speech, Audio, and Large Audio-Language Model Benchmark in Understanding and Characterizing Sound across Childhood

Paper • 2605.29257 • Published 7 days ago • 9

upvoted a paper 1 day ago

Brain-IT-VQA: From Brain Signals to Answers

Paper • 2605.29588 • Published 7 days ago • 12

upvoted a paper 10 days ago

Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers

Paper • 2605.06169 • Published 28 days ago • 233

upvoted a paper 12 days ago

AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration

Paper • 2605.20025 • Published 16 days ago • 185

upvoted 4 papers about 2 months ago

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Paper • 2604.05015 • Published Apr 6 • 236

SkillClaw: Let Skills Evolve Collectively with Agentic Evolver

Paper • 2604.08377 • Published Apr 9 • 291

SIM1: Physics-Aligned Simulator as Zero-Shot Data Scaler in Deformable Worlds

Paper • 2604.08544 • Published Apr 9 • 16

When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models

Paper • 2604.08546 • Published Apr 9 • 115

upvoted 2 papers 2 months ago

NearID: Identity Representation Learning via Near-identity Distractors

Paper • 2604.01973 • Published Apr 2 • 32

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 352

upvoted 7 papers 3 months ago

Efficient Reasoning with Balanced Thinking

Paper • 2603.12372 • Published Mar 12 • 150

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 372

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Paper • 2603.04597 • Published Mar 4 • 211

Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published Mar 3 • 198

From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models

Paper • 2602.22859 • Published Feb 26 • 150

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published Feb 11 • 221

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 525