Mensa Coate

mensacoate

AI & ML interests

None yet

Recent Activity

upvoted a paper about 18 hours ago

Concept-Guided Fine-Tuning: Steering ViTs away from Spurious Correlations to Improve Robustness

upvoted a paper about 18 hours ago

AutoResearch-RL: Perpetual Self-Evaluating Reinforcement Learning Agents for Autonomous Neural Architecture Discovery

upvoted a paper about 18 hours ago

CARE-Edit: Condition-Aware Routing of Experts for Contextual Image Editing

View all activity

Organizations

None yet

upvoted 4 papers about 18 hours ago

Concept-Guided Fine-Tuning: Steering ViTs away from Spurious Correlations to Improve Robustness

Paper • 2603.08309 • Published 1 day ago • 11

AutoResearch-RL: Perpetual Self-Evaluating Reinforcement Learning Agents for Autonomous Neural Architecture Discovery

Paper • 2603.07300 • Published 3 days ago • 9

CARE-Edit: Condition-Aware Routing of Experts for Contextual Image Editing

Paper • 2603.08589 • Published 1 day ago • 30

LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory

Paper • 2603.03269 • Published 7 days ago • 42

upvoted 9 papers 7 months ago

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published May 30, 2025 • 277

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2, 2025 • 188

Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA

Paper • 2505.21115 • Published May 27, 2025 • 142

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263

Rep-MTL: Unleashing the Power of Representation-level Task Saliency for Multi-Task Learning

Paper • 2507.21049 • Published Jul 28, 2025 • 41

Kwai Keye-VL Technical Report

Paper • 2507.01949 • Published Jul 2, 2025 • 131

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10, 2025 • 160

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1, 2025 • 251

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17, 2025 • 261

upvoted 5 papers 9 months ago

ChARM: Character-based Act-adaptive Reward Modeling for Advanced Role-Playing Language Agents

Paper • 2505.23923 • Published May 29, 2025 • 8

upvoted 2 papers 11 months ago

Multi-Token Attention

Paper • 2504.00927 • Published Apr 1, 2025 • 56

DreamActor-M1: Holistic, Expressive and Robust Human Image Animation with Hybrid Guidance

Paper • 2504.01724 • Published Apr 2, 2025 • 68

Mensa Coate

AI & ML interests

Recent Activity

Organizations

mensacoate's activity