Jun Feng

junfeng0288

·

junfeng0288

AI & ML interests

None yet

Organizations

upvoted 6 papers 3 months ago

KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation

Paper • 2604.08455 • Published Apr 9 • 48

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Paper • 2604.08523 • Published Apr 9 • 265

SkillClaw: Let Skills Evolve Collectively with Agentic Evolver

Paper • 2604.08377 • Published Apr 9 • 295

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published Apr 8 • 329

From Reasoning to Agentic: Credit Assignment in Reinforcement Learning for Large Language Models

Paper • 2604.09459 • Published Apr 13 • 14

InCoder-32B-Thinking: Industrial Code World Model for Thinking

Paper • 2604.03144 • Published Apr 3 • 239

upvoted 4 papers 4 months ago

InCoder-32B: Code Foundation Model for Industrial Scenarios

Paper • 2603.16790 • Published Mar 17 • 312

OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published Mar 10 • 158

PIRA-Bench: A Transition from Reactive GUI Agents to GUI-based Proactive Intent Recommendation Agents

Paper • 2603.08013 • Published Mar 9 • 15

AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios

Paper • 2602.23166 • Published Feb 26 • 45

upvoted 2 papers 5 months ago

Gaia2: Benchmarking LLM Agents on Dynamic and Asynchronous Environments

Paper • 2602.11964 • Published Feb 12 • 13

DataChef: Cooking Up Optimal Data Recipes for LLM Adaptation via Reinforcement Learning

Paper • 2602.11089 • Published Feb 11 • 18

upvoted a paper 11 months ago

MathReal: We Keep It Real! A Real Scene Benchmark for Evaluating Math Reasoning in Multimodal Large Language Models

Paper • 2508.06009 • Published Aug 8, 2025 • 16