7 45 2

Jinheon Baek

jinheon

AI & ML interests

None yet

Recent Activity

upvoted a paper 9 days ago

Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients

upvoted a paper 21 days ago

TIDE: Proactive Multi-Problem Discovery via Template-Guided Iteration

upvoted a paper 23 days ago

K-BrowseComp: A Web Browsing Agent Benchmark Grounded in Korean Contexts

View all activity

Organizations

upvoted a paper 9 days ago

Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients

Paper • 2606.18216 • Published 10 days ago • 61

upvoted a paper 21 days ago

TIDE: Proactive Multi-Problem Discovery via Template-Guided Iteration

Paper • 2606.04743 • Published 23 days ago • 46

upvoted a paper 23 days ago

K-BrowseComp: A Web Browsing Agent Benchmark Grounded in Korean Contexts

Paper • 2606.02404 • Published 25 days ago • 57

upvoted a paper 28 days ago

OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources

Paper • 2605.29250 • Published 29 days ago • 78

upvoted 3 papers 29 days ago

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

Paper • 2605.28774 • Published 30 days ago • 93

Learn from Weaknesses: Automated Domain Specialization for Small Computer-Use Agents

Paper • 2605.28775 • Published 30 days ago • 38

ResearchMath-14K: Scaling Research-Level Mathematics via Agents

Paper • 2605.28003 • Published 30 days ago • 50

upvoted 4 papers about 1 month ago

It Takes Two: Complementary Self-Distillation for Contextual Integrity in LLMs

Paper • 2605.20258 • Published May 18 • 30

On the limits and opportunities of AI reviewers: Reviewing the reviews of Nature-family papers with 45 expert scientists

Paper • 2605.20668 • Published May 20 • 12

Nudging Beyond the Comfort Zone: Efficient Strategy-Guided Exploration for RLVR

Paper • 2605.15726 • Published May 15 • 34

PREPING: Building Agent Memory without Tasks

Paper • 2605.13880 • Published May 11 • 28

upvoted a paper 2 months ago

Memory Transfer Learning: How Memories are Transferred Across Domains in Coding Agents

Paper • 2604.14004 • Published Apr 15 • 30

upvoted 2 papers 4 months ago

MA-EgoQA: Question Answering over Egocentric Videos from Multiple Embodied Agents

Paper • 2603.09827 • Published Mar 10 • 30

MolHIT: Advancing Molecular-Graph Generation with Hierarchical Discrete Diffusion Models

Paper • 2602.17602 • Published Feb 19 • 56

upvoted a paper 5 months ago

THINKSAFE: Self-Generated Safety Alignment for Reasoning Models

Paper • 2601.23143 • Published Jan 30 • 39

upvoted a paper 6 months ago

Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation

Paper • 2601.00664 • Published Jan 2 • 58

upvoted a paper 7 months ago

WorldMM: Dynamic Multimodal Memory Agent for Long Video Reasoning

Paper • 2512.02425 • Published Dec 2, 2025 • 25

upvoted 2 papers 8 months ago

Adaptive Multi-Agent Response Refinement in Conversational Systems

Paper • 2511.08319 • Published Nov 11, 2025 • 42

KORMo: Korean Open Reasoning Model for Everyone

Paper • 2510.09426 • Published Oct 10, 2025 • 87

upvoted a paper 9 months ago

Multimodal Prompt Optimization: Why Not Leverage Multiple Modalities for MLLMs

Paper • 2510.09201 • Published Oct 10, 2025 • 50

Jinheon Baek

AI & ML interests

Recent Activity

Organizations

jinheon's activity