6 57 2

Minki Kang

Nardien

Nardien

AI & ML interests

None yet

Recent Activity

upvoted a paper about 17 hours ago

Qwen-AgentWorld: Language World Models for General Agents

upvoted a paper 8 days ago

TCOD: Exploring Temporal Curriculum in On-Policy Distillation for Multi-turn Autonomous Agents

upvoted a paper 8 days ago

Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients

View all activity

Organizations

upvoted a paper about 17 hours ago

Qwen-AgentWorld: Language World Models for General Agents

Paper • 2606.24597 • Published 2 days ago • 82

upvoted 2 papers 8 days ago

TCOD: Exploring Temporal Curriculum in On-Policy Distillation for Multi-turn Autonomous Agents

Paper • 2604.24005 • Published Apr 27 • 9

Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients

Paper • 2606.18216 • Published 9 days ago • 60

upvoted a paper 13 days ago

SpatialClaw: Rethinking Action Interface for Agentic Spatial Reasoning

Paper • 2606.13673 • Published 14 days ago • 106

upvoted 2 papers 20 days ago

TIDE: Proactive Multi-Problem Discovery via Template-Guided Iteration

Paper • 2606.04743 • Published 22 days ago • 46

HINT-SD: Targeted Hindsight Self-Distillation for Long-Horizon Agents

Paper • 2605.17873 • Published May 18 • 12

upvoted 2 papers 21 days ago

Cosmos 3: Omnimodal World Models for Physical AI

Paper • 2606.02800 • Published 24 days ago • 134

Benchmarking Visual State Tracking in Multimodal Video Understanding

Paper • 2606.03920 • Published 23 days ago • 50

upvoted a paper 27 days ago

OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources

Paper • 2605.29250 • Published 28 days ago • 78

authored a paper 27 days ago

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

Paper • 2605.28774 • Published 29 days ago • 93

commented a paper 28 days ago

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

Paper • 2605.28774 • Published 29 days ago • 93 •

upvoted 2 papers 28 days ago

Learn from Weaknesses: Automated Domain Specialization for Small Computer-Use Agents

Paper • 2605.28775 • Published 29 days ago • 38

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

Paper • 2605.28774 • Published 29 days ago • 93

upvoted a paper about 1 month ago

It Takes Two: Complementary Self-Distillation for Contextual Integrity in LLMs

Paper • 2605.20258 • Published May 18 • 30

submitted a paper to Daily Papers about 1 month ago

Nudging Beyond the Comfort Zone: Efficient Strategy-Guided Exploration for RLVR

Paper • 2605.15726 • Published May 15 • 34

upvoted 2 papers about 1 month ago

Nudging Beyond the Comfort Zone: Efficient Strategy-Guided Exploration for RLVR

Paper • 2605.15726 • Published May 15 • 34

PREPING: Building Agent Memory without Tasks

Paper • 2605.13880 • Published May 11 • 28

liked a dataset 2 months ago

nvidia/Nemotron-Terminal-Corpus

Viewer • Updated Feb 27 • 366k • 6.24k • 134

upvoted 2 papers 2 months ago

OpenGame: Open Agentic Coding for Games

Paper • 2604.18394 • Published Apr 20 • 84

Memory Transfer Learning: How Memories are Transferred Across Domains in Coding Agents

Paper • 2604.14004 • Published Apr 15 • 30

Minki Kang

AI & ML interests

Recent Activity

Organizations

Nardien's activity