Jianhong Wang

hsvgbkhgbv

70

·

https://hsvgbkhgbv.github.io/

AI & ML interests

multi-agent reinforcement learning, ad hoc teamwork, robust reinforcement learning

Recent Activity

updated a collection 21 days ago

upvoted a paper 21 days ago

From Trainee to Trainer: LLM-Designed Training Environment for RL with Multi-Agent Reasoning

upvoted a paper about 1 month ago

Unified Neural Scaling Laws

View all activity

Organizations

None yet

updated a collection 21 days ago

LLM papers

44 items • Updated 21 days ago

upvoted a paper 21 days ago

From Trainee to Trainer: LLM-Designed Training Environment for RL with Multi-Agent Reasoning

Paper • 2606.17682 • Published 26 days ago • 26

upvoted a paper about 1 month ago

Unified Neural Scaling Laws

Paper • 2605.26248 • Published May 25 • 7

updated a collection about 1 month ago

LLM papers

44 items • Updated 21 days ago

upvoted a paper about 1 month ago

Trust-Region Behavior Blending for On-Policy Distillation

Paper • 2605.31159 • Published May 29 • 69

upvoted a paper 2 months ago

StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction

Paper • 2605.06642 • Published May 7 • 28

updated a collection 2 months ago

LLM papers

44 items • Updated 21 days ago

upvoted a paper 2 months ago

ARIS: Autonomous Research via Adversarial Multi-Agent Collaboration

Paper • 2605.03042 • Published May 4 • 143

updated a collection 2 months ago

LLM papers

44 items • Updated 21 days ago

upvoted a paper 2 months ago

Web2BigTable: A Bi-Level Multi-Agent LLM System for Internet-Scale Information Search and Extraction

Paper • 2604.27221 • Published Apr 29 • 40

updated a collection 2 months ago

LLM papers

44 items • Updated 21 days ago

upvoted 2 papers 2 months ago

Heterogeneous Scientific Foundation Model Collaboration

Paper • 2604.27351 • Published Apr 30 • 222

Recursive Multi-Agent Systems

Paper • 2604.25917 • Published Apr 28 • 287

updated a collection 3 months ago

LLM papers

44 items • Updated 21 days ago

upvoted 5 papers 3 months ago

MultiWorld: Scalable Multi-Agent Multi-View Video World Models

Paper • 2604.18564 • Published Apr 20 • 47

Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

Paper • 2604.18292 • Published Apr 20 • 88

OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation

Paper • 2604.18486 • Published Apr 20 • 96

Towards Long-horizon Agentic Multimodal Search

Paper • 2604.12890 • Published Apr 14 • 20

From P(y|x) to P(y): Investigating Reinforcement Learning in Pre-train Space

Paper • 2604.14142 • Published Apr 15 • 30

updated a collection 3 months ago

LLM papers

44 items • Updated 21 days ago