Zujie Liang

jokieleung

https://jokieleung.github.io/

AI & ML interests

LLM/VLM Agents, reasoning

Recent Activity

upvoted a paper 9 days ago

Code as Agent Harness

Paper • 2605.18747 • Published 11 days ago • 209

upvoted a paper 3 months ago

MedXIAOHE: A Comprehensive Recipe for Building Medical MLLMs

Paper • 2602.12705 • Published Feb 13 • 68

upvoted a paper 4 months ago

Joint Selection for Large-Scale Pre-Training Data via Policy Gradient-based Mask Learning

Paper • 2512.24265 • Published Dec 30, 2025 • 4

upvoted 2 papers 8 months ago

Cache-to-Cache: Direct Semantic Communication Between Large Language Models

Paper • 2510.03215 • Published Oct 3, 2025 • 99

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

Paper • 2509.22576 • Published Sep 26, 2025 • 137

upvoted a paper 9 months ago

Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents

Paper • 2509.09265 • Published Sep 11, 2025 • 47