2 13 1

Zhuokai Zhao

zhuokai

https://zhuokai-zhao.com/

AI & ML interests

Data-Efficient Learning, LLM Reasoning and Safety, Active Learning, Recommender System

Recent Activity

upvoted a paper 25 days ago

OmniOPD: Logit-Free On-Policy Distillation via Speculative Verification

upvoted a paper about 2 months ago

DecodingTrust-Agent Platform (DTap): A Controllable and Interactive Red-Teaming Platform for AI Agents

authored a paper 3 months ago

Synthetic Sandbox for Training Machine Learning Engineering Agents

View all activity

Organizations

upvoted a paper 25 days ago

OmniOPD: Logit-Free On-Policy Distillation via Speculative Verification

Paper • 2606.01476 • Published 29 days ago • 8

upvoted a paper about 2 months ago

DecodingTrust-Agent Platform (DTap): A Controllable and Interactive Red-Teaming Platform for AI Agents

Paper • 2605.04808 • Published May 6 • 20

authored a paper 3 months ago

Synthetic Sandbox for Training Machine Learning Engineering Agents

Paper • 2604.04872 • Published Apr 6 • 14

upvoted a paper 3 months ago

Synthetic Sandbox for Training Machine Learning Engineering Agents

Paper • 2604.04872 • Published Apr 6 • 14

authored 2 papers 6 months ago

Preference Optimization with Multi-Sample Comparisons

Paper • 2410.12138 • Published Oct 16, 2024

Token-Level LLM Collaboration via FusionRoute

Paper • 2601.05106 • Published Jan 8 • 40

upvoted a paper 6 months ago

Token-Level LLM Collaboration via FusionRoute

Paper • 2601.05106 • Published Jan 8 • 40

authored a paper 8 months ago

Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published Nov 5, 2025 • 83

upvoted a paper 8 months ago

Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published Nov 5, 2025 • 83

authored 10 papers 8 months ago

From Uncertainty to Trust: Enhancing Reliability in Vision-Language Models with Uncertainty-Guided Dropout Decoding

Paper • 2412.06474 • Published Dec 9, 2024

CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning

Paper • 2503.19900 • Published Mar 25, 2025

Boosting LLM Reasoning via Spontaneous Self-Correction

Paper • 2506.06923 • Published Jun 7, 2025

RecoWorld: Building Simulated Environments for Agentic Recommender Systems

Paper • 2509.10397 • Published Sep 12, 2025 • 8

StreamMem: Query-Agnostic KV Cache Memory for Streaming Video Understanding

Paper • 2508.15717 • Published Aug 21, 2025 • 1

Let it Calm: Exploratory Annealed Decoding for Verifiable Reinforcement Learning

Paper • 2510.05251 • Published Oct 6, 2025 • 8

Thought Communication in Multiagent Collaboration

Paper • 2510.20733 • Published Oct 23, 2025 • 15

upvoted a paper 8 months ago

Thought Communication in Multiagent Collaboration

Paper • 2510.20733 • Published Oct 23, 2025 • 15

Zhuokai Zhao

AI & ML interests

Recent Activity

Organizations

zhuokai's activity