Zhang Jiahui

zhangjiahuise

AI & ML interests

None yet

Recent Activity

upvoted a paper about 20 hours ago

Count Anything

liked a model 1 day ago

Bittoby1040/happyQuasarv21

liked a dataset 3 days ago

ryanmarten/OpenThoughts-1k-sample

Organizations

None yet

upvoted a paper about 20 hours ago

Count Anything

Paper • 2605.30846 • Published 5 days ago • 7

upvoted a paper 5 days ago

Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players

Paper • 2605.28816 • Published 7 days ago • 417

upvoted a paper 12 days ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published 22 days ago • 195

upvoted a paper 13 days ago

Model-Adaptive Tool Necessity Reveals the Knowing-Doing Gap in LLM Tool Use

Paper • 2605.14038 • Published 21 days ago • 15

upvoted a paper 20 days ago

L2P: Unlocking Latent Potential for Pixel Generation

Paper • 2605.12013 • Published 22 days ago • 36

upvoted a paper 27 days ago

AcademiClaw: When Students Set Challenges for AI Agents

Paper • 2605.02661 • Published about 1 month ago • 17

upvoted a paper about 1 month ago

DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off

Paper • 2604.13902 • Published Apr 15 • 62

upvoted 3 papers about 2 months ago

Training a Student Expert via Semi-Supervised Foundation Model Distillation

Paper • 2604.03841 • Published Apr 4 • 10

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Paper • 2604.08523 • Published Apr 9 • 263

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published Apr 8 • 326

upvoted 4 papers 2 months ago

MonitorBench: A Comprehensive Benchmark for Chain-of-Thought Monitorability in Large Language Models

Paper • 2603.28590 • Published Mar 30 • 22

ClawKeeper: Comprehensive Safety Protection for OpenClaw Agents Through Skills, Plugins, and Watchers

Paper • 2603.24414 • Published Mar 25 • 183

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 352

Pixel-level Scene Understanding in One Token: Visual States Need What-is-Where Composition

Paper • 2603.13904 • Published Mar 14 • 4

upvoted 3 papers 3 months ago

