Jiale Zhao

Heisenburger2000

·

https://scholar.google.com/citations?user=rtVg_VUAAAAJ&hl=en

Heisenburger2020

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

DOPD: Dual On-policy Distillation

upvoted a paper 5 days ago

BlockPilot: Instance-Adaptive Policy Learning for Diffusion-based Speculative Decoding

upvoted a paper 6 days ago

TUA-Bench: A Benchmark for General-Purpose Terminal-Use Agents

View all activity

Organizations

upvoted 2 papers 5 days ago

DOPD: Dual On-policy Distillation

Paper • 2606.30626 • Published 7 days ago • 96

BlockPilot: Instance-Adaptive Policy Learning for Diffusion-based Speculative Decoding

Paper • 2606.31315 • Published 6 days ago • 73

upvoted 2 papers 6 days ago

TUA-Bench: A Benchmark for General-Purpose Terminal-Use Agents

Paper • 2606.28480 • Published 10 days ago • 47

OSWorld2.0: Benchmarking Computer Use Agents on Long-Horizon Real-World Tasks

Paper • 2606.29537 • Published 8 days ago • 19

upvoted a paper 10 days ago

The Verification Horizon: No Silver Bullet for Coding Agent Rewards

Paper • 2606.26300 • Published 12 days ago • 47

upvoted 3 papers 11 days ago

Beyond NL2Code: A Structured Survey of Multimodal Code Intelligence

Paper • 2606.15932 • Published 20 days ago • 38

Improved Large Language Diffusion Models

Paper • 2606.25331 • Published 12 days ago • 43

Autodata: An agentic data scientist to create high quality synthetic data

Paper • 2606.25996 • Published 12 days ago • 18

upvoted a paper 12 days ago

Qwen-AgentWorld: Language World Models for General Agents

Paper • 2606.24597 • Published 13 days ago • 144

upvoted 3 papers 13 days ago

EnterpriseClawBench: Benchmarking Agents from Real Workplace Sessions

Paper • 2606.23654 • Published 14 days ago • 79

PlanBench-XL: Evaluating Long-Horizon Planning of LLM Tool-Use Agents in Large-Scale Tool Ecosystems

Paper • 2606.22388 • Published 15 days ago • 95

CLI-Universe: Towards Verifiable Task Synthesis Engine for Terminal Agents

Paper • 2606.22883 • Published 14 days ago • 37

upvoted 5 papers 20 days ago

VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models

Paper • 2606.16140 • Published 21 days ago • 122

BadWorld: Adversarial Attacks on World Models

Paper • 2606.16519 • Published 21 days ago • 18

BRDFusion: Physics Meets Generation for Urban Scene Inverse Rendering

Paper • 2606.17049 • Published 21 days ago • 27

Who Should Lead Decoding Now? Tracking Reliable Trajectories for Ensembling Masked Diffusion Language Models

Paper • 2606.16281 • Published 21 days ago • 34

CODA-BENCH: Can Code Agents Handle Data-Intensive Tasks?

Paper • 2606.15300 • Published 23 days ago • 13

upvoted a paper 24 days ago

FORT-Searcher: Synthesizing Shortcut-Resistant Search Tasks for Training Deep Search Agents

Paper • 2606.12087 • Published 26 days ago • 77

upvoted a paper 25 days ago

DeNovoSWE: Scaling Long-Horizon Environments for Generating Entire Repositories from Scratch

Paper • 2606.10728 • Published 27 days ago • 34

upvoted a paper 26 days ago

Workflow-GYM: Towards Long-Horizon Evaluation of Computer-use Agentic tasks in Real-World Professional Fields

Paper • 2606.11042 • Published 27 days ago • 22