Jeff Gao's picture

Jeff Gao

jeff-gao

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 11 days ago

SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue

upvoted a paper 11 days ago

Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses

upvoted a paper 11 days ago

APPO: Agentic Procedural Policy Optimization

View all activity

Organizations

None yet

upvoted 9 papers 11 days ago

SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue

Paper • 2605.30993 • Published 28 days ago • 59

Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses

Paper • 2606.02373 • Published 25 days ago • 57

APPO: Agentic Procedural Policy Optimization

Paper • 2606.12384 • Published 15 days ago • 77

Agentic Environment Engineering for Large Language Models: A Survey of Environment Modeling, Synthesis, Evaluation, and Application

Paper • 2606.12191 • Published 16 days ago • 67

Role-Agent: Bootstrapping LLM Agents via Dual-Role Evolution

Paper • 2606.10917 • Published 17 days ago • 76

OCC-RAG: Optimal Cognitive Core for Faithful Question Answering

Paper • 2606.00683 • Published 27 days ago • 96

WeaveBench: A Long-Horizon, Real-World Benchmark for Computer-Use Agents with Hybrid Interfaces

Paper • 2606.09426 • Published 18 days ago • 102

GrepSeek: Training Search Agents for Direct Corpus Interaction

Paper • 2605.29307 • Published 29 days ago • 112

Audio Interaction Model

Paper • 2606.05121 • Published 23 days ago • 119

liked a dataset about 1 month ago

zhifeixie/Voices-in-the-Wild-2M

Updated 28 days ago • 11k • 43

upvoted a paper about 2 months ago

Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows

Paper • 2604.28139 • Published Apr 30 • 42

upvoted 2 papers 3 months ago

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Paper • 2603.04597 • Published Mar 4 • 211

OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published Mar 10 • 158

upvoted 3 papers 4 months ago

Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published Mar 3 • 198

SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise

Paper • 2602.12783 • Published Feb 13 • 246

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

Paper • 2602.05400 • Published Feb 5 • 356

upvoted 4 papers 5 months ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published Jan 30 • 113

ASTRA: Automated Synthesis of agentic Trajectories and Reinforcement Arenas

Paper • 2601.21558 • Published Jan 29 • 61

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation

Paper • 2601.09688 • Published Jan 14 • 128

User-Oriented Multi-Turn Dialogue Generation with Tool Use at scale

Paper • 2601.08225 • Published Jan 13 • 53