Avi Trost

atrost

avitrost

Recent Activity

upvoted a paper 12 days ago

When Does Trajectory-Level Supervision Permit Efficient Offline Reinforcement Learning?

updated a model 12 days ago

atrost/q1_dp0_loss_kl_pcgrad_b0150_savefix_2xh100

published a model 12 days ago

atrost/q1_dp0_loss_kl_pcgrad_b0150_savefix_2xh100

upvoted a paper 12 days ago

When Does Trajectory-Level Supervision Permit Efficient Offline Reinforcement Learning?

Paper • 2606.18531 • Published 14 days ago • 4

upvoted a paper 2 months ago

Exploration and Exploitation Errors Are Measurable for Language Model Agents

Paper • 2604.13151 • Published Apr 14 • 25

upvoted 2 papers 3 months ago

Test-Time Scaling Makes Overtraining Compute-Optimal

Paper • 2604.01411 • Published Apr 1 • 28

SlopCodeBench: Benchmarking How Coding Agents Degrade Over Long-Horizon Iterative Tasks

Paper • 2603.24755 • Published Mar 25 • 30

upvoted 2 papers 4 months ago

RubiCap: Rubric-Guided Reinforcement Learning for Dense Image Captioning

Paper • 2603.09160 • Published Mar 10 • 17

SkillOrchestra: Learning to Route Agents via Skill Transfer

Paper • 2602.19672 • Published Feb 23 • 58

upvoted a paper 7 months ago

Revisiting Generalization Across Difficulty Levels: It's Not So Easy

Paper • 2511.21692 • Published Nov 26, 2025 • 15

upvoted a paper 8 months ago

Trove: A Flexible Toolkit for Dense Retrieval

Paper • 2511.01857 • Published Nov 3, 2025 • 12

upvoted 2 papers about 1 year ago

R&B: Domain Regrouping and Data Mixture Balancing for Efficient Foundation Model Training

Paper • 2505.00358 • Published May 1, 2025 • 26

Antidistillation Sampling

Paper • 2504.13146 • Published Apr 17, 2025 • 60