Alexey Khokhulin

alexey-khokhulin

6 1

·

alexeykhokhulin

AI & ML interests

None yet

Recent Activity

upvoted a paper 10 days ago

One-Step Gradient Delay is Not a Barrier for Large-Scale Asynchronous Pipeline Parallel LLM Pretraining

upvoted a paper about 1 month ago

Trust-Region Behavior Blending for On-Policy Distillation

liked a Space 5 months ago

t-tech/manifolds

View all activity

Organizations

None yet

upvoted a paper 10 days ago

One-Step Gradient Delay is Not a Barrier for Large-Scale Asynchronous Pipeline Parallel LLM Pretraining

Paper • 2606.30634 • Published 11 days ago • 24

upvoted a paper about 1 month ago

Trust-Region Behavior Blending for On-Policy Distillation

Paper • 2605.31159 • Published May 29 • 69

liked a Space 5 months ago

Chasing the Counting Manifold in Open LLMs

Counting manifolds in open LLMs from behavior to SAEs.

upvoted a paper 5 months ago

F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare

Paper • 2602.06717 • Published Feb 6 • 75

upvoted 2 papers 7 months ago

T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground

Paper • 2512.10430 • Published Dec 11, 2025 • 121

ESSA: Evolutionary Strategies for Scalable Alignment

Paper • 2507.04453 • Published Jul 6, 2025 • 5

authored a paper 7 months ago

ESSA: Evolutionary Strategies for Scalable Alignment

Paper • 2507.04453 • Published Jul 6, 2025 • 5

upvoted a paper 11 months ago

Enhancing Vision-Language Model Training with Reinforcement Learning in Synthetic Worlds for Real-World Success

Paper • 2508.04280 • Published Aug 6, 2025 • 35