Plyusov

daniilplyusov

9

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 11 days ago

Multi-LCB: Extending LiveCodeBench to Multiple Programming Languages

upvoted a paper 23 days ago

Interpreting and Steering a Text-to-Speech Language Model with Sparse Autoencoders

upvoted a paper about 1 month ago

Trust-Region Behavior Blending for On-Policy Distillation

View all activity

Organizations

None yet

upvoted a paper 11 days ago

Multi-LCB: Extending LiveCodeBench to Multiple Programming Languages

Paper • 2606.20517 • Published 15 days ago • 60

upvoted a paper 23 days ago

Interpreting and Steering a Text-to-Speech Language Model with Sparse Autoencoders

Paper • 2606.10029 • Published 24 days ago • 12

upvoted a paper about 1 month ago

Trust-Region Behavior Blending for On-Policy Distillation

Paper • 2605.31159 • Published May 29 • 68

upvoted a paper 4 months ago

Next Embedding Prediction Makes World Models Stronger

Paper • 2603.02765 • Published Mar 3 • 21

upvoted 2 papers 5 months ago

Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation

Paper • 2602.12125 • Published Feb 12 • 68

F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare

Paper • 2602.06717 • Published Feb 6 • 75

upvoted a paper 11 months ago

Enhancing Vision-Language Model Training with Reinforcement Learning in Synthetic Worlds for Real-World Success

Paper • 2508.04280 • Published Aug 6, 2025 • 35

upvoted a paper about 1 year ago

Train Sparse Autoencoders Efficiently by Utilizing Features Correlation

Paper • 2505.22255 • Published May 28, 2025 • 24

upvoted a paper over 1 year ago

Analyze Feature Flow to Enhance Interpretation and Steering in Language Models

Paper • 2502.03032 • Published Feb 5, 2025 • 59

published a model over 1 year ago

daniilplyusov/reward_model

Updated Feb 2, 2025