Boris Shaposhnikov

borisshapa

AI & ML interests

NLP

Organizations

None yet

Recent Activity

authored a paper 17 days ago

Trust-Region Behavior Blending for On-Policy Distillation

Paper • 2605.31159 • Published 28 days ago • 66

upvoted a paper 25 days ago

Trust-Region Behavior Blending for On-Policy Distillation

Paper • 2605.31159 • Published 28 days ago • 66

upvoted a paper 5 months ago

F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare

Paper • 2602.06717 • Published Feb 6 • 75

upvoted a paper 11 months ago

Enhancing Vision-Language Model Training with Reinforcement Learning in Synthetic Worlds for Real-World Success

Paper • 2508.04280 • Published Aug 6, 2025 • 35

upvoted a paper about 1 year ago

Train Sparse Autoencoders Efficiently by Utilizing Features Correlation

Paper • 2505.22255 • Published May 28, 2025 • 24

updated a model about 1 year ago

borisshapa/sft-llama3.1-8b-uch

8B • Updated Apr 15, 2025 • 4

published a model about 1 year ago

borisshapa/sft-llama3.1-8b-uch

8B • Updated Apr 15, 2025 • 4

updated 2 models over 1 year ago

borisshapa/ppo-4x-mistral-7b-smallsft-tldr

Text Generation • 7B • Updated Mar 20, 2025 • 4

borisshapa/ppo-2x-mistral-7b-smallsft-tldr

Text Generation • 7B • Updated Mar 20, 2025 • 2

published a model over 1 year ago

borisshapa/ppo-4x-mistral-7b-smallsft-tldr

Text Generation • 7B • Updated Mar 20, 2025 • 4

updated a model over 1 year ago

borisshapa/ppo-8x-mistral-7b-smallsft-tldr

Text Generation • 7B • Updated Mar 20, 2025 • 3

published 2 models over 1 year ago

borisshapa/ppo-2x-mistral-7b-smallsft-tldr

Text Generation • 7B • Updated Mar 20, 2025 • 2

borisshapa/ppo-8x-mistral-7b-smallsft-tldr

Text Generation • 7B • Updated Mar 20, 2025 • 3

updated a model over 1 year ago

borisshapa/sft-qwen2.5-0.5b-uf

Updated Mar 18, 2025

published a model over 1 year ago

borisshapa/sft-qwen2.5-0.5b-uf

Updated Mar 18, 2025

upvoted 2 papers over 1 year ago

You Do Not Fully Utilize Transformer's Representation Capacity

Paper • 2502.09245 • Published Feb 13, 2025 • 37

Analyze Feature Flow to Enhance Interpretation and Steering in Language Models

Paper • 2502.03032 • Published Feb 5, 2025 • 60

authored a paper over 1 year ago

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published Feb 3, 2025 • 113

upvoted a paper over 1 year ago

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published Feb 3, 2025 • 113

updated a model over 1 year ago

borisshapa/rm-opt-350m-hs2

Text Generation • 0.3B • Updated Dec 2, 2024 • 4
