🔄 In a Training Loop

Aleksandr Samarin

astrlrd

AI & ML interests

None yet

Recent Activity

updated a model 28 days ago

nebius/EAGLE3-Llama-3.1-8B-Instruct

updated a model 28 days ago

nebius/EAGLE3-Qwen3-235B-A22B-Instruct-2507

updated a model 28 days ago

nebius/EAGLE3-Llama-3.3-70B-Instruct

upvoted a paper about 1 month ago

SlimSpec: Low-Rank Draft LM-Head for Accelerated Speculative Decoding

Paper • 2605.10453 • Published May 11 • 9

upvoted 2 papers 4 months ago

SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale

Paper • 2602.23866 • Published Feb 27 • 91

LK Losses: Direct Acceptance Rate Optimization for Speculative Decoding

Paper • 2602.23881 • Published Feb 27 • 18

upvoted 2 collections 4 months ago

Infinity-Instruct-Completions

Training data for speculative decoding draft models, containing model-generated responses for 660K prompts from Infinity-Instruct-0625. • 6 items • Updated Mar 13 • 3

LK-Speculators

High-performance speculative decoding draft models trained using LK losses, novel training objectives that directly optimize acceptance rate • 9 items • Updated Mar 3 • 5

upvoted a paper 4 months ago

Blockwise Advantage Estimation for Multi-Objective RL with Verifiable Rewards

Paper • 2602.10231 • Published Feb 10 • 13

upvoted a paper 11 months ago

Training Long-Context, Multi-Turn Software Engineering Agents with Reinforcement Learning

Paper • 2508.03501 • Published Aug 5, 2025 • 59

upvoted a paper about 1 year ago

SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents

Paper • 2505.20411 • Published May 26, 2025 • 97