
Alexey Malakhov

ZeL1k7
  • alekseymalakhov11

Organizations

T-Bank-AI, T-Tech

authored a paper 3 months ago

F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare

Paper • 2602.06717 • Published Feb 6 • 74
authored a paper 5 months ago

ESSA: Evolutionary Strategies for Scalable Alignment

Paper • 2507.04453 • Published Jul 6, 2025 • 5
authored a paper over 1 year ago

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published Feb 3, 2025 • 113
authored a paper about 2 years ago

Learn Your Reference Model for Real Good Alignment

Paper • 2404.09656 • Published Apr 15, 2024 • 90