Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
avinashkumarkashyap 's Collections
Avinash

Avinash

updated about 14 hours ago
Upvote
-

  • Decoding as Optimisation on the Probability Simplex: From Top-K to Top-P (Nucleus) to Best-of-K Samplers

    Paper • 2602.18292 • Published 4 days ago • 10

  • VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

    Paper • 2602.10693 • Published 13 days ago • 165
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs