
Andreas Stöffelbauer

andreasskyscanner

AI & ML interests

None yet

Recent Activity

upvoted a paper about 6 hours ago
You Only Judge Once: Multi-response Reward Modeling in a Single Forward Pass
upvoted a paper about 14 hours ago
Lightning OPD: Efficient Post-Training for Large Reasoning Models with Offline On-Policy Distillation
upvoted a paper about 14 hours ago
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Organizations

None yet

andreasskyscanner's models (2)

andreasskyscanner/llama-31-hhrlhf-squad-rlhf-policy-model

Text Generation • 1B • Updated Jul 1, 2025 • 1

andreasskyscanner/llama-32-hhrlhf-reward-adapter

Updated Jul 1, 2025