Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Wenbo Zhang's picture
1

Wenbo Zhang

Wenboz
https://onepounchman.github.io/

AI & ML interests

Trustworthy AI, LLMs

Recent Activity

upvoted a paper 1 day ago
Supervised Fine-Tuning versus Reinforcement Learning: A Study of Post-Training Methods for Large Language Models
updated a dataset 4 months ago
Wenboz/mistral-base-dpo-iter2-reward-logps-ultrafeedback
published a dataset 4 months ago
Wenboz/mistral-base-dpo-iter2-reward-logps-ultrafeedback
View all activity

Organizations

None yet

upvoted a paper 1 day ago

Supervised Fine-Tuning versus Reinforcement Learning: A Study of Post-Training Methods for Large Language Models

Paper • 2603.13985 • Published 4 days ago • 9
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs