Hugging Face

Zhang

Diluner

AI & ML interests

None yet

Recent Activity

upvoted a paper about 13 hours ago
Sparse Reward Subsystem in Large Language Models
authored a paper about 19 hours ago
Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning
upvoted a paper 1 day ago
Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning

Organizations

None yet

upvoted a paper about 13 hours ago

Sparse Reward Subsystem in Large Language Models

Paper • 2602.00986 • Published 3 days ago • 8
upvoted a paper 1 day ago

Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning

Paper • 2602.01058 • Published 3 days ago • 29
upvoted a paper 9 months ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6, 2025 • 189
upvoted a paper 10 months ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published Apr 17, 2025 • 93
upvoted a paper 12 months ago

Building A Proof-Oriented Programmer That Is 64% Better Than GPT-4o Under Data Scarsity

Paper • 2502.11901 • Published Feb 17, 2025 • 6
upvoted a paper over 1 year ago

Only-IF:Revealing the Decisive Effect of Instruction Diversity on Generalization

Paper • 2410.04717 • Published Oct 7, 2024 • 18