
haoran

haorannlp

AI & ML interests

nlp, language model

Recent Activity

liked a dataset 7 days ago
sojuL/RubricHub_v1
liked a dataset 12 days ago
nvidia/Nemotron-Pretraining-SFT-v1
new activity 12 days ago
nvidia/Nemotron-Pretraining-SFT-v1:Request to access Pre-trainning-SFT dataset

Organizations

LMKnowhere

New activity in nvidia/Nemotron-Pretraining-SFT-v1 12 days ago

Request to access Pre-trainning-SFT dataset

👀 1
#5 opened 12 days ago by
haorannlp
commented 2 papers 6 months ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7, 2025 • 181 • 21

New activity in deepseek-ai/DeepSeek-R1-0528-Qwen3-8B 8 months ago

Can you please release how you post-train qwen3 on deepseek?

2
#12 opened 8 months ago by
ZeroWw
New activity in DavidAU/Qwen3-30B-A6B-16-Extreme 9 months ago

Is this a finetune?

👍 1
5
#1 opened 9 months ago by
Trappu