haoran

haorannlp

AI & ML interests

nlp, language model

Recent Activity

liked a dataset 9 days ago

Glint-Research/Fable-5-traces

liked a dataset about 1 month ago

SCAI-JHU/ThoughtTrace

liked a dataset about 1 month ago

Qwen/WebWorldData

New activity in Nanbeige/ToolMind-Web-QA 4 months ago

download error

#1 opened 4 months ago by

New activity in Qwen/Qwen3.5-122B-A10B 4 months ago

Benchmarks against Qwen Coder Next 80B

#2 opened 4 months ago by

New activity in nvidia/Nemotron-Pretraining-SFT-v1 5 months ago

Request to access Pre-training-SFT dataset

#5 opened 5 months ago by

commented 2 papers 11 months ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7, 2025 • 190

New activity in deepseek-ai/DeepSeek-R1-0528-Qwen3-8B about 1 year ago

Can you please release how you post-train qwen3 on deepseek?

#12 opened about 1 year ago by

New activity in DavidAU/Qwen3-30B-A6B-16-Extreme about 1 year ago

Is this a finetune?

#1 opened about 1 year ago by