Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
5
Yuxuan Wan
yuxuanw8
Follow
0 followers
·
4 following
AI & ML interests
None yet
Recent Activity
updated
a model
2 days ago
yuxuanw8/qwen3b-rlcr-kl-beta0.01-hotpot
updated
a model
2 days ago
yuxuanw8/qwen3b-rlcr-kl-beta0.1-hotpot
published
a model
2 days ago
yuxuanw8/qwen3b-rlcr-kl-beta0.01-hotpot
View all activity
Organizations
yuxuanw8
's models
42
Sort: Recently updated
yuxuanw8/qwen3b-rlcr-kl-beta0.01-hotpot
Text Generation
•
3B
•
Updated
2 days ago
•
36
yuxuanw8/qwen3b-rlcr-kl-beta0.1-hotpot
Text Generation
•
3B
•
Updated
2 days ago
•
38
yuxuanw8/qwen3b-rlcr-hotpot
Text Generation
•
3B
•
Updated
2 days ago
•
34
yuxuanw8/pythia2.8b-oai-summary-ppopet-0.5ep
Text Generation
•
3B
•
Updated
Jan 27
•
3
yuxuanw8/pythia2.8b-oai-summary-ppo-1ep
Text Generation
•
3B
•
Updated
Jan 27
•
2
yuxuanw8/qwen25-1.5b_ultrafeedback_pet_1e-5_nsample8
Text Classification
•
2B
•
Updated
Jan 26
•
1
yuxuanw8/pythia6.9b-oai-summary-chipo-1ep
Text Generation
•
7B
•
Updated
Jan 25
•
1
yuxuanw8/pythia6.9b-oai-summary-rpo-1ep
Text Generation
•
7B
•
Updated
Jan 25
•
2
yuxuanw8/pythia6.9b-oai-summary-dpo-1ep
Text Generation
•
7B
•
Updated
Jan 25
•
1
yuxuanw8/pythia-2.8b_summarization_pet_1e-5_cleanrl
Text Classification
•
3B
•
Updated
Jan 24
•
1
yuxuanw8/qwen25-1.5b_ultrafeedback_dpo_lr1e-4
Text Generation
•
2B
•
Updated
Jan 23
•
3
yuxuanw8/qwen25-1.5b_ultrafeedback_rpo_lr1e-4
Text Generation
•
2B
•
Updated
Jan 23
•
3
yuxuanw8/qwen25-1.5b_ultrafeedback_chipo_lr1e-4
Text Generation
•
2B
•
Updated
Jan 23
•
2
yuxuanw8/pythia2.8b-oai-summary-chipo-1ep
Text Generation
•
3B
•
Updated
Jan 18
•
3
yuxuanw8/pythia2.8b-oai-summary-rpo-1ep
Text Generation
•
3B
•
Updated
Jan 18
•
3
yuxuanw8/pythia2.8b-oai-summary-dpo-1ep
Text Generation
•
3B
•
Updated
Jan 17
•
1
yuxuanw8/pythia-2.8b_summarization_dpo_lr1e-4
Text Generation
•
3B
•
Updated
Jan 17
•
2
yuxuanw8/pythia-6.9b_summarization_rm_lr1e-4
Text Classification
•
7B
•
Updated
Jan 17
•
2
yuxuanw8/pythia-6.9b_summarization_sft_lr1e-4
Text Generation
•
7B
•
Updated
Jan 17
•
3
yuxuanw8/qwen25-1.5b_ultrafeedback_rm_lr3e-4_3ep
Text Classification
•
2B
•
Updated
Jan 16
•
2
yuxuanw8/qwen25-1.5b_ultrafeedback_sft_lr1e-4
Text Generation
•
2B
•
Updated
Jan 16
•
3
•
yuxuanw8/pythia-2.8b_summarization_pet_lr1e-5
Text Classification
•
3B
•
Updated
Jan 16
•
1
yuxuanw8/pythia-2.8b_summarization_rm_lr3e-4_3ep
Text Classification
•
3B
•
Updated
Jan 16
•
1
yuxuanw8/pythia-2.8b_summarization_sft_lr1e-4
Text Generation
•
3B
•
Updated
Jan 16
•
1
yuxuanw8/pythia-2.8b_summarization_dpo_lr1e-5
Text Generation
•
3B
•
Updated
Jan 7
•
3
yuxuanw8/gpt2_large_imdb_pet_3e-6_v2
Text Generation
•
0.8B
•
Updated
Dec 26, 2025
•
2
yuxuanw8/gpt2_large_imdb_pet_3e-5
Text Generation
•
0.8B
•
Updated
Dec 26, 2025
•
2
yuxuanw8/gpt2_large_imdb_pet_1e-5_v2_optim
Text Generation
•
0.8B
•
Updated
Dec 26, 2025
•
2
yuxuanw8/gpt2_large_imdb_pet_1e-5_v2
Text Generation
•
0.8B
•
Updated
Dec 26, 2025
•
3
yuxuanw8/pythia1b-sft-tldr
Text Generation
•
1B
•
Updated
Nov 28, 2025
•
21
Previous
1
2
Next