Yifan Wang

AmberYifan

AI & ML interests

None yet

Recent Activity

updated a model about 1 month ago

AmberYifan/qwen3-8b_ultrafeedback_grpo_structure_only_step38

published a model about 1 month ago

AmberYifan/qwen3-8b_ultrafeedback_grpo_structure_only_step38

updated a model about 1 month ago

AmberYifan/qwen3-8b_openrubrics_v2_grpo_structure_only_step60

AmberYifan's models (131)

AmberYifan/qwen2.5-7b-instruct-full-pretrain-control-tweet-1m-en-sft

Text Generation • 8B • Updated Sep 19, 2025 • 2

AmberYifan/qwen2.5-0.5b-instruct-full-pretrain-control-tweet-1m-en

Text Generation • 0.5B • Updated Sep 19, 2025 • 3

AmberYifan/qwen2.5-0.5b-instruct-full-pretrain-junk-tweet-1m-en

Text Generation • 0.5B • Updated Sep 19, 2025 • 3

AmberYifan/qwen2.5-7b-instruct-full-pretrain-control-tweet-1m-en

Text Generation • 8B • Updated Sep 19, 2025 • 4

AmberYifan/qwen2.5-7b-instruct-full-pretrain-junk-tweet-1m-en

Text Generation • 8B • Updated Sep 19, 2025 • 2

AmberYifan/qwen3-4b-thinking-full-pretrain-mix-low-tweet-1m-en-sft

Text Generation • 4B • Updated Sep 16, 2025 • 2

AmberYifan/qwen3-4b-thinking-full-pretrain-mix-mid-tweet-1m-en-sft

Text Generation • 4B • Updated Sep 16, 2025 • 2

AmberYifan/qwen3-4b-thinking-full-pretrain-mix-high-tweet-1m-en-sft

Text Generation • 4B • Updated Sep 16, 2025 • 1

AmberYifan/qwen3-4b-thinking-full-pretrain-junk-tweet-1m-en-sft

Text Generation • 4B • Updated Sep 16, 2025 • 2

AmberYifan/qwen3-4b-thinking-full-pretrain-control-tweet-1m-en-sft

Text Generation • 4B • Updated Sep 16, 2025 • 2

AmberYifan/qwen3-4b-thinking-full-pretrain-control-tweet-1m-en

Text Generation • 4B • Updated Sep 16, 2025 • 1

AmberYifan/qwen3-4b-thinking-full-pretrain-junk-tweet-1m-en

Text Generation • 4B • Updated Sep 16, 2025

AmberYifan/qwen3-4b-thinking-full-pretrain-mix-low-tweet-1m-en-gpt-sft

Text Generation • 4B • Updated Aug 30, 2025 • 2

AmberYifan/qwen3-4b-thinking-full-pretrain-mix-mid-tweet-1m-en-gpt-sft

Text Generation • 4B • Updated Aug 30, 2025 • 2

AmberYifan/qwen3-4b-thinking-full-pretrain-mix-high-tweet-1m-en-gpt-sft

Text Generation • 4B • Updated Aug 30, 2025 • 2

AmberYifan/qwen3-4b-thinking-full-pretrain-junk-tweet-1m-en-gpt-sft

Text Generation • 4B • Updated Aug 30, 2025 • 2

AmberYifan/qwen3-4b-thinking-full-pretrain-control-tweet-1m-en-gpt-sft

Text Generation • 4B • Updated Aug 30, 2025 • 2

AmberYifan/Qwen2.5-14B-Instruct-ultrafeedback-DRIFT-iter2-RPO

Text Generation • 841k • Updated Aug 7, 2025 • 3

AmberYifan/Qwen2.5-14B-Instruct-ultrafeedback-spin-iter2-RPO

Text Generation • 841k • Updated Aug 7, 2025 • 2

AmberYifan/Qwen2.5-14B-Instruct-ultrafeedback-iterdpo-iter2-RPO

Text Generation • 841k • Updated Aug 7, 2025 • 4 • 1

AmberYifan/Qwen2.5-14B-Instruct-wildfeedback-RPO-DRIFT-iter2

Text Generation • 841k • Updated Jul 30, 2025 • 3

AmberYifan/Qwen2.5-14B-Instruct-wildfeedback-RPO-iterDPO-iter2

Text Generation • 841k • Updated Jul 30, 2025 • 3

AmberYifan/Qwen2.5-14B-Instruct-wildfeedback-RPO-SPIN-iter2

Text Generation • 841k • Updated Jul 30, 2025 • 2

AmberYifan/llama3-8b-full-pretrain-mix-low-tweet-1m-en-gpt-sft

Text Generation • 8B • Updated Jul 3, 2025 • 2

AmberYifan/llama3-8b-full-pretrain-mix-mid-tweet-1m-en-gpt-sft

Text Generation • 8B • Updated Jul 3, 2025 • 3

AmberYifan/llama3-8b-full-pretrain-mix-high-tweet-1m-en-gpt-sft

Text Generation • 8B • Updated Jul 3, 2025 • 2

AmberYifan/llama3-8b-full-pretrain-junk-tweet-1m-en-gpt-sft

Text Generation • 8B • Updated Jul 3, 2025 • 2

AmberYifan/llama3-8b-full-pretrain-control-tweet-1m-en-gpt-sft

Text Generation • 8B • Updated Jul 3, 2025 • 2

AmberYifan/Qwen2.5-7B-Instruct-ultrafeedback-iterDPO-iter2

Text Generation • 8B • Updated Jun 30, 2025 • 10 • 1

AmberYifan/Qwen2.5-7B-Instruct-ultrafeedback-SPIN-iter2

Text Generation • 8B • Updated Jun 30, 2025 • 9 • 1