AmberYifan/Qwen2.5-7B-Instruct-ultrafeedback-SPIN-iter2 Text Generation • 8B • Updated Jun 30, 2025 • 2 • 1
AmberYifan/Qwen2.5-7B-Instruct-ultrafeedback-DRIFT-iter2 Text Generation • 8B • Updated Jun 29, 2025 • 1
AmberYifan/Qwen2.5-7B-Instruct-ultrafeedback-iterdpo-iter1 Text Generation • 8B • Updated Jun 29, 2025 • 5 • 1
AmberYifan/Qwen2.5-7B-Instruct-ultrafeedback-spin-iter1 Text Generation • 8B • Updated Jun 29, 2025 • 5
AmberYifan/Qwen2.5-7B-Instruct-ultrafeedback-nspin-iter1 Text Generation • 8B • Updated Jun 29, 2025 • 4 • 1
AmberYifan/llama3-8b-full-pretrain-control-tweet-1m-en-sft Text Generation • 8B • Updated Jun 26, 2025 • 1
AmberYifan/llama3-8b-full-pretrain-mix-low-tweet-1m-en-sft Text Generation • 8B • Updated Jun 26, 2025 • 9
AmberYifan/llama3-8b-full-pretrain-mix-mid-tweet-1m-en-sft Text Generation • 8B • Updated Jun 26, 2025 • 3
AmberYifan/llama3-8b-full-pretrain-mix-high-tweet-1m-en-sft Text Generation • 8B • Updated Jun 26, 2025 • 9
AmberYifan/Qwen2.5-7B-Instruct-wildfeedback-iterDPO-iter2 Text Generation • 8B • Updated Jun 25, 2025 • 7 • 1
AmberYifan/llama3-8b-full-pretrain-junk-tweet-1m-en-sft Text Generation • 8B • Updated Jun 25, 2025 • 6
AmberYifan/Qwen2.5-7B-Instruct-userfeedback-on-policy-iter1 Text Generation • 8B • Updated Jun 25, 2025 • 5 • 1
AmberYifan/Qwen2.5-7B-Instruct-userfeedback-4k-iter2 Text Generation • 8B • Updated Jun 24, 2025 • 4 • 1
AmberYifan/Qwen2.5-7B-Instruct-wildfeedback-SPIN-iter2 Text Generation • 8B • Updated Jun 21, 2025 • 13 • 1
AmberYifan/Qwen2.5-7B-Instruct-userfeedback-SPIN-iter1 Text Generation • 8B • Updated Jun 20, 2025 • 6 • 1
AmberYifan/Qwen2.5-7B-Instruct-userfeedback-SFT-SPIN-iter2 Text Generation • 8B • Updated Jun 20, 2025 • 1
AmberYifan/Qwen2.5-7B-Instruct-userfeedback-SFT-SPIN-iter1 Text Generation • 8B • Updated Jun 20, 2025 • 8 • 1
AmberYifan/llama3-8b-full-pretrain-mix-low-tweet-1m-en Text Generation • 8B • Updated Jun 19, 2025 • 4
AmberYifan/llama3-8b-full-pretrain-mix-mid-tweet-1m-en Text Generation • 8B • Updated Jun 19, 2025 • 6
AmberYifan/llama3-8b-full-pretrain-mix-high-tweet-1m-en Text Generation • 8B • Updated Jun 19, 2025 • 5
AmberYifan/Qwen2.5-7B-Instruct-noseed-userfeedback-iter2 Text Generation • 8B • Updated Jun 13, 2025 • 2
AmberYifan/Qwen2.5-7B-Instruct-noseed-userfeedback-iter1 Text Generation • 8B • Updated Jun 13, 2025 • 4
AmberYifan/llama3-8b-full-pretrain-control-tweet-1m-en Text Generation • 8B • Updated Jun 11, 2025 • 7