AmberYifan/Qwen2.5-7B-Instruct-ultrafeedback-SPIN-iter2
Text Generation
• 8B • Updated • 1
AmberYifan/Qwen2.5-7B-Instruct-ultrafeedback-DRIFT-iter2
Text Generation
• 8B • Updated • 1
AmberYifan/Qwen2.5-7B-Instruct-ultrafeedback-iterdpo-iter1
Text Generation
• 8B • Updated • 1
AmberYifan/Qwen2.5-7B-Instruct-ultrafeedback-spin-iter1
Text Generation
• 8B • Updated
AmberYifan/Qwen2.5-7B-Instruct-ultrafeedback-10k
Text Generation
• 8B • Updated • 1
AmberYifan/Qwen2.5-7B-Instruct-ultrafeedback-nspin-iter1
Text Generation
• 8B • Updated • 1
AmberYifan/llama3-8b-full-pretrain-control-tweet-1m-en-sft
Text Generation
• 8B • Updated • 1
AmberYifan/llama3-8b-full-pretrain-mix-low-tweet-1m-en-sft
Text Generation
• 8B • Updated • 2
AmberYifan/llama3-8b-full-pretrain-mix-mid-tweet-1m-en-sft
Text Generation
• 8B • Updated • 1
AmberYifan/llama3-8b-full-pretrain-mix-high-tweet-1m-en-sft
Text Generation
• 8B • Updated • 1
AmberYifan/Qwen2.5-7B-Instruct-wildfeedback-iterDPO-iter2
Text Generation
• 8B • Updated • 2 • 1
AmberYifan/llama3-8b-full-pretrain-junk-tweet-1m-en-sft
Text Generation
• 8B • Updated • 1
AmberYifan/Qwen2.5-7B-Instruct-userfeedback-on-policy-iter1
Text Generation
• 8B • Updated • 1 • 1
AmberYifan/Qwen2.5-7B-Instruct-userfeedback-4k-iter2
Text Generation
• 8B • Updated • 1
AmberYifan/Qwen2.5-7B-Instruct-userfeedback-4k-iter1
Text Generation
• 8B • Updated • 2
AmberYifan/Qwen2.5-7B-Instruct-wildfeedback-SPIN-iter2
Text Generation
• 8B • Updated • 2 • 1
AmberYifan/Qwen2.5-7B-Instruct-userfeedback-SPIN-iter1
Text Generation
• 8B • Updated • 1 • 1
AmberYifan/Qwen2.5-7B-Instruct-userfeedback-SFT-SPIN-iter2
Text Generation
• 8B • Updated
AmberYifan/Qwen2.5-7B-Instruct-userfeedback-SFT-SPIN-iter1
Text Generation
• 8B • Updated • 1 • 1
AmberYifan/Qwen2.5-7B-Instruct-userfeedback-SFT
Text Generation
• 8B • Updated • 1 • 1
AmberYifan/llama3-8b-full-pretrain-mix-low-tweet-1m-en
Text Generation
• 8B • Updated • 2
AmberYifan/llama3-8b-full-pretrain-mix-mid-tweet-1m-en
Text Generation
• 8B • Updated • 1
AmberYifan/llama3-8b-full-pretrain-mix-high-tweet-1m-en
Text Generation
• 8B • Updated • 5
AmberYifan/Qwen2.5-7B-Instruct-userfeedback-iter3
Text Generation
• 8B • Updated
AmberYifan/Qwen2.5-7B-Instruct-noseed-userfeedback-iter2
Text Generation
• 8B • Updated
AmberYifan/Qwen2.5-7B-Instruct-noseed-userfeedback-iter1
Text Generation
• 8B • Updated • 1
AmberYifan/llama3-8b-full-pretrain-control-tweet-1m-en
Text Generation
• 8B • Updated • 3
AmberYifan/llama3-8b-full-pretrain-junk-tweet-1m-en
Text Generation
• 8B • Updated
AmberYifan/Qwen2.5-7B-Instruct-ultrafeedback-20k
Text Generation
• 8B • Updated • 1
AmberYifan/Qwen2.5-7B-Instruct-wildfeedback-20k
Text Generation
• 8B • Updated