·
AI & ML interests
None yet
Organizations
AmberYifan/Qwen2.5-3B-MATH-GRPO
Updated
AmberYifan/Qwen2.5-3B-Instruct-MATH-MARL-structure-gating
Updated
AmberYifan/Qwen2.5-3B-Instruct-GRPO
Updated
AmberYifan/Qwen2.5-3B-Instruct-MATH-MARL-structure
Updated
AmberYifan/Qwen3-1.7B-MATH-MARL-test
Updated
AmberYifan/Qwen3-1.7B-MATH-MARL-mysw-diameter
Updated
AmberYifan/Qwen3-1.7B-MATH-MARL-mysw-diameter-relative
Updated
AmberYifan/qwen3-0.6b-p36-sft
Updated
AmberYifan/qwen3-0.6b-mmlu-sft
Updated
AmberYifan/Llama-3.1-8B-Instruct-tulu-sft-30k
Updated
AmberYifan/Llama-3.1-8B-Instruct-tulu-sft-12k
Updated
AmberYifan/qwen3-0.6b-tulu-sft-12k
Updated
AmberYifan/qwen3-0.6b-math500-sft-12k
Updated
AmberYifan/qwen3-0.6b-mmlu-sft-12k
Updated
AmberYifan/Qwen3-1.7B-Polaris-MARL-mysw-relative
2B • Updated AmberYifan/Qwen3-1.7B-MATH-MARL-mysw-relative
Text Generation
• 2B • Updated AmberYifan/Qwen3-1.7B-MATH-MARL-mysw-relative-qwen3-0.6b-embedding
Text Generation
• 2B • Updated • 1
AmberYifan/Qwen3-1.7B-MATH-MARL-diameter-relative-qwen3-0.6b-embedding
Updated
AmberYifan/Qwen3-1.7B-MATH-MARL-sw-relative-qwen3-0.6b-embedding
Updated
AmberYifan/Qwen3-1.7B-MATH-GRPO-tuned
Updated
AmberYifan/Qwen3-1.7B-MATH-MARL-diameter-relative
Updated
AmberYifan/Qwen3-1.7B-MATH-MARL-sw-relative
Updated
AmberYifan/Qwen3-1.7B-MATH-MARL-structure-relative
Updated
AmberYifan/qwen3-0.6b-alpaca-sft
Text Generation
• 0.6B • Updated AmberYifan/Qwen2.5-7B-Instruct-wildfeedback-SPIN-iter5
Updated
AmberYifan/Qwen2.5-7B-Instruct-wildfeedback-DRIFT-iter5
Updated
AmberYifan/Qwen2.5-7B-Instruct-wildfeedback-iterDPO-iter5
Updated
AmberYifan/Qwen2.5-7B-Instruct-wildfeedback-iterDPO-NoPrompt-iter2
Updated
AmberYifan/Qwen2.5-7B-Instruct-wildfeedback-DRIFT-iter4
Updated
AmberYifan/Qwen2.5-7B-Instruct-wildfeedback-SPIN-iter3-T1.0
Updated