shuoxing/qwen2-5-0.5b-full-pretrain-control-tweet-1m-en-reproduce-bs4 Text Generation • 0.5B • Updated Jan 25 • 2
shuoxing/qwen2-5-0.5b-full-pretrain-mix-high-tweet-1m-en-reproduce-bs4 Text Generation • 0.5B • Updated Jan 25
shuoxing/qwen2-5-0.5b-full-pretrain-mix-mid-tweet-1m-en-reproduce-bs4 Text Generation • 0.5B • Updated Jan 24 • 2
shuoxing/qwen2-5-0.5b-full-pretrain-mix-low-tweet-1m-en-reproduce-bs4 Text Generation • 0.5B • Updated Jan 24 • 2
shuoxing/llama3-8b-full-sft-mix-high-tweet-1m-en-reproduce-bs16 Text Generation • 266k • Updated Dec 30, 2025 • 2
shuoxing/llama3-8b-full-sft-mix-mid-tweet-1m-en-reproduce-bs16 Text Generation • 266k • Updated Dec 30, 2025 • 2
shuoxing/llama3-8b-full-sft-mix-low-tweet-1m-en-reproduce-bs16 Text Generation • 266k • Updated Dec 30, 2025 • 2
shuoxing/llama3-8b-full-pretrain-control-tweet-1m-en-reproduce-bs8 Text Generation • 266k • Updated Dec 27, 2025 • 2
shuoxing/llama3-8b-full-pretrain-junk-tweet-1m-en-reproduce-bs8 Text Generation • 266k • Updated Dec 25, 2025 • 71