shuoxing/llama3-8b-full-pretrain-mix-high-tweet-1m-en-no-packing-new • Text Generation • 266k • Updated 20 days ago • 35
shuoxing/llama3-8b-full-pretrain-control-tweet-1m-en-no-packing-new • Text Generation • 266k • Updated 20 days ago • 31