mlfoundations-dev/llama3-1_8b_mlfoundations-dev-stackexchange_quantumcomputing
Text Generation
• 8B • Updated mlfoundations-dev/llama3-1_8b_mlfoundations-dev-stackexchange_politics
Text Generation
• 8B • Updated mlfoundations-dev/llama3-1_8b_mlfoundations-dev-stackexchange_physics
8B • Updated mlfoundations-dev/llama3-1_8b_mlfoundations-dev-stackexchange_proofassistants
Text Generation
• 8B • Updated mlfoundations-dev/llama3-1_8b_mlfoundations-dev-stackexchange_poker
Text Generation
• 8B • Updated mlfoundations-dev/oh-mistral-bs512_lr2_00E-06_schedulerconstant_warmup1_00E-01_minlr
Text Generation
• 7B • Updated • 1
mlfoundations-dev/oh-mistral-bs512_lr2_00E-06_schedulercosine_with_min_lr_warmup1_00E-01_minlr5_00E-07
Text Generation
• 7B • Updated mlfoundations-dev/oh-mistral-bs1024_lr2_00E-06_schedulerconstant_warmup1_00E-01_minlr
Text Generation
• 7B • Updated mlfoundations-dev/oh-mistral-bs1024_lr2_00E-06_schedulercosine_with_min_lr_warmup1_00E-01_minlr5_00E-07
Text Generation
• 7B • Updated mlfoundations-dev/oh-mistral-bs2048_lr2_00E-06_schedulerconstant_warmup1_00E-01_minlr
Text Generation
• 7B • Updated mlfoundations-dev/oh-mistral-bs2048_lr2_00E-06_schedulercosine_with_min_lr_warmup1_00E-01_minlr5_00E-07
Text Generation
• 7B • Updated • 1
mlfoundations-dev/oh-mistral-bs2048_lr5_00E-06_schedulercosine_with_min_lr_warmup1_00E-01_minlr5_00E-07
Text Generation
• 7B • Updated mlfoundations-dev/oh-mistral-bs4096_lr2_00E-06_schedulerconstant_warmup1_00E-01_minlr
Text Generation
• 7B • Updated • 2
mlfoundations-dev/oh-mistral-bs4096_lr2_00E-06_schedulercosine_with_min_lr_warmup1_00E-01_minlr5_00E-07
Text Generation
• 7B • Updated • 1
mlfoundations-dev/oh-mistral-bs4096_lr5_00E-06_schedulercosine_with_min_lr_warmup1_00E-01_minlr5_00E-07
Text Generation
• 7B • Updated • 1
mlfoundations-dev/llama2_oh_teknium_scaling_down_random_1.0
Text Generation
• 7B • Updated mlfoundations-dev/oh-mistral-bs512_lr5_00E-06_schedulercosine_with_min_lr_warmup1_00E-01_minlr5_00E-07
Text Generation
• 7B • Updated mlfoundations-dev/oh-mistral-bs512_lr2_00E-06_schedulercosine_with_min_lr_warmup5_00E-02_minlr5_00E-07
Text Generation
• 7B • Updated • 2
mlfoundations-dev/llama2_oh_teknium_scaling_down_random_0.9
Text Generation
• 7B • Updated mlfoundations-dev/oh-mistral-bs1024_lr5_00E-06_schedulercosine_with_min_lr_warmup1_00E-01_minlr5_00E-07
Text Generation
• 7B • Updated mlfoundations-dev/oh-mistral-bs1024_lr2_00E-06_schedulercosine_with_min_lr_warmup5_00E-02_minlr5_00E-07
Text Generation
• 7B • Updated mlfoundations-dev/llama2_oh_teknium_scaling_down_random_0.8
Text Generation
• 7B • Updated mlfoundations-dev/oh-mistral-bs2048_lr2_00E-06_schedulercosine_with_min_lr_warmup5_00E-02_minlr5_00E-07
Text Generation
• 7B • Updated • 12
mlfoundations-dev/llama2_oh_teknium_scaling_down_random_0.7
Text Generation
• 7B • Updated mlfoundations-dev/oh-mistral-bs4096_lr2_00E-06_schedulercosine_with_min_lr_warmup5_00E-02_minlr5_00E-07
Text Generation
• 7B • Updated mlfoundations-dev/oh-mistral-bs4096_lr5_00E-06_schedulercosine_with_min_lr_warmup5_00E-02_minlr5_00E-07
Text Generation
• 7B • Updated • 1
mlfoundations-dev/llama2_oh_teknium_scaling_down_random_0.6
Text Generation
• 7B • Updated • 2
mlfoundations-dev/llama2_oh_teknium_scaling_down_random_0.5
Text Generation
• 7B • Updated mlfoundations-dev/llama2_oh_teknium_scaling_down_random_0.4
Text Generation
• 7B • Updated mlfoundations-dev/tinyllama_alpaca_sft_sample
Text Generation
• 1B • Updated • 3