Active filters: full
SimpleBerry/LLaMA-O1-Base-1127
Text Generation
• 8B • Updated
• 6
• 18
SimpleBerry/LLaMA-O1-Supervised-1129
Text Generation
• 8B • Updated
• 4
• 23
mlfoundations-dev/hp_ablations_gemma_lr1e-5
Text Generation
• 9B • Updated
• 1
mlfoundations-dev/hp_ablations_gemma_lr5e-6
Text Generation
• 9B • Updated
• 7
mlfoundations-dev/hp_ablations_gemma_lr1e-6
Text Generation
• 9B • Updated
• 7
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.05
Text Generation
• 9B • Updated
• 1
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.10
Text Generation
• 9B • Updated
• 7
mlfoundations-dev/hp_ablations_gemma_scheduler_constant
Text Generation
• 9B • Updated
• 3
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.15
Text Generation
• 9B • Updated
• 12
mlfoundations-dev/hp_ablations_gemma_scheduler_inverse_sqrt
Text Generation
• 9B • Updated
• 1
mlfoundations-dev/hp_ablations_gemma_scheduler_linear_warmup0.05
Text Generation
• 9B • Updated
mlfoundations-dev/hp_ablations_gemma_scheduler_linear_warmup0.10
Text Generation
• 9B • Updated
• 3
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.05_minlr1e-6
Text Generation
• 8B • Updated
• 3
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.10_minlr1e-6
Text Generation
• 8B • Updated
• 2
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.05_minlr1e-7
Text Generation
• 8B • Updated
• 9
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.05_minlr5e-7
Text Generation
• 8B • Updated
• 5
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.10_minlr5e-7
Text Generation
• 8B • Updated
• 3
mlfoundations-dev/hp_ablations_qwen_scheduler_cosine_warmup0.10_minlr1e-7
Text Generation
• 8B • Updated
• 3
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.05_minlr1e-6
Text Generation
• 9B • Updated
• 3
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.10_minlr5e-7
Text Generation
• 9B • Updated
• 2
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.05_minlr1e-7
Text Generation
• 9B • Updated
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.10_minlr1e-7
Text Generation
• 9B • Updated
• 7
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.05_minlr5e-7
Text Generation
• 9B • Updated
• 1
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.10_minlr1e-6
Text Generation
• 9B • Updated
• 2
mradermacher/LLaMA-O1-Base-1127-GGUF
8B • Updated
• 35
mradermacher/LLaMA-O1-Supervised-1129-GGUF
8B • Updated
• 97
sanjay920/llama-3.2-1b-coral.org-expert
1B • Updated
• 11
tensorblock/OH_original_wo_null_sources-GGUF
tensorblock/oh-dcft-v3-llama3.1-nemotron-70b_shareGPT_format-GGUF
lightblue/qwen2.5-7B-instruct-kto
Text Generation
• 8B • Updated
• 4