-
-
-
-
-
-
Inference Providers
Active filters: meta
matrixportalx/Llama-3.2-3B-Instruct-Q2_K-GGUF
Text Generation
• 3B • Updated
• 1
matrixportalx/Llama-2-7b-chat-hf-Q4_K_M-GGUF
Text Generation
• 7B • Updated
• 148
• 6
Raj-Maharajwala/Open-Insurance-LLM-Llama3-8B-GGUF
Text Generation
• 8B • Updated
• 164
• 5
raethehacker/Llama-3.1-8B-Instruct-Q2_K-GGUF
Text Generation
• 8B • Updated
• 5
• 1
alices2/Llama-3.2-1B-Instruct-Q4_K_M-GGUF
Text Generation
• 1B • Updated
• 1
executorch-community/Llama-3.2-1B-Instruct-SpinQuant_INT4_EO8-ET
Text Generation
• Updated
• 23
• 1
executorch-community/Llama-3.2-1B-Instruct-QLORA_INT4_EO8-ET
Text Generation
• Updated
• 5
• 3
matrixportalx/Llama-3.1-8B-Instruct-IQ4_NL-GGUF
Text Generation
• 8B • Updated
• 1
mlx-community/Meta-Llama-3.1-8B-Instruct-3bit
Text Generation
• Updated
• 29
artificialguybr/LLAMA3.2-1B-Synthia-I-Redmond
Text Generation
• 1B • Updated
• 9
• 1
artificialguybr/LLAMA3.2-1B-Synthia-I-Redmond-gguf
1B • Updated
• 359
• 1
mav23/Llama-2-70B-fp16-GGUF
Text Generation
• 69B • Updated
• 8
matrixportalx/Llama-3.1-8B-Instruct-IQ3_M-GGUF
Text Generation
• 8B • Updated
• 1
Text Generation
• 8B • Updated
• 5
artificialguybr/LLAMA3.2-1B-Synthia-II-Redmond
Text Generation
• 1B • Updated
• 21
• 1
artificialguybr/LLAMA3.2-1B-Synthia-II-Redmond-gguf
1B • Updated
• 126
• 1
Maites/Llama-3.2-1B-Instruct-Q4_K_M-GGUF
Text Generation
• 1B • Updated
• 1
Maites/Llama-3.2-1B-Instruct-Q8_0-GGUF
Text Generation
• 1B • Updated
• 5
irinachengsc/Llama-3.1-8B-Q4_0-GGUF
Text Generation
• 8B • Updated
Triangle104/Meta-Llama-3.1-8B-Instruct-Q5_K_S-GGUF
8B • Updated
Triangle104/Meta-Llama-3.1-8B-Instruct-Q5_K_M-GGUF
8B • Updated
• 5
Triangle104/Meta-Llama-3.1-8B-Instruct-Q6_K-GGUF
8B • Updated
• 47
Triangle104/Meta-Llama-3.1-8B-Instruct-Q8_0-GGUF
8B • Updated
• 1
Maites/Llama-3.2-3B-Instruct-Q4_K_M-GGUF
Text Generation
• 3B • Updated
• 4
Text Generation
• 8B • Updated
Almheiri/Llama-3.2-1B-Instruct-QLORA_INT4_EO8
Text Generation
• 1B • Updated
• 4
• base16/Llama-3.2-1B-Instruct-4bit
Text Generation
• 0.2B • Updated
• 18
• 1
Text Generation
• Updated
• 19
• 13
tensorblock/EZO-Llama-3.2-3B-Instruct-dpoE-GGUF
Text Generation
• 3B • Updated
• 21
Text Generation
• 8B • Updated