Active filters: rag
belyakoff/llama-3.2-3b-instruct-fine-tuned-gptq-8bit
Text Generation
• 1B • Updated • 1
• 2
Anirudh6778/t5_fineTuned_RAFT
Text Generation
• 0.2B • Updated • 2
belyakoff/SmolLM2-360M-Instruct-FT
Text Generation
• 0.4B • Updated • 4
• 2
mradermacher/docsgpt-7b-mistral-GGUF
7B • Updated • 141
mradermacher/docsgpt-7b-mistral-i1-GGUF
7B • Updated • 153
tensorblock/SmolLM2-360M-Instruct-FT-GGUF
Text Generation
• 0.4B • Updated • 45
• 1
itlwas/SmolLM2-360M-Instruct-FT-Q4_K_M-GGUF
Text Generation
• 0.4B • Updated • 17
tensorblock/docsgpt-7b-mistral-GGUF
doubleyyh/email-tuned-qwen2-lora
Text Generation
• Updated • 6
rokeya71/granite-embedding-125m-english-onnx
Feature Extraction
• Updated • 3
• 1
cnmoro/Qwen0.5b-RagSemanticChunker
Text Generation
• 0.5B • Updated • 6
• 4
Josephgflowers/Phinance-Phi-3.5-mini-instruct-finance-v0.3
4B • Updated • 7
• 1
altaidevorg/bge-m3-distill-8l-letsearch
Updated
cnmoro/Qwen3b-RagSemanticChunker
Text Generation
• 3B • Updated • 3
• 2
mradermacher/Qwen0.5b-RagSemanticChunker-GGUF
0.5B • Updated • 33
mradermacher/Qwen0.5b-RagSemanticChunker-i1-GGUF
0.5B • Updated • 66
0.6B • Updated
silma-ai/SILMA-Kashif-2B-Instruct-v1.0
Text Generation
• 3B • Updated • 2.25k
• 24
mradermacher/SILMA-Kashif-2B-Instruct-v1.0-GGUF
3B • Updated • 85
• 1
mradermacher/SILMA-Kashif-2B-Instruct-v1.0-i1-GGUF
3B • Updated • 104
• 1
H1tak3/rag-phishing-detector
0.6B • Updated • 1
• 1
mradermacher/Phinance-Phi-3.5-mini-instruct-finance-v0.3-GGUF
4B • Updated • 121
tjohn327/scion-minilm-l6-v2
22.7M • Updated • 4
tensorblock/SILMA-Kashif-2B-Instruct-v1.0-GGUF
Text Generation
• 3B • Updated • 29
Question Answering
• 8B • Updated • 16
• 5
mradermacher/Ext2Gen-8B-R2-GGUF
8B • Updated • 93
• 1
Tonic/c4ai-command-a-03-2025-4bit_nf4_double
Text Generation
• 114B • Updated • 5