Active filters: gpu
ConfidentialMind/gte-multilingual-reranker-base-onnx-op19-opt-gpu
Sentence Similarity
• Updated • 17
Robotics
• Updated
sbeierle/fame-pytorch-kit
Updated
excribe/classifer_sgd_longformer_4099
Text Classification
• 0.1B • Updated • 5
Text Generation
• Updated
AhmedAyman/k2-think-cuda-1505
Text Generation
• Updated • 3
Eltamuan/Gravitas-Torch-2.8-Blackwell-Edition
Updated
magiccodingman/Qwen3-4B-Instruct-2507-MXFP4-Hybrid-GGUF
Text Generation
• 4B • Updated • 114
magiccodingman/Qwen3-4B-Thinking-2507-MXFP4-Hybrid-GGUF
Text Generation
• 4B • Updated • 37
• 1
magiccodingman/Qwen3-4B-Thinking-2507-Unsloth-MXFP4-Hybrid-GGUF
Text Generation
• 4B • Updated • 32
• 1
magiccodingman/Qwen3-4B-Instruct-2507-Unsloth-MXFP4-Hybrid-GGUF
Text Generation
• 4B • Updated • 109
• 2
magiccodingman/Seed-OSS-36B-Instruct-Unsloth-MXFP4-Hybrid-GGUF
Text Generation
• 36B • Updated • 14
• 1
magiccodingman/Granite-4.0-H-350M-Unsloth-MXFP4-Hybrid-GGUF
Text Generation
• 0.3B • Updated • 23
magiccodingman/Granite-4.0-H-1B-Unsloth-MXFP4-Hybrid-GGUF
Text Generation
• 1B • Updated • 36
magiccodingman/Apriel-1.5-15b-Thinker-Unsloth-MXFP4-Hybrid-GGUF
Text Generation
• 14B • Updated • 291
magiccodingman/Qwen3-VL-8B-Thinking-Unsloth-MXFP4-Hybrid-GGUF
Text Generation
• 8B • Updated • 326
• 1
magiccodingman/Qwen3-VL-8B-Instruct-Unsloth-MXFP4-Hybrid-GGUF
Text Generation
• 8B • Updated • 137
• 2
magiccodingman/Qwen3-VL-32B-Thinking-Unsloth-MXFP4-Hybrid-GGUF
Text Generation
• 33B • Updated • 47
magiccodingman/Granite-4.0-H-350M-Unsloth-MagicQuant-Hybrid-GGUF
Text Generation
• 0.3B • Updated • 99
• 1
magiccodingman/Qwen3-4B-Instruct-2507-Unsloth-MagicQuant-Hybrid-GGUF
Text Generation
• 4B • Updated • 541
• 8
magiccodingman/Qwen3-4B-Thinking-2507-Unsloth-MagicQuant-Hybrid-GGUF
Text Generation
• 4B • Updated • 501
• 2
magiccodingman/Qwen3-30B-A3B-Thinking-2507-unsloth-MagicQuant-Hybrid-GGUF
Text Generation
• 31B • Updated • 511
• 5
magiccodingman/Qwen3-30B-A3B-Instruct-2507-unsloth-MagicQuant-Hybrid-GGUF
Text Generation
• 31B • Updated • 457
• 4
magiccodingman/Seed-OSS-36B-Instruct-unsloth-MagicQuant-Hybrid-GGUF
Text Generation
• 36B • Updated • 1.03k
• 10
magiccodingman/Apriel-1.5-15b-Thinker-unsloth-MagicQuant-Hybrid-GGUF
Text Generation
• 14B • Updated • 369
• 2
Stan31/quantumflow-prototypes
Updated
Jong-Seong/qwen3-next-gb10-guide
Updated
Hellohal2064/vllm-dgx-spark-gb10
Text Generation
• Updated • 5
Jens-Duttke/DepthPro-ONNX-HighPerf
Depth Estimation
• Updated • 6
• 1
wekkel/Qwen3-32B-Instruct-DirectML-INT4
Text Generation
• Updated • 3