Active filters: sparse
RedHatAI/Llama-2-7b-pruned50-retrained
Text Generation
• 7B • Updated • 11
RedHatAI/Llama-2-7b-pruned70-retrained
Text Generation
• 7B • Updated • 115
• 1
RedHatAI/Llama-2-7b-ultrachat200k-pruned_50
Text Generation
• 7B • Updated • 14
RedHatAI/Llama-2-7b-ultrachat200k-pruned_70
Text Generation
• 7B • Updated • 8
RedHatAI/Llama-2-7b-ultrachat200k-pruned_50-quantized-deepsparse
Text Generation
• Updated • 8
RedHatAI/Llama-2-7b-ultrachat200k-pruned_70-quantized-deepsparse
Text Generation
• Updated • 11
RedHatAI/Llama-2-7b-evol-code-alpaca-pruned_50
Text Generation
• 7B • Updated • 19
RedHatAI/Llama-2-7b-evol-code-alpaca-pruned_70
Text Generation
• 7B • Updated • 7
RedHatAI/Llama-2-7b-evol-code-alpaca-pruned_50-quantized-deepsparse
Text Generation
• Updated • 12
RedHatAI/Llama-2-7b-evol-code-alpaca-pruned_70-quantized-deepsparse
Text Generation
• Updated • 11
RedHatAI/Llama-2-7b-dolphin-open_platypus-pruned_50
Text Generation
• 7B • Updated • 14
RedHatAI/Llama-2-7b-dolphin-open_platypus-pruned_70
Text Generation
• 7B • Updated • 10
RedHatAI/Llama-2-7b-dolphin-open_platypus-pruned_50-quantized-deepsparse
Text Generation
• Updated • 5
RedHatAI/Llama-2-7b-dolphin-open_platypus-pruned_70-quantized-deepsparse
Text Generation
• Updated • 8
• 1
kettleguts/zephyr-7b-beta_sparse05
Text Generation
• 7B • Updated • 88
dtransposed/llama2.c-stories110M-pruned50-compressed-tensors
Text Generation
• Updated • 2
RedHatAI/Llama-2-7b-gsm8k-pruned_50
Text Generation
• 7B • Updated • 13
• 1
RedHatAI/Llama-2-7b-gsm8k-pruned_70
Text Generation
• 7B • Updated • 11
mradermacher/Llama-2-7b-pruned70-retrained-gsm8k-GGUF
7B • Updated • 762
RedHatAI/SparseLlama-3-8B-pruned_50.2of4
Text Generation
• 8B • Updated • 10
vuiseng9/ov-mpt-7b-gsm8k-sparse70
Text Generation
• Updated
opensearch-project/opensearch-neural-sparse-encoding-v2-distill
Feature Extraction
• 67M • Updated • 75.1k
• 10
opensearch-project/opensearch-neural-sparse-encoding-doc-v2-distill
Feature Extraction
• 67M • Updated • 589k
• 19
opensearch-project/opensearch-neural-sparse-encoding-doc-v2-mini
Feature Extraction
• 22.7M • Updated • 1.48k
• 6
mradermacher/Nous-Hermes-2-SOLAR-10.7B-pruned2.4-GGUF
11B • Updated • 312
mradermacher/Nous-Hermes-2-SOLAR-10.7B-pruned2.4-i1-GGUF
11B • Updated • 503
tensorblock/llama2.c-stories110M-pruned50-GGUF
0.1B • Updated • 37
tensorblock/Llama-2-7b-pruned50-retrained-GGUF
Text Generation
• 7B • Updated • 10
mradermacher/phi-2-pruned50-GGUF
3B • Updated • 61
mradermacher/llama2.c-stories110M-pruned50-GGUF
0.1B • Updated • 48