roshniramesh
's Collections
Text Generation
•
Updated
•
25
•
1
nvidia/Gemma-2b-it-ONNX-INT4
nvidia/Meta-Llama-3.1-8B-Instruct-ONNX-INT4
Updated
•
41
•
6
nvidia/Meta-Llama-3.2-3B-Instruct-ONNX-INT4
nvidia/Phi-3.5-mini-Instruct-ONNX-INT4
nvidia/Mistral-Nemo-12B-Instruct-ONNX-INT4
nvidia/Nemotron-Mini-4B-Instruct-ONNX-INT4
meta-llama/Llama-3.2-1B-Instruct-SpinQuant_INT4_EO8
Text Generation
•
Updated
•
85
•
37
hugging-quants/gemma-2-9b-it-AWQ-INT4
Text Generation
•
9B
•
Updated
•
8.53k
•
7
Qwen/Qwen2-7B-Instruct-GPTQ-Int4
Text Generation
•
8B
•
Updated
•
615
•
29
hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4
Text Generation
•
8B
•
Updated
•
194k
•
82
RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w4a16
Text Generation
•
8B
•
Updated
•
17.9k
•
30
ModelCloud/Meta-Llama-3.1-8B-gptq-4bit
Text Generation
•
8B
•
Updated
•
96
hugging-quants/Llama-3.2-3B-Instruct-Q4_K_M-GGUF
Text Generation
•
3B
•
Updated
•
22.9k
•
26
hugging-quants/Meta-Llama-3.1-70B-Instruct-AWQ-INT4
Text Generation
•
71B
•
Updated
•
169k
•
107
hugging-quants/Llama-3.2-1B-Instruct-Q4_K_M-GGUF
Text Generation
•
1B
•
Updated
•
32k
•
18
hugging-quants/Meta-Llama-3.1-70B-Instruct-GPTQ-INT4
Text Generation
•
71B
•
Updated
•
974
•
23
hugging-quants/Meta-Llama-3.1-8B-Instruct-GPTQ-INT4
Text Generation
•
8B
•
Updated
•
7.52k
•
40
meta-llama/Llama-Guard-3-1B-INT4
Text Generation
•
Updated
•
17
•
27
meta-llama/Llama-3.2-3B-Instruct-QLORA_INT4_EO8
Text Generation
•
Updated
•
168
•
70
meta-llama/Llama-3.2-3B-Instruct-SpinQuant_INT4_EO8
Text Generation
•
Updated
•
230
•
37
meta-llama/Llama-3.2-1B-Instruct-QLORA_INT4_EO8
Text Generation
•
Updated
•
116
•
45
RedHatAI/Mistral-7B-Instruct-v0.3-GPTQ-4bit
Text Generation
•
7B
•
Updated
•
168k
•
23
RedHatAI/Mistral-7B-Instruct-v0.3-quantized.w4a16
Text Generation
•
7B
•
Updated
•
121
•
2
RedHatAI/Llama-2-7b-chat-quantized.w4a16
Text Generation
•
7B
•
Updated
•
52
RedHatAI/Meta-Llama-3-8B-Instruct-quantized.w4a16
Text Generation
•
8B
•
Updated
•
40
•
2
RedHatAI/Meta-Llama-3-70B-Instruct-quantized.w4a16
Text Generation
•
71B
•
Updated
•
121
•
2
RedHatAI/gemma-2-2b-it-quantized.w4a16
Text Generation
•
1B
•
Updated
•
147
•
1
RedHatAI/gemma-2-9b-it-quantized.w4a16
Text Generation
•
3B
•
Updated
•
77
•
2
RedHatAI/Mistral-Nemo-Instruct-2407-quantized.w4a16
Text Generation
•
3B
•
Updated
•
114
•
4
RedHatAI/Meta-Llama-3.1-70B-Instruct-quantized.w4a16
Text Generation
•
71B
•
Updated
•
1.46k
•
32
nvidia/Mistral-7B-Instruct-v0.3-ONNX-INT4
OpenVINO/mistral-7b-instruct-v0.1-int4-ov
Text Generation
•
Updated
•
39
OpenVINO/Mistral-7B-Instruct-v0.2-int4-ov
Text Generation
•
Updated
•
2.06k
•
1
Text Generation
•
72B
•
Updated
•
92
•
46
Text Generation
•
14B
•
Updated
•
148
•
100
Text Generation
•
8B
•
Updated
•
647
•
75
Text Generation
•
2B
•
Updated
•
417
•
36
Qwen/Qwen1.5-110B-Chat-GPTQ-Int4
Text Generation
•
111B
•
Updated
•
3.73k
•
18
Qwen/Qwen1.5-1.8B-Chat-GPTQ-Int4
Text Generation
•
2B
•
Updated
•
421
•
7
Qwen/Qwen1.5-MoE-A2.7B-Chat-GPTQ-Int4
Text Generation
•
14B
•
Updated
•
1.27k
•
48
Qwen/Qwen1.5-4B-Chat-GPTQ-Int4
Text Generation
•
4B
•
Updated
•
1.58k
•
6
Qwen/Qwen1.5-72B-Chat-GPTQ-Int4
Text Generation
•
72B
•
Updated
•
2.62k
•
37
Qwen/Qwen1.5-4B-Chat-GGUF
Text Generation
•
4B
•
Updated
•
2.58k
•
16
Qwen/Qwen1.5-0.5B-Chat-GGUF
Text Generation
•
0.6B
•
Updated
•
6.15k
•
35
Qwen/Qwen1.5-7B-Chat-GGUF
Text Generation
•
8B
•
Updated
•
5.17k
•
70
Qwen/CodeQwen1.5-7B-Chat-GGUF
Text Generation
•
7B
•
Updated
•
787
•
109
Qwen/Qwen2.5-1.5B-Instruct-GPTQ-Int4
Text Generation
•
2B
•
Updated
•
3.78k
•
2
Qwen/Qwen2.5-0.5B-Instruct-GPTQ-Int4
Text Generation
•
0.5B
•
Updated
•
955
•
8
Qwen/Qwen2.5-0.5B-Instruct-GGUF
Text Generation
•
0.6B
•
Updated
•
44k
•
58
Qwen/Qwen2-1.5B-Instruct-GGUF
Text Generation
•
2B
•
Updated
•
7.64k
•
27
Qwen/Qwen2-0.5B-Instruct-GGUF
Text Generation
•
0.5B
•
Updated
•
19k
•
69
Qwen/Qwen2-7B-Instruct-GGUF
Text Generation
•
8B
•
Updated
•
3.46k
•
177
Qwen/Qwen2-0.5B-Instruct-GPTQ-Int4
Text Generation
•
0.6B
•
Updated
•
72
•
15
Qwen/Qwen2-1.5B-Instruct-GPTQ-Int4
Text Generation
•
2B
•
Updated
•
8.41k
•
5
Qwen/Qwen2-72B-Instruct-GPTQ-Int4
Text Generation
•
73B
•
Updated
•
148
•
33
Qwen/Qwen2-57B-A14B-Instruct-GPTQ-Int4
Text Generation
•
57B
•
Updated
•
476
•
23