roshniramesh
's Collections
int4 llm
updated
Text Generation
•
Updated
•
25
•
1
nvidia/Gemma-2b-it-ONNX-INT4
nvidia/Meta-Llama-3.1-8B-Instruct-ONNX-INT4
Updated
•
21
•
6
nvidia/Meta-Llama-3.2-3B-Instruct-ONNX-INT4
nvidia/Phi-3.5-mini-Instruct-ONNX-INT4
nvidia/Mistral-Nemo-12B-Instruct-ONNX-INT4
nvidia/Nemotron-Mini-4B-Instruct-ONNX-INT4
meta-llama/Llama-3.2-1B-Instruct-SpinQuant_INT4_EO8
Text Generation
•
Updated
•
76
•
38
hugging-quants/gemma-2-9b-it-AWQ-INT4
Text Generation
•
9B
•
Updated
•
2.83k
•
7
Qwen/Qwen2-7B-Instruct-GPTQ-Int4
Text Generation
•
8B
•
Updated
•
670
•
29
hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4
Text Generation
•
8B
•
Updated
•
490k
•
86
RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w4a16
Text Generation
•
8B
•
Updated
•
28.7k
•
30
ModelCloud/Meta-Llama-3.1-8B-gptq-4bit
Text Generation
•
8B
•
Updated
•
86
hugging-quants/Llama-3.2-3B-Instruct-Q4_K_M-GGUF
Text Generation
•
3B
•
Updated
•
18.1k
•
25
hugging-quants/Meta-Llama-3.1-70B-Instruct-AWQ-INT4
Text Generation
•
71B
•
Updated
•
76.8k
•
107
hugging-quants/Llama-3.2-1B-Instruct-Q4_K_M-GGUF
Text Generation
•
1B
•
Updated
•
34.3k
•
19
hugging-quants/Meta-Llama-3.1-70B-Instruct-GPTQ-INT4
Text Generation
•
71B
•
Updated
•
1.01k
•
23
hugging-quants/Meta-Llama-3.1-8B-Instruct-GPTQ-INT4
Text Generation
•
8B
•
Updated
•
19.4k
•
41
meta-llama/Llama-Guard-3-1B-INT4
Text Generation
•
Updated
•
13
•
27
meta-llama/Llama-3.2-3B-Instruct-QLORA_INT4_EO8
Text Generation
•
Updated
•
72
•
71
meta-llama/Llama-3.2-3B-Instruct-SpinQuant_INT4_EO8
Text Generation
•
Updated
•
76
•
37
meta-llama/Llama-3.2-1B-Instruct-QLORA_INT4_EO8
Text Generation
•
Updated
•
116
•
48
RedHatAI/Mistral-7B-Instruct-v0.3-GPTQ-4bit
Text Generation
•
7B
•
Updated
•
17.9k
•
23
RedHatAI/Mistral-7B-Instruct-v0.3-quantized.w4a16
Text Generation
•
7B
•
Updated
•
128
•
2
RedHatAI/Llama-2-7b-chat-quantized.w4a16
Text Generation
•
7B
•
Updated
•
23
RedHatAI/Meta-Llama-3-8B-Instruct-quantized.w4a16
Text Generation
•
8B
•
Updated
•
58
•
2
RedHatAI/Meta-Llama-3-70B-Instruct-quantized.w4a16
Text Generation
•
71B
•
Updated
•
194
•
2
RedHatAI/gemma-2-2b-it-quantized.w4a16
Text Generation
•
1B
•
Updated
•
49
•
1
RedHatAI/gemma-2-9b-it-quantized.w4a16
Text Generation
•
3B
•
Updated
•
83
•
2
RedHatAI/Mistral-Nemo-Instruct-2407-quantized.w4a16
Text Generation
•
3B
•
Updated
•
1.33k
•
4
RedHatAI/Meta-Llama-3.1-70B-Instruct-quantized.w4a16
Text Generation
•
71B
•
Updated
•
3.02k
•
32
nvidia/Mistral-7B-Instruct-v0.3-ONNX-INT4
OpenVINO/mistral-7b-instruct-v0.1-int4-ov
Text Generation
•
Updated
•
4
OpenVINO/Mistral-7B-Instruct-v0.2-int4-ov
Text Generation
•
Updated
•
614
•
1
Text Generation
•
72B
•
Updated
•
130
•
47
Text Generation
•
14B
•
Updated
•
113
•
100
Text Generation
•
8B
•
Updated
•
565
•
75
Text Generation
•
2B
•
Updated
•
314
•
36
Qwen/Qwen1.5-110B-Chat-GPTQ-Int4
Text Generation
•
111B
•
Updated
•
57.6k
•
18
Qwen/Qwen1.5-1.8B-Chat-GPTQ-Int4
Text Generation
•
2B
•
Updated
•
90
•
7
Qwen/Qwen1.5-MoE-A2.7B-Chat-GPTQ-Int4
Text Generation
•
14B
•
Updated
•
744
•
50
Qwen/Qwen1.5-4B-Chat-GPTQ-Int4
Text Generation
•
4B
•
Updated
•
87
•
6
Qwen/Qwen1.5-72B-Chat-GPTQ-Int4
Text Generation
•
72B
•
Updated
•
1.76k
•
37
Qwen/Qwen1.5-4B-Chat-GGUF
Text Generation
•
4B
•
Updated
•
716
•
16
Qwen/Qwen1.5-0.5B-Chat-GGUF
Text Generation
•
0.6B
•
Updated
•
4.7k
•
35
Qwen/Qwen1.5-7B-Chat-GGUF
Text Generation
•
8B
•
Updated
•
2.72k
•
70
Qwen/CodeQwen1.5-7B-Chat-GGUF
Text Generation
•
7B
•
Updated
•
813
•
109
Qwen/Qwen2.5-1.5B-Instruct-GPTQ-Int4
Text Generation
•
2B
•
Updated
•
762
•
3
Qwen/Qwen2.5-0.5B-Instruct-GPTQ-Int4
Text Generation
•
0.5B
•
Updated
•
438
•
9
Qwen/Qwen2.5-0.5B-Instruct-GGUF
Text Generation
•
0.6B
•
Updated
•
43.2k
•
72
Qwen/Qwen2-1.5B-Instruct-GGUF
Text Generation
•
2B
•
Updated
•
6.44k
•
27
Qwen/Qwen2-0.5B-Instruct-GGUF
Text Generation
•
0.5B
•
Updated
•
16.2k
•
71
Qwen/Qwen2-7B-Instruct-GGUF
Text Generation
•
8B
•
Updated
•
4.25k
•
177
Qwen/Qwen2-0.5B-Instruct-GPTQ-Int4
Text Generation
•
0.6B
•
Updated
•
79
•
15
Qwen/Qwen2-1.5B-Instruct-GPTQ-Int4
Text Generation
•
2B
•
Updated
•
18.5k
•
5
Qwen/Qwen2-72B-Instruct-GPTQ-Int4
Text Generation
•
73B
•
Updated
•
58
•
33
Qwen/Qwen2-57B-A14B-Instruct-GPTQ-Int4
Text Generation
•
57B
•
Updated
•
149
•
23