matrixportal/X-Ray_Alpha-GGUF

This model was converted to GGUF format from SicariusSicariiStuff/X-Ray_Alpha using llama.cpp via the ggml.ai's all-gguf-same-where space. Refer to the original model card for more details on the model.

โœ… Quantized Models Download List

๐Ÿ” Recommended Quantizations

  • โœจ General CPU Use: Q4_K_M (Best balance of speed/quality)
  • ๐Ÿ“ฑ ARM Devices: Q4_0 (Optimized for ARM CPUs)
  • ๐Ÿ† Maximum Quality: Q8_0 (Near-original quality)

๐Ÿ“ฆ Full Quantization Options

๐Ÿš€ Download ๐Ÿ”ข Type ๐Ÿ“ Notes
Download Q2_K Basic quantization
Download Q3_K_S Small size
Download Q3_K_M Balanced quality
Download Q3_K_L Better quality
Download Q4_0 Fast on ARM
Download Q4_K_S Fast, recommended
Download Q4_K_M โญ Best balance
Download Q5_0 Good quality
Download Q5_K_S Balanced
Download Q5_K_M High quality
Download Q6_K ๐Ÿ† Very good quality
Download Q8_0 โšก Fast, best quality
Download F16 Maximum accuracy
Download mmproj Multimodal projection file for image processing

๐Ÿ’ก Pro Tip: Start with Q4_K_M for most use cases, only use F16 if you need maximum precision.

Downloads last month
227
GGUF
Model size
4B params
Architecture
gemma3
Hardware compatibility
Log In to add your hardware

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for matrixportalx/X-Ray_Alpha-GGUF

Quantized
(12)
this model