Working Embeddings model Quantized models for llm on prem RnD rpreite/Qwen3-Embedding-8B-INT4-W4A16 Text Generation • 2B • Updated Aug 8, 2025 • 288 • 2 rpreite/Qwen3-Embedding-4B-INT4-W4A16 Text Generation • 0.9B • Updated Aug 8, 2025 • 30 • 1
Working Generation models Quantized models for llm on prem RnD rpreite/Qwen3-14B-BNB-INT4 Text Generation • 15B • Updated Aug 27, 2025 • 3 rpreite/gemma-3-12b-it-BNB-INT4 Image-Text-to-Text • Updated Sep 1, 2025 • 1 rpreite/Gemma3_GPTQ_W4A16 4B • Updated Sep 15, 2025 • 1 rpreite/Qwen3_GPTQ_W4A16 3B • Updated Sep 15, 2025 • 1
Working Embeddings model Quantized models for llm on prem RnD rpreite/Qwen3-Embedding-8B-INT4-W4A16 Text Generation • 2B • Updated Aug 8, 2025 • 288 • 2 rpreite/Qwen3-Embedding-4B-INT4-W4A16 Text Generation • 0.9B • Updated Aug 8, 2025 • 30 • 1
Working Generation models Quantized models for llm on prem RnD rpreite/Qwen3-14B-BNB-INT4 Text Generation • 15B • Updated Aug 27, 2025 • 3 rpreite/gemma-3-12b-it-BNB-INT4 Image-Text-to-Text • Updated Sep 1, 2025 • 1 rpreite/Gemma3_GPTQ_W4A16 4B • Updated Sep 15, 2025 • 1 rpreite/Qwen3_GPTQ_W4A16 3B • Updated Sep 15, 2025 • 1