Custom GGUF quants of Metaβs Llama-3.2-Instruct's finetunes, where the Output Tensors are quantized to Q8_0 or F32 and the Embeddings are kept @F32
Joseph
Joseph717171
AI & ML interests
None yet
Recent Activity
upvoted an article about 9 hours ago
Kog Laneformer 2B: The Latency-First Model Behind Kog Inference Engine liked a model 1 day ago
Glint-Research/QKVAE-1M-1.3 liked a model 4 days ago
TeichAI/Qwen3.5-9B-Fable-5-v1