Rafael Medeiros

RafaelOM

1

·

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago

yuxinlu1/gemma-4-12B-coder-fable5-composer2.5-v1-GGUF

repliedto danielhanchen's post about 2 months ago

Gemma 4 12B can now run locally on just 8GB RAM via Dynamic GGUFs. Google's new model, Gemma 4 12B Unified supports image, audio and 256K context. You can run and train the model via Unsloth Studio. GGUF: https://huggingface.co/unsloth/gemma-4-12b-it-GGUF Guide: https://unsloth.ai/docs/models/gemma-4

View all activity

Organizations

None yet

liked a model about 1 month ago

yuxinlu1/gemma-4-12B-coder-fable5-composer2.5-v1-GGUF

Text Generation • 12B • Updated Jun 19 • 267k • 2.77k

replied to danielhanchen's post about 2 months ago

I tested this https://huggingface.co/unsloth/gemma-4-12B-it-qat-GGUF on an RTX 4060 (8GB VRAM) using https://github.com/AtomicBot-ai/atomic-llama-cpp-turboquant, and it worked perfectly. I even used the assistant for MTP https://huggingface.co/Janvitos/gemma-4-12B-it-qat-assistant-MTP-Q8_0-GGUF/tree/main and everything loaded into VRAM.