Code generation, debugging, review, and test writing. All deployable privately on dedicated GPUs at hexgrid.cloud
AI & ML interests
Deploy open-source LLMs like Llama, Qwen, Gemma, Mistral, and DeepSeek as production-ready OpenAI-compatible APIs.
Recent Activity
View all activity
FP8, AWQ-4Bit and W8A8 quantized versions of popular models. Lower VRAM, same production quality. Deploy at hexgrid.cloud in one click.
-
cyankiwi/Qwen3.5-9B-AWQ-4bit
Image-Text-to-Text • 10B • Updated • 758k • 33 -
lovedheart/Qwen3.5-9B-FP8
Image-Text-to-Text • 10B • Updated • 74.5k • 14 -
cyankiwi/Qwen3.5-27B-AWQ-4bit
Image-Text-to-Text • 29B • Updated • 558k • 40 -
RedHatAI/gemma-4-31B-it-FP8-block
Image-Text-to-Text • 31B • Updated • 3.36M • 39
Every model deployable on HexGrid Cloud with one click. Dedicated GPU, private API endpoint, OpenAI-compatible. Visit https://hexgrid.cloud
-
meta-llama/Llama-3.1-8B-Instruct
Text Generation • 8B • Updated • 9.29M • • 6.22k -
meta-llama/Llama-3.3-70B-Instruct
Text Generation • 71B • Updated • 741k • • 2.87k -
google/gemma-4-31B-it
Image-Text-to-Text • 33B • Updated • 11.2M • • 3.12k -
Qwen/Qwen3.5-9B
Image-Text-to-Text • 10B • Updated • 8.84M • • 1.66k
Code generation, debugging, review, and test writing. All deployable privately on dedicated GPUs at hexgrid.cloud
FP8, AWQ-4Bit and W8A8 quantized versions of popular models. Lower VRAM, same production quality. Deploy at hexgrid.cloud in one click.
-
cyankiwi/Qwen3.5-9B-AWQ-4bit
Image-Text-to-Text • 10B • Updated • 758k • 33 -
lovedheart/Qwen3.5-9B-FP8
Image-Text-to-Text • 10B • Updated • 74.5k • 14 -
cyankiwi/Qwen3.5-27B-AWQ-4bit
Image-Text-to-Text • 29B • Updated • 558k • 40 -
RedHatAI/gemma-4-31B-it-FP8-block
Image-Text-to-Text • 31B • Updated • 3.36M • 39
The complete open-source RAG pipeline. Best of the embedding models, one reranker, one chat model. All deployable on dedicated GPUs at hexgrid.cloud
Every model deployable on HexGrid Cloud with one click. Dedicated GPU, private API endpoint, OpenAI-compatible. Visit https://hexgrid.cloud
-
meta-llama/Llama-3.1-8B-Instruct
Text Generation • 8B • Updated • 9.29M • • 6.22k -
meta-llama/Llama-3.3-70B-Instruct
Text Generation • 71B • Updated • 741k • • 2.87k -
google/gemma-4-31B-it
Image-Text-to-Text • 33B • Updated • 11.2M • • 3.12k -
Qwen/Qwen3.5-9B
Image-Text-to-Text • 10B • Updated • 8.84M • • 1.66k