AI & ML interests

On-device AI, GGUF quantization, Apple Silicon, macOS automation

Recent Activity

hero775ย  updated a model 1 day ago
batiai/DeepSeek-V4-Pro-GGUF
hero775ย  updated a collection 3 days ago
๐Ÿš€ Frontier MoE โ€” 128Bโ€“1T
View all activity

batiai 's collections 6

๐Ÿš€ Frontier MoE โ€” 128Bโ€“1T
Largest open-weight LLMs, BatiAI-quantized. Mac-runnable from M4 Max 128GB to Mac Studio M3 Ultra 512GB.
๐ŸŽ Gemma 4 โ€” Google's Latest
Gemma 4 quantizations from Google's official weights. Best entry for 16GB Mac mini M4 (E4B Q4 = 57 t/s).
BatiAI RAG Stack
Complete Mac-first on-device RAG stack โ€” chat LLM + reranker + text/VL embedder, direct from BF16, BatiAI-signed. For BatiFlow.
๐Ÿง  NVIDIA Nemotron 3 โ€” Hybrid Mamba+Attention
NVIDIA Nemotron 3 family โ€” NemotronH architecture combining Mamba state-space + standard attention. Mac-runnable, BatiAI-quantized + signed.
๐Ÿš€ Frontier MoE โ€” 128Bโ€“1T
Largest open-weight LLMs, BatiAI-quantized. Mac-runnable from M4 Max 128GB to Mac Studio M3 Ultra 512GB.
๐ŸŽ Gemma 4 โ€” Google's Latest
Gemma 4 quantizations from Google's official weights. Best entry for 16GB Mac mini M4 (E4B Q4 = 57 t/s).
BatiAI RAG Stack
Complete Mac-first on-device RAG stack โ€” chat LLM + reranker + text/VL embedder, direct from BF16, BatiAI-signed. For BatiFlow.