AI & ML interests

None defined yet.

danielhanchenΒ 
posted an update 9 days ago
view post
Post
3012
1-bit GLM-5.2 GGUF vs. Claude 4.8 Opus vs. GPT-5.5

We gave 3 models the same prompt and compared one-shot outputs.

The 1-bit GLM-5.2 GGUF ran locally on a Mac Studio M3 Ultra with 256GB RAM at ~21.6 tok/s.

Which output do you like best?
GGUF: unsloth/GLM-5.2-GGUF
  • 3 replies
Β·
danielhanchenΒ 
posted an update 16 days ago
danielhanchenΒ 
posted an update 22 days ago
danielhanchenΒ 
posted an update 24 days ago
danielhanchenΒ 
posted an update 28 days ago
view post
Post
9248
Gemma 4 12B can now run locally on just 8GB RAM via Dynamic GGUFs.

Google's new model, Gemma 4 12B Unified supports image, audio and 256K context.
You can run and train the model via Unsloth Studio.

GGUF: unsloth/gemma-4-12b-it-GGUF
Guide: https://unsloth.ai/docs/models/gemma-4
  • 5 replies
Β·
mahwizzzzΒ 
posted an update about 1 month ago
view post
Post
215
Released lafzyn , built over Qwen, an Urdu language model that converts Urdu text into IPA phonetic transcription, with GGUF builds for local inference.

Release contents:
- mahwizzzz/lafzyn: full weights
- mahwizzzz/lafzyn-gguf: quantized builds

Try it out πŸ€—
Demo: https://huggingface.co/spaces/mahwizzzz/Lafzyn
danielhanchenΒ 
posted an update about 1 month ago
danielhanchenΒ 
posted an update about 2 months ago
view post
Post
5961
We’re excited to announce that Unsloth has joined the PyTorch Ecosystem! πŸ”₯πŸ¦₯

Unsloth is an open-source project that makes training & running models more accurate and faster with less compute. Our mission is to make local AI accessible to everyone. Thanks to all of you for making this possible! πŸ’•

Blog: https://unsloth.ai/blog/pytorch
GitHub: https://github.com/unslothai/unsloth
  • 2 replies
Β·
danielhanchenΒ 
posted an update about 2 months ago
view post
Post
7769
We collaborated with NVIDIA to teach you how we made LLM training ~25% faster! πŸš€

Learn how 3 optimizations help your home GPU train models faster:
1. Packed-sequence metadata caching
2. Double-buffered checkpoint reloads
3. Faster MoE routing

Guide: https://unsloth.ai/blog/nvidia-collab
GitHub: https://github.com/unslothai/unsloth
danielhanchenΒ 
posted an update about 2 months ago
view post
Post
8910
We made a guide on how to run open LLMs in Claude Code, Codex and OpenClaw.

Use Gemma 4 and Qwen3.6 GGUFs for local agentic coding on 24GB RAM

Run with self-healing tool calls, code execution, web search via the Unsloth API endpoint and llama.cpp

Guide: https://unsloth.ai/docs/basics/api
danielhanchenΒ 
posted an update 2 months ago
view post
Post
10853
Unsloth is now one of the top 10 most followed organizations on Hugging Face. πŸ€—πŸ¦₯

Thanks so much for all the support!
Our HF page:
unsloth
  • 5 replies
Β·
danielhanchenΒ 
posted an update 2 months ago
danielhanchenΒ 
posted an update 3 months ago
danielhanchenΒ 
posted an update 3 months ago
danielhanchenΒ 
posted an update 3 months ago
danielhanchenΒ 
posted an update 3 months ago
view post
Post
2814
A new way to use Unsloth.

Coming soon...
danielhanchenΒ 
posted an update 3 months ago
view post
Post
955
You don’t need to set LLM parameters anymore! πŸš€

llama.cpp uses only the context length + compute your local setup needs. Unsloth also auto-applies the correct model settings

Try in Unsloth Studio - now with precompiled llama.cpp binaries.

GitHub: https://github.com/unslothai/unsloth
  • 2 replies
Β·
danielhanchenΒ 
posted an update 4 months ago
view post
Post
3448
Introducing Unsloth Studio ✨
A new open-source web UI to train and run LLMs.

β€’ Run models locally on Mac, Windows, Linux
β€’ Train 500+ models 2x faster with 70% less VRAM
β€’ Supports GGUF, vision, audio, embedding models
β€’ Auto-create datasets from PDF, CSV, DOCX
β€’ Self-healing tool calling and code execution
β€’ Compare models side by side + export to GGUF

GitHub: https://github.com/unslothai/unsloth
Blog and Guide: https://unsloth.ai/docs/new/studio

Available now on Hugging Face, NVIDIA, Docker and Colab.