ibm-granite/granite-4.0-micro
Text Generation • 3B • Updated • 31.9k • 274
4b is the new 7b, i was meant to do sub-3b and sub-4b collections, but in the end i decided to combine into a single collection of text-to-text LLMs
Note traditional transformers model
Note h stands for hybrid, uses transformers and mamba
Note better than 2511
Note literally the end-game 4b parameter model.
Note literally the end-game 4b parameter model. Now with reasoning!
Note wow! a tiny MoE!