BitCPM-CANN Collection Full-pipeline ternary quantized model trained on CANN. • 12 items • Updated May 24 • 28
MOOSE-Star Models & Data Collection Models and data for scientific discovery from MOOSE-Star (arXiv:2603.03756). IR and HC models, paper decomposition and SFT data. • 5 items • Updated Apr 5 • 5
Qwen 3.6 UDT MTP Collection Dynamic-imatrix GGUF quants of Qwen 3.6 27B & 35B-A3B. TurboQuant3 KV + shared-model NextN ready. • 2 items • Updated May 14 • 4
Granite 4.1 Language Models Collection Efficient language models for multilingual generation, coding, RAG, and AI assistant workflows. • 6 items • Updated Apr 29 • 61
Granite Speech Collection Multilingual ASR and speech-to-text (STT) models for enterprise transcription and translation. • 8 items • Updated 14 days ago • 34
Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published Oct 6, 2025 • 517
Introspective Diffusion Language Models (I-DLM) Collection Model checkpoints for I-DLM. Paper: https://arxiv.org/abs/2604.11035 • 3 items • Updated Apr 14 • 11
Gemma 4 Uncensored Collection Abliterated Gemma 4 models with refusal behavior removed. Biprojection + EGA for MoE. Cross-validated against 686 prompts from 4 datasets. • 10 items • Updated 17 days ago • 100