Qwen3 4B Thinking x MiniMax M2.1 Code SFT

This model was trained on over 1,300 agentic "vibe coding" examples generated by MiniMax M2.1 with a large majority focused on extracting UI/UX design capabilities across different tech stacks.

For more info on how and what the model was trained on, please view the dataset card


This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

Downloads last month
1,402
GGUF
Model size
4B params
Architecture
qwen3
Hardware compatibility
Log In to add your hardware

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for TeichAI/Qwen3-4B-Thinking-MiniMax-M2.1-Coder-GGUF

Dataset used to train TeichAI/Qwen3-4B-Thinking-MiniMax-M2.1-Coder-GGUF