TeichAI
/

Qwen3-4B-Thinking-MiniMax-M2.1-Coder

Text Generation

text-generation-inference

Model card Files Files and versions

armand0e commited on 18 days ago

Commit

0c94c2c

·

verified ·

1 Parent(s): eba1154

Update README.md

Files changed (1) hide show

README.md +8 -8

README.md CHANGED Viewed

@@ -1,21 +1,21 @@
 ---
-base_model: unsloth/qwen3-4b-thinking-2507-unsloth-bnb-4bit
 tags:
 - text-generation-inference
 - transformers
 - unsloth
 - qwen3
-license: apache-2.0
 language:
 - en
 ---
-# Uploaded finetuned  model
-- **Developed by:** TeichAI
-- **License:** apache-2.0
-- **Finetuned from model :** unsloth/qwen3-4b-thinking-2507-unsloth-bnb-4bit
-This qwen3 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
-[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)

 ---
+base_model: unsloth/Qwen3-4B-Thinking-2507
 tags:
 - text-generation-inference
 - transformers
 - unsloth
 - qwen3
 language:
 - en
+datasets:
+- TeichAI/MiniMax-M2.1-Code-SFT
 ---
+# Qwen3 4B Thinking x MiniMax M2.1 Code SFT
+This model was trained on over 1,300 agentic "vibe coding" examples generated by MiniMax M2.1 with a large majority focused on extracting UI/UX design capabilities across different tech stacks.
+For more info on how and what the model was trained on, please view [the dataset card](https://huggingface.co/datasets/TeichAI/Gemini-3-Flash-Preview-VIBE)
+---
+This qwen3 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.