---
license: apache-2.0
tags:
- merge
- mergekit
- lazymergekit
- ZeroXClem/Qwen3-4B-MiniMight
- MiniMight
- ZeroXClem
- Code
- Cursor
base_model:
- TeichAI/Qwen3-4B-Thinking-2507-Claude-4.5-Opus-High-Reasoning-Distill
- bunnycore/Qwen3-4B-Mini-Merge
- bunnycore/Qwen3-4B-MegaMerge
- bunnycore/Qwen3-4B-Max-Merge
- ertghiu256/qwen3-4b-claude-sonnet-x-gemini-reasoning
- ZeroXClem/Qwen3-4B-Hermes-Axion-Pro
- Qwen/Qwen3-4B-Thinking-2507
pipeline_tag: text-generation
library_name: transformers
---
## 🧠 ZeroXClem/Qwen3-4B-MiniMight
![MiniMight](https://cdn-uploads.huggingface.co/production/uploads/64408cd43e0374802e19f454/Ahz0EEdsPBjHl3MCDH4tS.png)
> **"Small in scale. Mighty in mind."**
> A beautifully blended 4B model with *262k context length*, fusing *deep reasoning*, *code*, *safety*, and *creativity* — distilled through MergeKit's `model_stock` magic.
---
## 🔧 Merge Configuration
```yaml
name: ZeroXClem/Qwen3-4B-MiniMight
base_model: Qwen/Qwen3-4B-Thinking-2507
dtype: bfloat16
merge_method: model_stock
models:
- model: TeichAI/Qwen3-4B-Thinking-2507-Claude-4.5-Opus-High-Reasoning-Distill
- model: bunnycore/Qwen3-4B-Mini-Merge
- model: bunnycore/Qwen3-4B-MegaMerge
- model: bunnycore/Qwen3-4B-Max-Merge
- model: ertghiu256/qwen3-4b-claude-sonnet-x-gemini-reasoning
- model: ZeroXClem/Qwen3-4B-Hermes-Axion-Pro
tokenizer_source: Qwen/Qwen3-4B-Thinking-2507
```
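For intuition, `model_stock` averages the fine-tuned checkpoints and interpolates that average back toward the base model, with the interpolation ratio derived from the angle between the fine-tuned weight deltas. A toy per-tensor NumPy sketch of the idea (an illustration only, not mergekit's actual implementation):

```python
import numpy as np

def model_stock_merge(base, finetuned):
    """Toy per-tensor sketch of the model_stock merge idea:
    interpolate between the base weights and the average of the
    fine-tuned weights, with ratio t derived from the mean angle
    between the fine-tuned deltas."""
    deltas = [m - base for m in finetuned]
    k = len(deltas)
    # Mean pairwise cosine similarity between the weight deltas
    cos = [
        (deltas[i].ravel() @ deltas[j].ravel())
        / (np.linalg.norm(deltas[i]) * np.linalg.norm(deltas[j]))
        for i in range(k)
        for j in range(i + 1, k)
    ]
    cos_t = float(np.mean(cos))
    # Closely aligned deltas (cos_t -> 1) pull the merge toward the average
    t = k * cos_t / ((k - 1) * cos_t + 1)
    avg = np.mean(finetuned, axis=0)
    return t * avg + (1 - t) * base
```

In the config above, all six fine-tunes are merged this way per tensor against the `Qwen3-4B-Thinking-2507` base.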
---
## 🌟 Highlights
* 🧠 **Reasoning-Centric Intelligence** — Optimized for multi-step thought, explanation, STEM logic, and symbolic problem solving.
* 🪐 **262,144 Token Context Window** — Massive memory span for long documents, complex chains, or multi-turn workflows.
* 🧬 **Deep Merge of Code, RP, and Claude/Gemini Logic** — Captures both fluency and fidelity in multi-domain generation.
* 🔐 **Safe Alignment from Hermes & Axion** — Includes red-teamed safety-conscious merges to guide outputs.
* ✍️ **Elegant Dialogue & Creative Thought** — Engages in immersive roleplay, structured writing, and storytelling.
* ⚡ **Efficient 4B Inference** — Small enough for local GPUs, quantized formats, and mobile deployments.
---
## 🧪 Usage Example
```python
from transformers import AutoTokenizer, AutoModelForCausalLM

# Load the merged model and its tokenizer
model = AutoModelForCausalLM.from_pretrained(
    "ZeroXClem/Qwen3-4B-MiniMight", device_map="auto", torch_dtype="auto"
)
tokenizer = AutoTokenizer.from_pretrained("ZeroXClem/Qwen3-4B-MiniMight")

# Build a chat prompt using the model's chat template
messages = [{"role": "user", "content": "Explain quantum tunneling with an example."}]
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)

# Generate and decode the response
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=384)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```
🧠 *Use `enable_thinking=True` or `/think` with compatible frontends for enhanced reasoning control.*
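Qwen3 Thinking-style models emit their reasoning trace before a closing `</think>` marker, so downstream apps often want to separate the chain of thought from the final answer. A minimal helper for that, assuming the decoded text contains at most one `</think>` marker (a sketch, not part of any official API):

```python
def split_thinking(decoded: str, marker: str = "</think>"):
    """Split a decoded Thinking-model response into (reasoning, answer).

    If no marker is present, the whole text is treated as the answer.
    """
    if marker in decoded:
        reasoning, answer = decoded.split(marker, 1)
        return reasoning.strip(), answer.strip()
    return "", decoded.strip()
```

Call it on the `tokenizer.decode(...)` output above to keep only the final answer in user-facing UIs.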
---
## πŸ” Models Merged
| Source Model | Specialization |
| ----------------------------------------------------------------------- | ----------------------------------------------- |
| `TeichAI/Qwen3-4B-Thinking-2507-Claude-4.5-Opus-High-Reasoning-Distill` | High reasoning distilled from Claude Opus |
| `bunnycore/Qwen3-4B-Mini-Merge` | Mini-coder, Qwaifu, Darwin blend |
| `bunnycore/Qwen3-4B-MegaMerge` | AesCoder + OSS GPT + Qwaifu fusion |
| `bunnycore/Qwen3-4B-Max-Merge` | Maximal blend with strong instruction coherence |
| `ertghiu256/qwen3-4b-claude-sonnet-x-gemini-reasoning` | Gemini 3 Pro & Claude Sonnet enhanced |
| `ZeroXClem/Qwen3-4B-Hermes-Axion-Pro` | Reasoning + alignment + safety-core logic |
---
## 💼 Ideal For
* 🔍 Chain-of-thought reasoning & symbolic logic
* 🧪 Long-document summarization & analysis
* 💬 Roleplay & storytelling with high memory
* 📚 Tutoring & educational AI (math, code, science)
* ⚖️ Safety-aligned assistants for deployment
* 👨‍💻 Code generation, refactoring, and walkthroughs
---
## 🚫 Limitations
* Long responses may truncate if `max_new_tokens` is too low
* Mixed training sources may result in style variance across domains
* Dialogue tone defaults to helpful/instructive rather than emotional/creative
---
## 📜 License
**Apache 2.0**
Please verify license alignment when adapting for commercial or public-facing use.
---
## 💌 Credits
Thank you to **bunnycore**, the **Qwen Team**, **ertghiu256**, and **TeichAI** for the wonderful fine-tunes and base models that made this merge possible!
Built by the **ZeroXClem Team** — merging distillations, intellect, and ideas into tangible weights.
MergeKit mastery packs the full distilled insight into one mighty little package.
> *Mini in name. Infinite in mind.* 🛸