Update README.md
Browse files
README.md
CHANGED
|
@@ -4,19 +4,95 @@ tags:
|
|
| 4 |
- merge
|
| 5 |
- mergekit
|
| 6 |
- lazymergekit
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 7 |
---
|
| 8 |
|
| 9 |
-
# ZeroXClem
|
| 10 |
|
| 11 |
-
|
| 12 |
|
| 13 |
-
|
| 14 |
|
| 15 |
-
|
| 16 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 17 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 18 |
name: ZeroXClem-Qwen3-4B-ChromaticCoder
|
| 19 |
-
base_model: prithivMLmods/Lacaille-MoT-4B-Supreme2
|
| 20 |
dtype: bfloat16
|
| 21 |
merge_method: model_stock
|
| 22 |
models:
|
|
@@ -28,5 +104,45 @@ models:
|
|
| 28 |
- model: prithivMLmods/Bootes-Qwen3_Coder-Reasoning
|
| 29 |
- model: Loom-Labs/Apollo-1-4B
|
| 30 |
- model: GetSoloTech/Qwen3-Code-Reasoning-4B
|
| 31 |
-
tokenizer_source: prithivMLmods/Lacaille-MoT-4B-Supreme2
|
| 32 |
-
```
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4 |
- merge
|
| 5 |
- mergekit
|
| 6 |
- lazymergekit
|
| 7 |
+
- qwen3-4B
|
| 8 |
+
- ZeroXClem
|
| 9 |
+
base_model:
|
| 10 |
+
- Menlo/Jan-nano
|
| 11 |
+
- prithivMLmods/Octans-Qwen3-UI-Code-4B
|
| 12 |
+
- prithivMLmods/Logics-Qwen3-Math-4B
|
| 13 |
+
- prithivMLmods/Carinae-Qwen3-Radiation-4B
|
| 14 |
+
- prithivMLmods/Kepler-Qwen3-4B-Super-Thinking
|
| 15 |
+
- prithivMLmods/Bootes-Qwen3_Coder-Reasoning
|
| 16 |
+
- Loom-Labs/Apollo-1-4B
|
| 17 |
+
- GetSoloTech/Qwen3-Code-Reasoning-4B
|
| 18 |
+
- prithivMLmods/Lacaille-MoT-4B-Supreme2
|
| 19 |
+
pipeline_tag: text-generation
|
| 20 |
+
library_name: transformers
|
| 21 |
---
|
| 22 |
|
| 23 |
+
# ZeroXClem/Qwen3-4B-ChromaticCoder
|
| 24 |
|
| 25 |
+

|
| 26 |
|
| 27 |
+
**ZeroXClem/Qwen3-4B-ChromaticCoder** is a vibrant and versatile 4B model fusion built using `MergeKit` and the `model_stock` strategy. Blending deep reasoning, mathematical precision, frontend UI generation, and code synthesis, it shines in logic-driven and creative problem spaces.
|
| 28 |
|
| 29 |
+
This model is a chromatic cascade of top-performing Qwen3 derivatives and fine-tuned reasoning specialists โ harmonizing technical accuracy with structured expressiveness across a wide domain of tasks.
|
| 30 |
+
|
| 31 |
+
---
|
| 32 |
+
|
| 33 |
+
## ๐ง Overview
|
| 34 |
+
|
| 35 |
+
**ChromaticCoder** is based on the powerful foundation of `prithivMLmods/Lacaille-MoT-4B-Supreme2`, integrating a spectrum of expert finetunes to produce a model specialized in:
|
| 36 |
+
|
| 37 |
+
- ๐ **Mathematical and logical reasoning**
|
| 38 |
+
- ๐ป **Frontend & UI code generation**
|
| 39 |
+
- ๐งฎ **Multi-step algorithmic thinking**
|
| 40 |
+
- ๐ ๏ธ **Code reasoning, explanation, and synthesis**
|
| 41 |
+
- ๐ **Structured technical content creation**
|
| 42 |
+
|
| 43 |
+
---
|
| 44 |
+
|
| 45 |
+
## ๐งฌ Merge Details
|
| 46 |
+
|
| 47 |
+
| Detail | Value |
|
| 48 |
+
|---------------------|------------------------------------------------------------------------|
|
| 49 |
+
| **Merge Method** | `model_stock` |
|
| 50 |
+
| **Base Model** | [`prithivMLmods/Lacaille-MoT-4B-Supreme2`](https://huggingface.co/prithivMLmods/Lacaille-MoT-4B-Supreme2) |
|
| 51 |
+
| **Dtype** | `bfloat16` |
|
| 52 |
+
| **Tokenizer Source** | `prithivMLmods/Lacaille-MoT-4B-Supreme2` |
|
| 53 |
+
|
| 54 |
+
---
|
| 55 |
|
| 56 |
+
## ๐งฉ Models Merged
|
| 57 |
+
|
| 58 |
+
- [`Menlo/Jan-nano`](https://huggingface.co/Menlo/Jan-nano) โ Agentic research-aligned model with MCP support.
|
| 59 |
+
- [`prithivMLmods/Octans-Qwen3-UI-Code-4B`](https://huggingface.co/prithivMLmods/Octans-Qwen3-UI-Code-4B) โ UI code generation with Tailwind/React.
|
| 60 |
+
- [`prithivMLmods/Logics-Qwen3-Math-4B`](https://huggingface.co/prithivMLmods/Logics-Qwen3-Math-4B) โ Advanced math and logic reasoning.
|
| 61 |
+
- [`prithivMLmods/Carinae-Qwen3-Radiation-4B`](https://huggingface.co/prithivMLmods/Carinae-Qwen3-Radiation-4B) โ Balanced probabilistic modeling with multilingual reasoning.
|
| 62 |
+
- [`prithivMLmods/Kepler-Qwen3-4B-Super-Thinking`](https://huggingface.co/prithivMLmods/Kepler-Qwen3-4B-Super-Thinking) โ Hybrid symbolic-probabilistic thought.
|
| 63 |
+
- [`prithivMLmods/Bootes-Qwen3_Coder-Reasoning`](https://huggingface.co/prithivMLmods/Bootes-Qwen3_Coder-Reasoning) โ Instruction-tuned code synthesis and stepwise debugging.
|
| 64 |
+
- [`Loom-Labs/Apollo-1-4B`](https://huggingface.co/NoemaResearch/Apollo-1-4B) โ General-purpose reasoning and multilingual instruction following.
|
| 65 |
+
- [`GetSoloTech/Qwen3-Code-Reasoning-4B`](https://huggingface.co/GetSoloTech/Qwen3-Code-Reasoning-4B) โ Competitive programming and reasoning powerhouse.
|
| 66 |
+
|
| 67 |
+
---
|
| 68 |
+
|
| 69 |
+
## ๐ Chromatic Features
|
| 70 |
+
|
| 71 |
+
โจ **Unified Expert Reasoning**
|
| 72 |
+
Brings together multiple specialized reasoning modules โ from UI generation to symbolic math and programming logic โ into one coherent architecture.
|
| 73 |
+
|
| 74 |
+
๐ง **Deep Logic and Event Simulation**
|
| 75 |
+
Excels in modeling probabilistic systems, structured math, and algorithmic solutions with step-by-step clarity.
|
| 76 |
+
|
| 77 |
+
๐ป **Frontend & UI Coding Mastery**
|
| 78 |
+
With Octans and Jan-nano integrations, this model generates accurate and readable frontend code (React, Tailwind, HTML5).
|
| 79 |
+
|
| 80 |
+
๐งช **STEM-Specialized Performance**
|
| 81 |
+
Fine-tuned on math, logic, and scientific problem domains, ChromaticCoder is a strong match for educational and research applications.
|
| 82 |
+
|
| 83 |
+
๐ ๏ธ **Developer-Centric Reasoning**
|
| 84 |
+
Instruction-tuned layers optimize code completion, refactoring, and explanation across Python, JS, C++, and more.
|
| 85 |
+
|
| 86 |
+
๐ **Multilingual Capabilities**
|
| 87 |
+
Thanks to Apollo and Carinae, it supports over 80 languages in both reasoning and coding domains.
|
| 88 |
+
|
| 89 |
+
---
|
| 90 |
+
|
| 91 |
+
## ๐ง MergeKit Configuration
|
| 92 |
+
|
| 93 |
+
```yaml
|
| 94 |
name: ZeroXClem-Qwen3-4B-ChromaticCoder
|
| 95 |
+
base_model: prithivMLmods/Lacaille-MoT-4B-Supreme2
|
| 96 |
dtype: bfloat16
|
| 97 |
merge_method: model_stock
|
| 98 |
models:
|
|
|
|
| 104 |
- model: prithivMLmods/Bootes-Qwen3_Coder-Reasoning
|
| 105 |
- model: Loom-Labs/Apollo-1-4B
|
| 106 |
- model: GetSoloTech/Qwen3-Code-Reasoning-4B
|
| 107 |
+
tokenizer_source: prithivMLmods/Lacaille-MoT-4B-Supreme2
|
| 108 |
+
````
|
| 109 |
+
|
| 110 |
+
---
|
| 111 |
+
|
| 112 |
+
## ๐ก Use Cases
|
| 113 |
+
|
| 114 |
+
* ๐ **STEM Tutoring & Education**
|
| 115 |
+
* ๐งฎ **Mathematical and Logical Explanation**
|
| 116 |
+
* ๐ฅ๏ธ **Frontend Development & Prototyping**
|
| 117 |
+
* ๐ **Technical Documentation**
|
| 118 |
+
* ๐งโ๐ป **Algorithm Debugging & Refactoring**
|
| 119 |
+
* ๐ค **Agentic Reasoning and Simulated Tool Use**
|
| 120 |
+
|
| 121 |
+
---
|
| 122 |
+
|
| 123 |
+
## ๐งช Limitations
|
| 124 |
+
|
| 125 |
+
* Limited by 4B parameter size โ may struggle with extremely long or open-domain contexts.
|
| 126 |
+
* Some outputs may be verbose or over-explained depending on the base tuning weights.
|
| 127 |
+
* Not suitable for unrestricted creative or emotional writing tasks.
|
| 128 |
+
|
| 129 |
+
---
|
| 130 |
+
|
| 131 |
+
## โ๏ธ License & Usage
|
| 132 |
+
|
| 133 |
+
* License: **Apache 2.0**
|
| 134 |
+
* Users are responsible for implementing appropriate safety and moderation when deploying the model.
|
| 135 |
+
|
| 136 |
+
---
|
| 137 |
+
|
| 138 |
+
## ๐ช Credits & Acknowledgements
|
| 139 |
+
|
| 140 |
+
This fusion was only possible thanks to the incredible work of:
|
| 141 |
+
|
| 142 |
+
* **Menlo Research**, **PrithivML**, **Loom Labs**, **GetSoloTech**, and others
|
| 143 |
+
* Model authors and dataset contributors across the OSS reasoning community
|
| 144 |
+
* Qwen3 for providing a strong base ecosystem for 4B-scale thinking models
|
| 145 |
+
|
| 146 |
+
---
|
| 147 |
+
|
| 148 |
+
**Made with ๐ by the ZeroXClem team. ๐ฎ**
|