gss1147 committed on
Commit 241eefd · verified · 1 parent: f282d76

Upload 4 files

Files changed (5):
1. .gitattributes +1 -0
2. README.md +54 -127
3. special_tokens_map.json +31 -0
4. tokenizer.json +3 -0
5. tokenizer_config.json +239 -0
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
  *.zip filter=lfs diff=lfs merge=lfs -text
  *.zst filter=lfs diff=lfs merge=lfs -text
  *tfevents* filter=lfs diff=lfs merge=lfs -text
+ tokenizer.json filter=lfs diff=lfs merge=lfs -text
README.md CHANGED
@@ -1,127 +1,54 @@
-
- ```markdown
- ---
- base_model:
- - openfree/Darwin-Qwen3-4B
- - Lucidity-AI/Astral-4B-Coder
- library_name: transformers
- tags:
- - merge
- - mergekit
- - qwen
- - qwen2.5
- - safetensors
- - code
- - logic
- license: apache-2.0
- language:
- - en
- ---
-
- # Darwin-Astral-4B-Coder
-
- !
-
- **Darwin-Astral-4B-Coder** is a specialized 4B parameter model resulting from the amalgamation of high-performance logic and coding models. It was created to combine the evolutionary reasoning capabilities of the *Darwin* series with the precise code generation of the *Astral* series, resulting in a lightweight but powerful coding assistant.
-
- This model was merged using the custom **Amalgamation AI** engine (powered by `mergekit`).
-
- ## 👊 The "Within Us" Philosophy
- This model represents a fusion of two distinct intelligences:
- 1. **Logic & Reasoning:** Inherited from `openfree/Darwin-Qwen3-4B`.
- 2. **Coding Proficiency:** Inherited from `Lucidity-AI/Astral-4B-Coder`.
-
- By merging these, we aim to create a "Thinking Coder" capable of understanding complex prompts and generating efficient, clean code on consumer hardware.
-
- ## 💻 Technical Details
-
- * **Base Architecture:** Qwen2.5 (4B)
- * **Merge Method:** SLERP (Spherical Linear Interpolation)
- * **Precision:** Float16
- * **Layer Count:** 36 Layers
- * **Developer:** Guy DuGan II (Within Us AI)
-
- ### Merge Configuration
-
- The following configuration was used to generate this model:
-
- ```yaml
- models:
-   - model: openfree/Darwin-Qwen3-4B
-     # No parameters necessary for base model
-   - model: Lucidity-AI/Astral-4B-Coder
-     parameters:
-       density: 0.5
-       weight: 0.5
- merge_method: slerp
- base_model: openfree/Darwin-Qwen3-4B
- parameters:
-   t:
-     - filter: embed_tokens
-       value: 0.0
-     - filter: self_attn
-       value: 0.5
-     - filter: mlp
-       value: 0.5
-     - filter: lm_head
-       value: 1.0
-     - value: 0.5 # Catch-all for norm layers
- dtype: float16
-
- ```
-
- ## 🚀 How to Use (Transformers)
-
- You can run this model directly using the Hugging Face `transformers` library.
-
- ```python
- import torch
- from transformers import AutoTokenizer, AutoModelForCausalLM
-
- model_id = "WithinUsAI/Darwin-Astral-4B-Coder" # Replace with your actual username/repo
-
- tokenizer = AutoTokenizer.from_pretrained(model_id)
- model = AutoModelForCausalLM.from_pretrained(
-     model_id,
-     torch_dtype=torch.float16,
-     device_map="auto"
- )
-
- prompt = "Write a Python script to merge two sorted lists."
- messages = [
-     {"role": "system", "content": "You are an advanced coding assistant."},
-     {"role": "user", "content": prompt}
- ]
-
- text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
- inputs = tokenizer(text, return_tensors="pt").to(model.device)
-
- outputs = model.generate(
-     **inputs,
-     max_new_tokens=512,
-     do_sample=True,
-     temperature=0.7
- )
-
- print(tokenizer.decode(outputs[0], skip_special_tokens=True))
-
- ```
-
- ## 📜 License
-
- This model is released under the **Apache 2.0** license, following the licensing of the base Qwen models. Please refer to the original model cards for specific restrictions.
-
- ---
-
- *Created with Amalgamation AI by Within Us AI.*
-
- ```
-
- ---
-
- ### **Next Step: GGUF Conversion**
- Once you have uploaded this to Hugging Face (or even if you keep it local), the next logical step is to make it runnable on your iPhone, older laptops, or via Ollama.
-
- **Ready for the GGUF script?** I can provide a `convert_to_gguf.py` script that handles the quantization (making it smaller/faster) automatically.
-
- ```
+ ---
+ base_model: []
+ library_name: transformers
+ tags:
+ - mergekit
+ - merge
+
+ ---
+ # WithinUs_CPU_Hybrid
+
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
+
+ ## Merge Details
+ ### Merge Method
+
+ This model was merged using the [SLERP](https://en.wikipedia.org/wiki/Slerp) merge method.
+
+ ### Models Merged
+
+ The following models were included in the merge:
+ * X:/Genesis_X/models/openfree-Darwin-Qwen3-4B
+ * X:/Genesis_X/models/Lucidity-AI-Astral-4B-Coder
+
+ ### Configuration
+
+ The following YAML configuration was used to produce this model:
+
+ ```yaml
+ base_model: X:/Genesis_X/models/Lucidity-AI-Astral-4B-Coder
+ dtype: float16
+ merge_method: slerp
+ parameters:
+   t:
+   - filter: embed_tokens
+     value: 0.0
+   - filter: self_attn
+     value: 0.5
+   - filter: mlp
+     value: 0.5
+   - filter: lm_head
+     value: 1.0
+   - value: 0.5
+ slices:
+ - sources:
+   - layer_range:
+     - 0
+     - 36
+     model: X:/Genesis_X/models/Lucidity-AI-Astral-4B-Coder
+   - layer_range:
+     - 0
+     - 36
+     model: X:/Genesis_X/models/openfree-Darwin-Qwen3-4B
+
+ ```
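The per-filter `t` values in the configuration control how far each parameter group moves from the base model toward the other model (`t: 0.0` keeps the base model's embeddings, `t: 1.0` takes the other model's `lm_head`, `0.5` blends attention and MLP weights halfway). As a minimal, mergekit-independent sketch of what spherical linear interpolation does to a pair of weight vectors (the function name and toy vectors below are illustrative, not from the repo):

```python
import math

def slerp(t, v0, v1, eps=1e-8):
    """Spherical linear interpolation between two weight vectors.

    t=0.0 returns v0 and t=1.0 returns v1; intermediate t moves along
    the great-circle arc between the two directions instead of the
    straight line used by plain linear interpolation.
    """
    dot = sum(a * b for a, b in zip(v0, v1))
    norm0 = math.sqrt(sum(a * a for a in v0))
    norm1 = math.sqrt(sum(b * b for b in v1))
    cos_theta = max(-1.0, min(1.0, dot / (norm0 * norm1 + eps)))
    theta = math.acos(cos_theta)
    if theta < eps:  # nearly parallel vectors: fall back to linear interpolation
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]
    s0 = math.sin((1 - t) * theta) / math.sin(theta)
    s1 = math.sin(t * theta) / math.sin(theta)
    return [s0 * a + s1 * b for a, b in zip(v0, v1)]

# The endpoints are recovered exactly, matching the t=0.0 / t=1.0 filters above.
print(slerp(0.0, [1.0, 0.0], [0.0, 1.0]))  # [1.0, 0.0]
print(slerp(0.5, [1.0, 0.0], [0.0, 1.0]))  # halfway along the arc
```

Real merges apply this tensor-by-tensor across all 36 layers; the catch-all `value: 0.5` covers tensors (such as norm layers) not matched by an earlier filter.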
special_tokens_map.json ADDED
@@ -0,0 +1,31 @@
+ {
+   "additional_special_tokens": [
+     "<|im_start|>",
+     "<|im_end|>",
+     "<|object_ref_start|>",
+     "<|object_ref_end|>",
+     "<|box_start|>",
+     "<|box_end|>",
+     "<|quad_start|>",
+     "<|quad_end|>",
+     "<|vision_start|>",
+     "<|vision_end|>",
+     "<|vision_pad|>",
+     "<|image_pad|>",
+     "<|video_pad|>"
+   ],
+   "eos_token": {
+     "content": "<|im_end|>",
+     "lstrip": false,
+     "normalized": false,
+     "rstrip": false,
+     "single_word": false
+   },
+   "pad_token": {
+     "content": "<|endoftext|>",
+     "lstrip": false,
+     "normalized": false,
+     "rstrip": false,
+     "single_word": false
+   }
+ }
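In `special_tokens_map.json`, `eos_token` and `pad_token` may be written either as a bare string or, as in this commit, as an AddedToken-style dict with flags. A small sketch of reading both forms (the JSON literal below inlines a subset of this file for self-containment; `token_content` is a hypothetical helper, not a `transformers` API):

```python
import json

# Subset of the special_tokens_map.json added in this commit, inlined so the
# snippet runs without the repo on disk.
special_tokens_map = json.loads("""
{
  "eos_token": {"content": "<|im_end|>", "lstrip": false, "normalized": false,
                "rstrip": false, "single_word": false},
  "pad_token": {"content": "<|endoftext|>", "lstrip": false, "normalized": false,
                "rstrip": false, "single_word": false}
}
""")

def token_content(entry):
    """Accept both the bare-string form and the AddedToken-style dict form."""
    return entry if isinstance(entry, str) else entry["content"]

print(token_content(special_tokens_map["eos_token"]))  # <|im_end|>
print(token_content(special_tokens_map["pad_token"]))  # <|endoftext|>
```

Note the pad token (`<|endoftext|>`) differs from the EOS token (`<|im_end|>`), the usual Qwen convention.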
tokenizer.json ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:aeb13307a71acd8fe81861d94ad54ab689df773318809eed3cbe794b4492dae4
+ size 11422654
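What the diff shows for `tokenizer.json` is not the tokenizer itself but a Git LFS pointer stub (hence the rule added to `.gitattributes`); the ~11 MB blob lives in LFS storage. A sketch of parsing the three-line spec-v1 pointer format (`parse_lfs_pointer` is an illustrative helper, not part of any Git tooling):

```python
# The exact pointer text committed for tokenizer.json in this change.
pointer_text = """version https://git-lfs.github.com/spec/v1
oid sha256:aeb13307a71acd8fe81861d94ad54ab689df773318809eed3cbe794b4492dae4
size 11422654
"""

def parse_lfs_pointer(text):
    """Split each 'key value' line, then split the oid into algorithm:digest."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {"version": fields["version"], "algo": algo,
            "digest": digest, "size": int(fields["size"])}

info = parse_lfs_pointer(pointer_text)
print(info["algo"])  # sha256
print(info["size"])  # 11422654
```

The `size` is the byte count of the real file, so a checkout without `git lfs pull` will contain only this small stub.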
tokenizer_config.json ADDED
@@ -0,0 +1,239 @@
+ {
+   "add_bos_token": false,
+   "add_prefix_space": false,
+   "added_tokens_decoder": {
+     "151643": {
+       "content": "<|endoftext|>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "151644": {
+       "content": "<|im_start|>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "151645": {
+       "content": "<|im_end|>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "151646": {
+       "content": "<|object_ref_start|>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "151647": {
+       "content": "<|object_ref_end|>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "151648": {
+       "content": "<|box_start|>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "151649": {
+       "content": "<|box_end|>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "151650": {
+       "content": "<|quad_start|>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "151651": {
+       "content": "<|quad_end|>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "151652": {
+       "content": "<|vision_start|>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "151653": {
+       "content": "<|vision_end|>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "151654": {
+       "content": "<|vision_pad|>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "151655": {
+       "content": "<|image_pad|>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "151656": {
+       "content": "<|video_pad|>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "151657": {
+       "content": "<tool_call>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": false
+     },
+     "151658": {
+       "content": "</tool_call>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": false
+     },
+     "151659": {
+       "content": "<|fim_prefix|>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": false
+     },
+     "151660": {
+       "content": "<|fim_middle|>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": false
+     },
+     "151661": {
+       "content": "<|fim_suffix|>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": false
+     },
+     "151662": {
+       "content": "<|fim_pad|>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": false
+     },
+     "151663": {
+       "content": "<|repo_name|>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": false
+     },
+     "151664": {
+       "content": "<|file_sep|>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": false
+     },
+     "151665": {
+       "content": "<tool_response>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": false
+     },
+     "151666": {
+       "content": "</tool_response>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": false
+     },
+     "151667": {
+       "content": "<think>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": false
+     },
+     "151668": {
+       "content": "</think>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": false
+     }
+   },
+   "additional_special_tokens": [
+     "<|im_start|>",
+     "<|im_end|>",
+     "<|object_ref_start|>",
+     "<|object_ref_end|>",
+     "<|box_start|>",
+     "<|box_end|>",
+     "<|quad_start|>",
+     "<|quad_end|>",
+     "<|vision_start|>",
+     "<|vision_end|>",
+     "<|vision_pad|>",
+     "<|image_pad|>",
+     "<|video_pad|>"
+   ],
+   "bos_token": null,
+   "clean_up_tokenization_spaces": false,
+   "eos_token": "<|im_end|>",
+   "errors": "replace",
+   "extra_special_tokens": {},
+   "model_max_length": 131072,
+   "pad_token": "<|endoftext|>",
+   "split_special_tokens": false,
+   "tokenizer_class": "Qwen2Tokenizer",
+   "unk_token": null
+ }
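The `added_tokens_decoder` maps string token IDs to token definitions; a loader builds id-to-token and token-to-id tables from it, and only entries with `"special": true` are dropped when decoding with `skip_special_tokens=True` (so `<think>`/`</think>`, marked `false` here, survive decoding). A sketch over a small inlined subset of this file:

```python
import json

# Three representative entries from the added_tokens_decoder in this commit,
# trimmed to the fields used below.
added_tokens_decoder = json.loads("""
{
  "151643": {"content": "<|endoftext|>", "special": true},
  "151645": {"content": "<|im_end|>", "special": true},
  "151667": {"content": "<think>", "special": false}
}
""")

# Build id -> token and token -> id maps, as a tokenizer loader would.
id_to_token = {int(k): v["content"] for k, v in added_tokens_decoder.items()}
token_to_id = {tok: tid for tid, tok in id_to_token.items()}

print(id_to_token[151645])     # <|im_end|>
print(token_to_id["<think>"])  # 151667

# Only the "special": true IDs would be stripped by skip_special_tokens=True.
special_ids = {int(k) for k, v in added_tokens_decoder.items() if v["special"]}
print(sorted(special_ids))     # [151643, 151645]
```

`model_max_length: 131072` and `tokenizer_class: Qwen2Tokenizer` come straight from the Qwen tokenizer this merge inherits.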