Davidsv/SUONG-2

@@ -1,4 +1,5 @@
 ---
 base_model:
 - teknium/OpenHermes-2.5-Mistral-7B
 - NousResearch/Nous-Hermes-2-Mistral-7B-DPO
@@ -6,18 +7,32 @@ tags:
 - merge
 - mergekit
 - lazymergekit
-- teknium/OpenHermes-2.5-Mistral-7B
-- NousResearch/Nous-Hermes-2-Mistral-7B-DPO
 ---
 # SUONG-2
-SUONG-2 is a merge of the following models using [LazyMergekit](https://colab.research.google.com/drive/1obulZ1ROXHjYLn6PPZJwRR6GzgQogxxb?usp=sharing):
 * [teknium/OpenHermes-2.5-Mistral-7B](https://huggingface.co/teknium/OpenHermes-2.5-Mistral-7B)
 * [NousResearch/Nous-Hermes-2-Mistral-7B-DPO](https://huggingface.co/NousResearch/Nous-Hermes-2-Mistral-7B-DPO)
-## 🧩 Configuration
 ```yaml
 slices:
   - sources:
@@ -34,30 +49,4 @@ parameters:
     - filter: mlp
       value: [1, 0.5, 0.7, 0.3, 0]
     - value: 0.5
-dtype: bfloat16
-```
-## 💻 Usage
-```python
-!pip install -qU transformers accelerate
-from transformers import AutoTokenizer
-import transformers
-import torch
-model = "Davidsv/SUONG-2"
-messages = [{"role": "user", "content": "What is a large language model?"}]
-tokenizer = AutoTokenizer.from_pretrained(model)
-prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
-pipeline = transformers.pipeline(
-    "text-generation",
-    model=model,
-    torch_dtype=torch.float16,
-    device_map="auto",
-)
-outputs = pipeline(prompt, max_new_tokens=256, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
-print(outputs[0]["generated_text"])
-```

 ---
+license: apache-2.0
 base_model:
 - teknium/OpenHermes-2.5-Mistral-7B
 - NousResearch/Nous-Hermes-2-Mistral-7B-DPO
 - merge
 - mergekit
 - lazymergekit
+- mistral
+- hermes
+- dpo
 ---
 # SUONG-2
+SUONG-2 is a merge of two Mistral-7B Hermes models, created with [LazyMergekit](https://colab.research.google.com/drive/1obulZ1ROXHjYLn6PPZJwRR6GzgQogxxb?usp=sharing). It combines OpenHermes-2.5 with the DPO-tuned Nous-Hermes-2, aiming to pair the former's general capabilities with the latter's instruction following.
+## About Me
+I'm David Soeiro-Vuong, a third-year Computer Science student working as an apprentice at TW3 Partners, a company specializing in generative AI. I'm passionate about artificial intelligence and language model optimization, and I focus on creating efficient model merges that balance performance and capabilities.
+🔗 [Connect with me on LinkedIn](https://www.linkedin.com/in/david-soeiro-vuong-a28b582ba/)
+## Merge Details
+### Merge Method
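+This model was merged with SLERP (spherical linear interpolation), with interpolation weights that vary across the layer stack:
+- Attention layers use a progressive fusion pattern
+- MLP layers use balanced transitions between the two models (`[1, 0.5, 0.7, 0.3, 0]`)
+- Weights are stored in `bfloat16` to reduce memory usage
+- All layers of both source models are used, to retain as much capability as possible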
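
The merge method above can be illustrated with a minimal pure-Python sketch. It is not mergekit's implementation: `slerp` and `layer_t` are hypothetical helper names, the 32-layer count is an assumption (the `layer_range` is elided in this diff), and spreading the list values evenly across layers with linear interpolation reflects my understanding of how mergekit treats list-valued parameters.

```python
import math


def slerp(t, v0, v1, eps=1e-8):
    """Spherical linear interpolation between two flat weight vectors.

    t=0 returns v0 and t=1 returns v1.
    """
    norm0 = math.sqrt(sum(x * x for x in v0))
    norm1 = math.sqrt(sum(x * x for x in v1))
    # Cosine of the angle between the two vectors, clamped for acos.
    dot = sum(a * b for a, b in zip(v0, v1)) / max(norm0 * norm1, eps)
    dot = max(-1.0, min(1.0, dot))
    # Nearly colinear vectors: sin(theta) ~ 0, so fall back to plain lerp.
    if abs(dot) > 1.0 - 1e-6:
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]
    theta = math.acos(dot)
    s0 = math.sin((1 - t) * theta) / math.sin(theta)
    s1 = math.sin(t * theta) / math.sin(theta)
    return [s0 * a + s1 * b for a, b in zip(v0, v1)]


def layer_t(anchors, layer_idx, num_layers):
    """Interpolate a list of anchor values linearly across the layer stack."""
    if num_layers == 1 or len(anchors) == 1:
        return anchors[0]
    # Map the layer index onto the anchor list, then lerp between neighbours.
    pos = layer_idx / (num_layers - 1) * (len(anchors) - 1)
    lo = int(math.floor(pos))
    hi = min(lo + 1, len(anchors) - 1)
    frac = pos - lo
    return (1 - frac) * anchors[lo] + frac * anchors[hi]


# MLP weights from the configuration; 32 layers is an assumption.
mlp_anchors = [1, 0.5, 0.7, 0.3, 0]
num_layers = 32
for idx in (0, 8, 16, 24, 31):
    print(idx, round(layer_t(mlp_anchors, idx, num_layers), 3))

# Toy example: interpolate halfway between two orthogonal "weight" vectors.
print(slerp(0.5, [1.0, 0.0], [0.0, 1.0]))
```

Unlike plain averaging, SLERP moves along the arc between the two weight vectors, so the halfway point between two orthogonal unit vectors keeps unit length instead of shrinking.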
+### Models Merged
 * [teknium/OpenHermes-2.5-Mistral-7B](https://huggingface.co/teknium/OpenHermes-2.5-Mistral-7B)
 * [NousResearch/Nous-Hermes-2-Mistral-7B-DPO](https://huggingface.co/NousResearch/Nous-Hermes-2-Mistral-7B-DPO)
+### Configuration
 ```yaml
 slices:
   - sources:
     - filter: mlp
       value: [1, 0.5, 0.7, 0.3, 0]
     - value: 0.5
+dtype: bfloat16