Update README.md
Browse files

README.md (CHANGED)

@@ -2,73 +2,62 @@
Removed:

This is a merged model

**Merge Method**
The model was merged using

The following models were included in this merge:

1. **bunnycore/Phi-4-RR-Shoup**
2. **bunnycore/Phi-4-Model-Stock-v4**

### **Open LLM Leaderboard Evaluation Results**

Below are the evaluation metrics achieved by this merged model:

| **Metric**              | **Score** |
|-------------------------|-----------|
| **Avg.**                | **40.95** |
| **IFEval (0-Shot)**     | **65.87** |
| **BBH (3-Shot)**        | **56.11** |
| **MATH Lvl 5 (4-Shot)** | **47.96** |
| **GPQA (0-shot)**       | **11.63** |
| **MuSR (0-shot)**       | **14.94** |
| **MMLU-PRO (5-shot)**   | **49.21** |

### **Potential Use Cases**

This merged model can be applied to various NLP tasks, including but not limited to:

- **Zero-shot and Few-shot Reasoning**
- **Mathematical Problem Solving**
- **General Knowledge Question Answering**
- **Multi-task Learning for Professional Knowledge Areas (MMLU)**

### **License and Usage**

Ensure compliance with the licenses of the merged models. This merged model inherits licenses from all parent models, and users are advised to review and adhere to each individual model's license.

**Disclaimer:** This model is provided as-is, without warranty. Performance may vary based on the specific task or evaluation benchmark.
Updated README:

---
library_name: transformers
license: mit
---

# Phi-4 SLERP Merge Model

## Model Description

This is a merged language model created using the **Spherical Linear Interpolation (SLERP) merge method**, allowing a smooth blend of features from both parent models across different layers. The merge optimizes reasoning, general knowledge, and task-specific performance by strategically interpolating the attention and MLP components.

---
## Merge Details

**Merge Method:**
The model was merged using **SLERP (Spherical Linear Interpolation)** rather than a traditional linear merge, ensuring a well-balanced combination of both source models while maintaining coherent weight transitions.

**Base Model:**

- **bunnycore/Phi-4-RR-Shoup** (used as the primary base)

---
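For intuition about the merge method named above: SLERP moves between two weight vectors along the great-circle arc joining their directions instead of along a straight line, which keeps the interpolated weights from collapsing in norm. Below is a minimal, illustrative sketch of the formula on toy vectors; it is not mergekit's actual implementation, and the `slerp` function here is just for demonstration.

```python
import math

def slerp(t, v0, v1, eps=1e-8):
    """Spherical linear interpolation between two vectors.

    t=0 returns v0, t=1 returns v1; intermediate t follows the
    great-circle arc between the two directions.
    """
    norm0 = math.sqrt(sum(x * x for x in v0))
    norm1 = math.sqrt(sum(x * x for x in v1))
    # Cosine of the angle between the two directions, clamped for safety.
    dot = sum(a * b for a, b in zip(v0, v1)) / (norm0 * norm1)
    dot = max(-1.0, min(1.0, dot))
    theta = math.acos(dot)
    if theta < eps:  # nearly parallel: fall back to linear interpolation
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]
    s0 = math.sin((1 - t) * theta) / math.sin(theta)
    s1 = math.sin(t * theta) / math.sin(theta)
    return [s0 * a + s1 * b for a, b in zip(v0, v1)]

# Halfway between two orthogonal unit vectors stays on the unit circle.
w = slerp(0.5, [1.0, 0.0], [0.0, 1.0])
```

At `t = 0` the result equals the first model's weights and at `t = 1` the second's; the `t` gradients in the configuration below choose a value per layer and per tensor group.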
## Models Merged

The following models were included in this merge:

1. **bunnycore/Phi-4-RR-Shoup** (primary base)
2. **bunnycore/Phi-4-Model-Stock-v4**

---
## Configuration

The following YAML configuration was used to produce this merged model:

```yaml
slices:
  - sources:
      - model: bunnycore/Phi-4-RR-Shoup
        layer_range:
          - 0
          - 32
      - model: bunnycore/Phi-4-Model-Stock-v4
        layer_range:
          - 0
          - 32
merge_method: slerp
base_model: bunnycore/Phi-4-RR-Shoup
parameters:
  t:
    - filter: self_attn
      value:
        - 0
        - 0.5
        - 0.3
        - 0.7
        - 1
    - filter: mlp
      value:
        - 1
        - 0.5
        - 0.7
        - 0.3
        - 0
    - value: 0.5
dtype: bfloat16
```
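Each five-element `value` list under `t` is a gradient: the anchor values are spread across the layer range and interpolated so every layer gets its own `t` (0 keeps the base model's tensor, 1 takes the other model's). The helper below is a rough illustration of that spreading, not mergekit's actual code; `t_for_layer` is a hypothetical name introduced here for demonstration.

```python
def t_for_layer(anchors, layer, num_layers):
    """Map a list of anchor values onto num_layers layers by
    piecewise-linear interpolation, returning t for one layer."""
    if num_layers == 1:
        return float(anchors[0])
    # Position of this layer in [0, 1] across the whole stack.
    pos = layer / (num_layers - 1)
    # Which segment of the anchor list the position falls into.
    seg = pos * (len(anchors) - 1)
    i = min(int(seg), len(anchors) - 2)
    frac = seg - i
    return anchors[i] * (1 - frac) + anchors[i + 1] * frac

attn_anchors = [0, 0.5, 0.3, 0.7, 1]  # self_attn gradient from the config
ts = [t_for_layer(attn_anchors, layer, 32) for layer in range(32)]
```

Under this reading, early attention layers stay close to the base model (`t` near 0) while the last layers lean toward bunnycore/Phi-4-Model-Stock-v4 (`t` near 1), and the MLP gradient runs in the opposite direction.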