Upload folder using huggingface_hub

Files changed (5)

README.md +16 -35
comparison_graph.png +0 -0
model-00001-of-00002.safetensors +1 -1
model-00002-of-00002.safetensors +1 -1
tokenizer.json +1 -1

README.md CHANGED

@@ -5,7 +5,6 @@ tags:
 - math
 - optimized
 - wanda
-- activation-pruning
 base_model: Qwen/Qwen2.5-3B-Instruct
 pipeline_tag: text-generation
 ---
@@ -14,35 +13,28 @@ pipeline_tag: text-generation
 > 🎯 **MATH-optimized** | 📦 **Safe** pruning | ⚡ **1% weights pruned**
-This model is a **conservatively pruned** version of [Qwen/Qwen2.5-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-3B-Instruct), specialized for **MATH** tasks using activation-aware weight pruning (Wanda-style).
-## ✨ Key Features
-- **Specialization**: Optimized for Math tasks
-- **Pruning Method**: Wanda-style (|W| × |activation|) importance scoring
-- **Size Reduction**: 1% weights pruned
-- **Use Case**: High accuracy retention, ideal for production use
-## 📊 Performance Comparison
 | Category | Original | Pruned | Change |
 |----------|----------|--------|--------|
-| Python | 100.0% | 100.0% | → |
-| Html | 6.7% | 6.7% | → |
-| Trivia | 66.7% | 66.7% | → |
-| **Math** | 60.0% | 60.0% ⭐ | → |
-| Reasoning | 100.0% | 100.0% | → |
-| Medical | 86.7% | 86.7% | → |
-| Linux | 100.0% | 100.0% | → |
-| Writing | 73.3% | 73.3% | → |
-**Average**: 74.2% → 74.2% (+0.0%)
-**Math Retention**: 100.0% of original performance
 ![Comparison Graph](comparison_graph.png)
-## 🚀 Quick Start
 ```python
 from transformers import AutoModelForCausalLM, AutoTokenizer
@@ -50,31 +42,20 @@ from transformers import AutoModelForCausalLM, AutoTokenizer
 model = AutoModelForCausalLM.from_pretrained("CompactAI/Qwen2.5-3B-Instruct-math-safe")
 tokenizer = AutoTokenizer.from_pretrained("CompactAI/Qwen2.5-3B-Instruct-math-safe")
-# Example usage
 inputs = tokenizer("Your prompt here", return_tensors="pt")
 outputs = model.generate(**inputs, max_new_tokens=100)
 print(tokenizer.decode(outputs[0], skip_special_tokens=True))
 ```
-## 📋 Technical Details
 | Property | Value |
 |----------|-------|
 | Base Model | [Qwen/Qwen2.5-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-3B-Instruct) |
 | Specialization | Math |
 | Prune Mode | Safe |
-| Pruning Method | Activation-based weight pruning (Wanda) |
 | Weight Reduction | 1% weights pruned |
-## 🔗 Related Models
-This model is part of the **Qwen2.5-3B-Instruct** pruned model collection. Variants:
-- **Safe** - Conservative pruning (~10-20%), high accuracy retention
-- **Aggressive** - Maximum compression (~40-50%), best for edge deployment
-## 📜 License
-This model inherits the license from the base model [Qwen/Qwen2.5-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-3B-Instruct).
----
-*Generated by ZANNPS [Zeto Automatic Neural Network Pruning System]*

 - math
 - optimized
 - wanda
 base_model: Qwen/Qwen2.5-3B-Instruct
 pipeline_tag: text-generation
 ---
 > 🎯 **MATH-optimized** | 📦 **Safe** pruning | ⚡ **1% weights pruned**
+This model is a **conservatively pruned** version of [Qwen/Qwen2.5-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-3B-Instruct).
+## Performance Comparison
 | Category | Original | Pruned | Change |
 |----------|----------|--------|--------|
+| Python | 92.3% | 92.3% | → |
+| Html | 40.0% | 40.0% | → |
+| Trivia | 100.0% | 100.0% | → |
+| **Math** | 100.0% | 100.0% ⭐ | → |
+| Reasoning | 91.7% | 91.7% | → |
+| Medical | 64.3% | 64.3% | → |
+| Linux | 69.2% | 69.2% | → |
+| Writing | 54.5% | 54.5% | → |
+**Average**: 76.5% → 76.5% (+0.0%)
+**Math Retention**: 100.0%
 ![Comparison Graph](comparison_graph.png)
+## Quick Start
 ```python
 from transformers import AutoModelForCausalLM, AutoTokenizer
 model = AutoModelForCausalLM.from_pretrained("CompactAI/Qwen2.5-3B-Instruct-math-safe")
 tokenizer = AutoTokenizer.from_pretrained("CompactAI/Qwen2.5-3B-Instruct-math-safe")
 inputs = tokenizer("Your prompt here", return_tensors="pt")
 outputs = model.generate(**inputs, max_new_tokens=100)
 print(tokenizer.decode(outputs[0], skip_special_tokens=True))
 ```
+## Technical Details
 | Property | Value |
 |----------|-------|
 | Base Model | [Qwen/Qwen2.5-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-3B-Instruct) |
 | Specialization | Math |
 | Prune Mode | Safe |
 | Weight Reduction | 1% weights pruned |
+## License
+This model inherits the license from the base model.
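
Note on the removed method description: the README lines deleted in this commit described the pruning as Wanda-style (|W| × |activation|) importance scoring. The block below is a minimal sketch of that kind of scoring, assuming the usual Wanda formulation (each weight scored by its magnitude times the L2 norm of its input feature, with the lowest scores dropped per output row). It is not the ZANNPS implementation; the function name `wanda_prune`, the array shapes, and the random calibration data are all hypothetical.

```python
import numpy as np


def wanda_prune(weight, activations, sparsity):
    """Zero the lowest-scoring fraction of weights in each output row.

    weight:      (out_features, in_features) layer weights
    activations: (n_tokens, in_features) calibration inputs to the layer
    sparsity:    fraction of weights to remove per row, e.g. 0.01 for 1%
    """
    # L2 norm of each input feature over the calibration tokens.
    act_norm = np.linalg.norm(activations, axis=0)
    # Importance score |W| * ||X||, broadcast across output rows.
    score = np.abs(weight) * act_norm
    n_prune = int(round(weight.shape[1] * sparsity))
    pruned = weight.copy()
    if n_prune == 0:
        return pruned
    # Column indices of the n_prune smallest scores in every row.
    idx = np.argpartition(score, n_prune - 1, axis=1)[:, :n_prune]
    np.put_along_axis(pruned, idx, 0.0, axis=1)
    return pruned


# Hypothetical toy layer: 4 outputs, 200 inputs, 32 calibration tokens.
rng = np.random.default_rng(0)
W = rng.normal(size=(4, 200))
X = rng.normal(size=(32, 200))
W_pruned = wanda_prune(W, X, sparsity=0.01)
zeros_per_row = (W_pruned == 0).sum(axis=1)
```

At the 1% level stated in the README, each row of this toy layer loses 2 of its 200 weights, which fits the unchanged scores in the comparison table.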

comparison_graph.png CHANGED

model-00001-of-00002.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:bb655ba20979a5b4eeac19afa3715e65cf00a8aa286b16f05b49502a8acaa25d
 size 3995916600

 version https://git-lfs.github.com/spec/v1
+oid sha256:c8841f2986cfbc2b0d0db9a7e46608044b8287d48b5f87e3c62bb66d82294b51
 size 3995916600

model-00002-of-00002.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:743e4f7edd725dc4644875ecf255e08f13617285a81616e169206b5476c18a00
 size 2176009944

 version https://git-lfs.github.com/spec/v1
+oid sha256:a3bd36c63cc06eb90348899b2b6554ea82537f97b9009f89b375c89d29e2d77c
 size 2176009944

tokenizer.json CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:51354673edf4300eb841665e1fb684cc1badea87c49d5de6ef09981151683508
 size 11422159

 version https://git-lfs.github.com/spec/v1
+oid sha256:7b3e3adf18710ac3bd97b384b0d01b58205c4c5cd37c6c56d24c8fff86b0561d
 size 11422159
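
Note on the pointer diffs: in all three Git LFS pointers above, only the `oid` changes while `size` stays the same. The `oid` is the SHA-256 of the file contents, so a downloaded file can be checked against its pointer. The block below is a minimal sketch of that check using only the standard library. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the local file path in the commented usage line is an assumption.

```python
import hashlib
import os


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the pointer's size and SHA-256."""
    if pointer["algo"] != "sha256":
        raise ValueError("unsupported hash: " + pointer["algo"])
    # Cheap check first: a size mismatch avoids hashing a multi-GB file.
    if os.path.getsize(path) != pointer["size"]:
        return False
    h = hashlib.sha256()
    with open(path, "rb") as f:
        # Read in chunks so large shards do not need to fit in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest() == pointer["digest"]


# New tokenizer.json pointer from this commit.
pointer = parse_lfs_pointer("""\
version https://git-lfs.github.com/spec/v1
oid sha256:7b3e3adf18710ac3bd97b384b0d01b58205c4c5cd37c6c56d24c8fff86b0561d
size 11422159
""")

# Usage, assuming a local copy exists at this (hypothetical) path:
# print(matches_pointer("tokenizer.json", pointer))
```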