Echoes123-3
/

qwen2.5-0.5b-coding-pruned

8-bit precision

Model card Files Files and versions

qwen2.5-0.5b-coding-pruned / README.md

Echoes123-3's picture

Upload README.md with huggingface_hub

2cb9abc verified 4 months ago

|

history blame contribute delete

841 Bytes

license: apache-2.0
base_model: Qwen/Qwen2.5-0.5B
tags:
  - coding
  - pruning
  - ffn-pruning
  - specialization

Qwen 2.5 0.5B – Coding Expert (FFN-Pruned)

This model is a coding-specialized expert derived from Qwen 2.5 0.5B using activation-based FFN neuron pruning on the HumanEval dataset.

Key features

~25% FFN neuron pruning
No retraining
Stable Python code generation
Reduced compute per token

Method

Collected per-neuron FFN activations during inference
Pruned lowest-activation neurons per layer
Preserved embeddings, attention, and early representations

Intended use

Research
Model compression experiments
Expert routing / MoE research

Limitations

Specialized for coding
General language performance may degrade

This is a research artifact, not a production model.