Qwen3-4B-html-light-prune

This model is a light pruned version of Qwen/Qwen3-4B, specialized for HTML tasks.

Pruning Details

  • Base Model: Qwen/Qwen3-4B
  • Specialization: Html
  • Prune Mode: Light
  • Method: Activation-based weight pruning

Performance Comparison

Category Original Pruned
Python 0.0% 6.7%
HTML 6.7% 26.7%
Trivia 86.7% 86.7%
Math 40.0% 40.0%
Reasoning 60.0% 46.7%

Comparison Graph

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("CompactAI/Qwen3-4B-html-light-prune-prune")
tokenizer = AutoTokenizer.from_pretrained("CompactAI/Qwen3-4B-html-light-prune-prune")

License

This model inherits the license from the base model.

Downloads last month
3
Safetensors
Model size
4B params
Tensor type
F16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for CompactAI/Qwen3-4B-html-light-prune

Base model

Qwen/Qwen3-4B-Base
Finetuned
Qwen/Qwen3-4B
Finetuned
(442)
this model

Collection including CompactAI/Qwen3-4B-html-light-prune