Qwen3-4B-html-light-prune

This model is a light pruned version of Qwen/Qwen3-4B, specialized for HTML tasks.

Pruning Details

Base Model: Qwen/Qwen3-4B
Specialization: Html
Prune Mode: Light
Method: Activation-based weight pruning

Performance Comparison

Category	Original	Pruned
Python	0.0%	6.7%
HTML	6.7%	26.7%
Trivia	86.7%	86.7%
Math	40.0%	40.0%
Reasoning	60.0%	46.7%

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("CompactAI/Qwen3-4B-html-light-prune-prune")
tokenizer = AutoTokenizer.from_pretrained("CompactAI/Qwen3-4B-html-light-prune-prune")

License

This model inherits the license from the base model.

Downloads last month: 3

Safetensors

Model size

4B params

Tensor type

F16

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for CompactAI/Qwen3-4B-html-light-prune

Base model

Qwen/Qwen3-4B-Base

Finetuned

Qwen/Qwen3-4B

Finetuned

(442)

this model

Collection including CompactAI/Qwen3-4B-html-light-prune

Qwen3-4B

Collection

Collection of pruned models based on Qwen/Qwen3-4B • 15 items • Updated about 16 hours ago