33% pruning on RedPajama 3B linear layers

The pruned layers are:

  1. attention linear layers (query, key, value computation)
  2. attention dense layer
  3. MLP layers

Pruning is applied to these layers in every decoder block, using unstructured magnitude pruning: within each weight matrix, the 33% of weights with the smallest absolute value are set to zero. A sketch of the procedure is shown below.
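
As a rough illustration, the following sketch reproduces this kind of pruning with PyTorch's built-in `torch.nn.utils.prune` utilities. The base checkpoint name and the script itself are assumptions for demonstration, not the exact recipe used for this card; RedPajama 3B uses the GPT-NeoX architecture, where each decoder layer exposes a fused query/key/value projection, an attention output projection, and two MLP projections.

```python
import torch
import torch.nn.utils.prune as prune
from transformers import AutoModelForCausalLM

# Assumed base checkpoint (the card does not name the exact repo).
model = AutoModelForCausalLM.from_pretrained(
    "togethercomputer/RedPajama-INCITE-Base-3B-v1",
    torch_dtype=torch.float32,
)

# Walk every decoder block of the GPT-NeoX stack.
for layer in model.gpt_neox.layers:
    targets = [
        layer.attention.query_key_value,  # fused query/key/value projection
        layer.attention.dense,            # attention output (dense) layer
        layer.mlp.dense_h_to_4h,          # MLP up-projection
        layer.mlp.dense_4h_to_h,          # MLP down-projection
    ]
    for module in targets:
        # Unstructured magnitude (L1) pruning: zero the 33% of weights
        # with the smallest absolute value in this matrix.
        prune.l1_unstructured(module, name="weight", amount=0.33)
        # Bake the zeros into the weight tensor and drop the pruning mask.
        prune.remove(module, "weight")
```

After pruning, per-layer sparsity can be checked with `(module.weight == 0).float().mean()`, which should come out near 0.33 for each pruned matrix.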

Format: Safetensors
Model size: 3B params
Tensor types: F32, F16, I8