---
license: apache-2.0
tags:
- slidesparse
- sparse
- quantization
- int8
- fp8
- llama
- qwen
---

# SlideSparse Checkpoints

Pre-converted sparse model checkpoints using the **SlideSparse** technique.

## Overview

This repository contains model weights converted with various sparsity configurations:

- **2:4** - Standard N:M sparsity (50% sparse)
- **2:6** - Extended sparsity (67% sparse)
- **2:8** - Higher sparsity (75% sparse)
- **2:10** - Maximum sparsity (80% sparse)
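
The percentages above are consistent with reading each `N:M` pattern as "keep N weights in every group of M", so the sparse fraction is `1 - N/M`. The short sketch below reproduces them under that assumption; the `sparsity` helper is hypothetical and is not part of any SlideSparse tooling.

```python
from fractions import Fraction


def sparsity(pattern: str) -> Fraction:
    """Return the fraction of zeroed weights for an 'N:M' pattern.

    Assumption: N is the number of weights kept in each group of M.
    """
    kept, group = (int(part) for part in pattern.split(":"))
    if not 0 < kept <= group:
        raise ValueError(f"invalid N:M pattern: {pattern!r}")
    return 1 - Fraction(kept, group)


if __name__ == "__main__":
    for pattern in ("2:4", "2:6", "2:8", "2:10"):
        print(f"{pattern}: {float(sparsity(pattern)):.0%} sparse")
```

Exact fractions are used so that 2:6 comes out as exactly 2/3 before it is rounded to 67% for display.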

## Models Included

| Base Model | Quantization | Sparsity Variants |
|------------|--------------|-------------------|
| Llama-3.2-1B | INT8, FP8 | 2:4, 2:6, 2:8, 2:10 |
| Llama-3.2-3B | INT8, FP8 | 2:4, 2:6, 2:8, 2:10 |
| Qwen2.5-7B | INT8, FP8 | 2:4, 2:6, 2:8, 2:10 |
| Qwen2.5-14B | INT8, FP8 | 2:4, 2:6, 2:8, 2:10 |

## Source Models

These checkpoints are derived from:

- [RedHatAI/Llama-3.2-1B-Instruct-quantized.w8a8](https://huggingface.co/RedHatAI/Llama-3.2-1B-Instruct-quantized.w8a8)
- [RedHatAI/Llama-3.2-3B-Instruct-quantized.w8a8](https://huggingface.co/RedHatAI/Llama-3.2-3B-Instruct-quantized.w8a8)
- [RedHatAI/Qwen2.5-7B-Instruct-quantized.w8a8](https://huggingface.co/RedHatAI/Qwen2.5-7B-Instruct-quantized.w8a8)
- [RedHatAI/Qwen2.5-14B-Instruct-quantized.w8a8](https://huggingface.co/RedHatAI/Qwen2.5-14B-Instruct-quantized.w8a8)

## License

- **Qwen models**: Apache 2.0
- **Llama models**: Please refer to [Meta's Llama license](https://llama.meta.com/llama3/license/)

## Usage

```bash
# Download all checkpoints
huggingface-cli download bcacdwk/slidesparse-checkpoints --local-dir ./checkpoints_slidesparse

# Download a specific model folder. Positional arguments after the repo ID are
# treated as file names, so a folder is selected with an --include glob instead.
huggingface-cli download bcacdwk/slidesparse-checkpoints --include "Llama3.2-1B-INT8-SlideSparse-2_4/*" --local-dir ./checkpoints_slidesparse
```
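
The same download can be scripted from Python. The sketch below is a minimal example, not the authors' tooling. It assumes that each checkpoint lives in a top-level folder of the repository, and that folder names follow the pattern of the single example above (`Llama3.2-1B-INT8-SlideSparse-2_4`). Both `checkpoint_name` and `download_checkpoint` are hypothetical helpers; only `snapshot_download` comes from the `huggingface_hub` library.

```python
REPO_ID = "bcacdwk/slidesparse-checkpoints"


def checkpoint_name(model: str, quant: str, pattern: str) -> str:
    """Build a folder name such as 'Llama3.2-1B-INT8-SlideSparse-2_4'.

    Assumption: every checkpoint follows the naming of the one example
    shown in this README, with ':' in the pattern replaced by '_'.
    """
    return f"{model}-{quant}-SlideSparse-{pattern.replace(':', '_')}"


def download_checkpoint(name: str, root: str = "./checkpoints_slidesparse") -> str:
    """Download one checkpoint folder into `root` and return the local path.

    Assumption: the checkpoint is stored as a top-level folder called `name`.
    """
    # Imported lazily so the naming helper works without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id=REPO_ID,
        allow_patterns=[f"{name}/*"],  # restrict the download to this folder
        local_dir=root,
    )


if __name__ == "__main__":
    name = checkpoint_name("Llama3.2-1B", "INT8", "2:4")
    print(download_checkpoint(name))
```

Note that the example folder name writes the model as `Llama3.2-1B`, without the hyphen used in the table above, so it is worth checking the repository file listing before relying on generated names.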

## Citation

If you use these checkpoints, please cite the SlideSparse paper (coming soon).