Spaces:

compressed-llm
/

README

Running

jyhong836 commited on Dec 1, 2023

Commit

02a238d

1 Parent(s): cbb77d6

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -15,6 +15,12 @@ Credits to Ajay Jaiswal, Jinhao Duan, Zhenyu Zhang, Zhangheng Li, Lu Yin, Shiwei
 License: [MIT License](https://opensource.org/license/mit/)
 Setup environment
 ```shell
 pip install torch==2.0.0+cu117 torchvision==0.15.1+cu117 torchaudio==2.0.1 --index-url https://download.pytorch.org/whl/cu117
@@ -23,6 +29,8 @@ pip install accelerate
 pip install auto-gptq  # for gptq
 ```
 How to use pruned models
 ```python
 import torch

 License: [MIT License](https://opensource.org/license/mit/)
+Simplified lists:
+* Models: Llama-2 13b, Llama-2 chat 13b, Vicuna 13b v1.3
+* Compression methods:
+  - Pruning: Magnitude-based, Wanda, SparseGPT (2:4 semi-structured)
+  - Quantization: AWQ, GPTQ (3,4,8 bits)
 Setup environment
 ```shell
 pip install torch==2.0.0+cu117 torchvision==0.15.1+cu117 torchaudio==2.0.1 --index-url https://download.pytorch.org/whl/cu117
 pip install auto-gptq  # for gptq
 ```
+## How to use models
 How to use pruned models
 ```python
 import torch