Update README.md
Browse files
README.md
CHANGED
|
@@ -81,4 +81,18 @@ Work in progress.
|
|
| 81 |
## Limitations
|
| 82 |
|
| 83 |
- The model is only runnable on NVIDIA GPUs with architectures sm_86 (Ampere: RTX 3090, A6000), sm_89 (Ada: RTX 4090), and sm_80 (A100). See this [issue](https://github.com/mit-han-lab/nunchaku/issues/1) for more details.
|
| 84 |
-
- You may observe some slight differences from the BF16 models in details.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 81 |
## Limitations
|
| 82 |
|
| 83 |
- The model is only runnable on NVIDIA GPUs with architectures sm_86 (Ampere: RTX 3090, A6000), sm_89 (Ada: RTX 4090), and sm_80 (A100). See this [issue](https://github.com/mit-han-lab/nunchaku/issues/1) for more details.
|
| 84 |
+
- You may observe some slight differences from the BF16 models in details.
|
| 85 |
+
|
| 86 |
+
### Citation
|
| 87 |
+
|
| 88 |
+
If you find this model useful or relevant to your research, please cite
|
| 89 |
+
|
| 90 |
+
```bibtex
|
| 91 |
+
@article{
|
| 92 |
+
li2024svdquant,
|
| 93 |
+
title={SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models},
|
| 94 |
+
author={Li*, Muyang and Lin*, Yujun and Zhang*, Zhekai and Cai, Tianle and Li, Xiuyu and Guo, Junxian and Xie, Enze and Meng, Chenlin and Zhu, Jun-Yan and Han, Song},
|
| 95 |
+
journal={arXiv preprint arXiv:2411.05007},
|
| 96 |
+
year={2024}
|
| 97 |
+
}
|
| 98 |
+
```
|