bnjmnmarie's picture
Create README.md
9b420ae verified
---
license: apache-2.0
base_model:
- Nanbeige/Nanbeige4.1-3B
tags:
- llm-compressor
---
<div align="center">
<img
src="https://cdn-uploads.huggingface.co/production/uploads/64b93e6bd6c468ac7536607e/mj6xac74jHGLqymiovObc.png"
alt="The Kaitchup -- AI on a Budget"
style="width: 100%; max-width: 100%; height: auto; display: inline-block; margin-bottom: 0.5em; margin-top: 0.5em;"
/>
<div style="display: flex; justify-content: center; gap: 0.5em; margin-bottom: 1em;">
<a href="https://kaitchup.substack.com/subscribe"><strong>Subscribe and Support</strong></a>
</div>
</div>
This is [Nanbeige/Nanbeige4.1-3B](https://huggingface.co/Nanbeige/Nanbeige4.1-3B) quantized with [llm-compressor](https://github.com/vllm-project/llm-compressor) to W8A8 (FP8) . The model is compatible with vLLM (tested: v0.15.1). Tested with an L4 (Google Colab).
- **Developed by:** [The Kaitchup](https://kaitchup.substack.com/)