Marco Marten commited on
Update README.md
Browse files
README.md
CHANGED
|
@@ -2,4 +2,36 @@
|
|
| 2 |
license: apache-2.0
|
| 3 |
---
|
| 4 |
|
| 5 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 2 |
license: apache-2.0
|
| 3 |
---
|
| 4 |
|
| 5 |
+
# FireRed-Image-Edit-1.0 Quantizations (FP8 & NVFP4)
|
| 6 |
+
|
| 7 |
+
This repository provides quantized versions of the **FireRed-Image-Edit-1.0** model, optimized for reduced VRAM usage while maintaining high performance.
|
| 8 |
+
|
| 9 |
+
We provide weights in **FP8** and the cutting-edge **NVFP4** (Nvidia FP4) formats, making it easier to run this powerful image editing model on a wider range of hardware.
|
| 10 |
+
|
| 11 |
+
## Available Versions
|
| 12 |
+
|
| 13 |
+
| File Name | Format | Size | Description |
|
| 14 |
+
|-----------|--------|------|-------------|
|
| 15 |
+
| `FireRed-Image-Edit-1.0_FP8.safetensors` | FP8 | ~20.4 GB | Standard 8-bit quantization for balanced performance. |
|
| 16 |
+
| `FireRed-Image-Edit-1_NVFP4.safetensors` | NVFP4 | ~11.6 GB | Highly compressed 4-bit format optimized for NVIDIA Blackwell/Ada architecture. |
|
| 17 |
+
|
| 18 |
+
## About FireRed-Image-Edit
|
| 19 |
+
|
| 20 |
+
FireRed-Image-Edit is a state-of-the-art model designed for precise and high-quality image editing based on textual instructions. By using these quantized versions, you can:
|
| 21 |
+
- **Reduce VRAM consumption** significantly (especially with the NVFP4 version).
|
| 22 |
+
- **Speed up inference** on compatible hardware.
|
| 23 |
+
- **Run the model on consumer GPUs** that might otherwise lack the memory for the full-precision weights.
|
| 24 |
+
|
| 25 |
+
## Usage
|
| 26 |
+
|
| 27 |
+
You can use these weights with popular frameworks like `ComfyUI` or `diffusers` (ensure you have the necessary libraries installed for FP8/FP4 support).
|
| 28 |
+
|
| 29 |
+
### Requirements
|
| 30 |
+
- For **NVFP4**: Requires latest NVIDIA drivers and hardware support (Ada Lovelace or newer recommended).
|
| 31 |
+
- For **FP8**: Compatible with NVIDIA 30nd/40th series and latest `torch`/`accelerate` versions.
|
| 32 |
+
|
| 33 |
+
## Installation
|
| 34 |
+
|
| 35 |
+
```bash
|
| 36 |
+
# Example: Download using huggingface-cli
|
| 37 |
+
huggingface-cli download LuceroAI/quants --local-dir ./models/fire-red-quants
|