---
license: apache-2.0
base_model: baidu/ERNIE-Image-Turbo
pipeline_tag: text-to-image
library_name: mlx-gen
tags:
- mlx
- mlx-gen
- mflux
- apple-silicon
- 8-bit
- ernie
- ernie-image
- ernie-image-turbo
---

# ernie-image-turbo-8bit

This repository contains MLX-Gen saved weights for `baidu/ERNIE-Image-Turbo`. The checkpoint is designed for local Apple Silicon inference with [`mlx-gen`](https://github.com/lpalbou/mlx-gen).

It uses the mflux/MLX saved-weight layout and MLX quantization tensors. It is not a Diffusers or Transformers `from_pretrained()` checkpoint.

## Source Model

Original model: [`baidu/ERNIE-Image-Turbo`](https://huggingface.co/baidu/ERNIE-Image-Turbo).

## License and Access

This quantized derivative follows the Apache 2.0 license of the source model.

## Quantization

This is an MLX q8 checkpoint for ERNIE Image Turbo. MLX-Gen applies 8-bit quantization to every ERNIE module that MLX can quantize and keeps the remaining parameters in BF16:

- q8 for quantizable ERNIE transformer modules.
- q8 for quantizable ERNIE text-encoder modules.
- q8 for quantizable ERNIE VAE attention modules.
- BF16 for norms, convolutions, and other non-quantizable parameters.

By contrast, ERNIE q4 checkpoints use a model-specific mixed q4/q8 policy, because fully q4 checkpoints can drift from BF16/q8 behavior.

See the [MLX-Gen quantization docs](https://github.com/lpalbou/mlx-gen/blob/main/docs/quantization.md) for compatibility notes and measured ERNIE q4/q8 behavior.

Prepared ERNIE folders contain the ordinary text-to-image generation stack. ERNIE Prompt Enhancer files are not bundled in this checkpoint.

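As a rough illustration of what q8 and q4 cost in storage, the sketch below estimates the effective bits per quantized weight. It assumes group-wise quantization in which every group of 64 weights shares one 16-bit scale and one 16-bit bias. This card states neither the group size nor the metadata width, so treat both as assumptions. The `bits_per_weight` helper is hypothetical and not part of MLX-Gen.

```bash
#!/usr/bin/env bash
# Rough storage cost of group-wise quantization, in bits per weight.
# ASSUMPTIONS (not stated in this model card): groups of 64 weights,
# each group storing one 16-bit scale and one 16-bit bias (32 bits total).
bits_per_weight() {
  local bits="$1" group_size="${2:-64}" meta_bits="${3:-32}"
  # Payload bits plus the per-group metadata spread over the group.
  awk -v b="$bits" -v g="$group_size" -v m="$meta_bits" \
    'BEGIN { printf "%.2f\n", b + m / g }'
}

bits_per_weight 8   # q8 modules
bits_per_weight 4   # q4 modules, for comparison
```

Under these assumptions the overhead is half a bit per weight at either precision. Parameters kept in BF16 still cost 16 bits each, so the checkpoint as a whole averages somewhat more than the quantized figure.
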
## Compatibility

Requires `mlx-gen >= 0.18.5`.

Generated with `mlx-gen 0.18.5`.

Use the `mlxgen` command and Python import path for new MLX-Gen projects.

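One way to verify the version requirement before generating is sketched below. It reads the installed version from `python -m pip show mlx-gen` and compares it with `sort -V`. The function names are hypothetical helpers, not part of MLX-Gen, and the comparison assumes plain numeric release versions such as `0.18.5`.

```bash
#!/usr/bin/env bash
# Succeed when version $1 is greater than or equal to version $2.
version_ge() {
  [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)" = "$2" ]
}

# Read the installed mlx-gen version from pip metadata (empty if not installed).
installed_mlx_gen_version() {
  python -m pip show mlx-gen 2>/dev/null | awk '/^Version:/ { print $2 }'
}

# Report whether the installed mlx-gen meets the minimum stated in this card.
check_mlx_gen() {
  local required="0.18.5" installed
  installed="$(installed_mlx_gen_version)"
  if [ -z "$installed" ]; then
    echo "mlx-gen is not installed"
    return 1
  fi
  if version_ge "$installed" "$required"; then
    echo "mlx-gen $installed is new enough"
  else
    echo "mlx-gen $installed is older than $required"
    return 1
  fi
}
```

Calling `check_mlx_gen` prints a one-line status and returns non-zero when the package is missing or too old, so it can gate a script with `check_mlx_gen || exit 1`.
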
## Usage

```bash
# Install or upgrade mlx-gen
python -m pip install -U mlx-gen

# Download the checkpoint
mlxgen download --model AbstractFramework/ernie-image-turbo-8bit

# Generate a 512x512 image in 8 steps
mlxgen generate \
  --model AbstractFramework/ernie-image-turbo-8bit \
  --prompt "Your prompt here" \
  --width 512 \
  --height 512 \
  --steps 8 \
  --guidance 1 \
  --seed 42 \
  --output image.png
```

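To compare several seeds for the same prompt, the command can be wrapped in a loop. The sketch below reuses only the flags and values from the usage example. The `generate_seeds` function, the `DRY_RUN` switch, and the `image-seed-N.png` naming are illustrative assumptions, not MLX-Gen features.

```bash
#!/usr/bin/env bash
# Print (DRY_RUN=1) or run one `mlxgen generate` call per seed.
MODEL="AbstractFramework/ernie-image-turbo-8bit"

generate_seeds() {
  local prompt="$1"; shift
  local seed
  for seed in "$@"; do
    # Same flags and values as the single-image example; only the seed
    # and the output file name change between iterations.
    local cmd=(mlxgen generate
      --model "$MODEL"
      --prompt "$prompt"
      --width 512 --height 512
      --steps 8 --guidance 1
      --seed "$seed"
      --output "image-seed-${seed}.png")
    if [ "${DRY_RUN:-0}" = "1" ]; then
      # Show the command instead of running it (quoting is not preserved).
      printf '%s\n' "${cmd[*]}"
    else
      "${cmd[@]}"
    fi
  done
}

# Dry run: prints three commands without invoking mlxgen.
DRY_RUN=1 generate_seeds "Your prompt here" 41 42 43
```

Dropping `DRY_RUN=1` runs the commands for real. Keeping everything except the seed fixed makes it easier to attribute differences between the images to the seed alone.
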
## Attribution

MLX-Gen is based on [mflux](https://github.com/filipstrand/mflux) by Filip Strand and the original mflux contributors. This model card is generated by MLX-Gen so derived checkpoints keep that attribution visible.

Quantized and contributed by [@lpalbou](https://huggingface.co/lpalbou).