Spaces:
Running
Running
| title: Exllamav2 Patch | |
| sdk: static | |
| license: mit | |
| short_description: Patch for ExLlamaV2 to support Unigram-based tokenizers | |
| # Unigram Tokenizer Patch for ExLlamaV2 | |
| This repository provides a patch to support **Unigram-based tokenizers** with `ExLlamaV2`. | |
| By default, using a Unigram tokenizer may result in an error during inference. This patch resolves that issue. | |
| ## Files | |
| - `hf.py.patch` | |
| A patch file that modifies `hf.py` in the ExLlamaV2 repository to support Unigram tokenizers. | |
| - `apply_patch.sh` | |
| A shell script to apply the patch. | |
| ## Usage | |
| You can apply the patch with the following command: | |
| ```bash | |
| $ ./apply_patch.sh EXLLAMAV2_DIR | |
| ``` | |
| Replace EXLLAMAV2_DIR with the path to your local clone of ExLlamaV2. | |
| Example: | |
| ```bash | |
| $ ./apply_patch.sh ~/repos/exllamav2 | |
| ``` | |
| ## Purpose | |
| The patch is intended to: | |
| - Prevent runtime errors when using tokenizers.models.Unigram. | |
| - Add fallback handling for missing unk_token attributes. | |
| - Improve compatibility with models trained using SentencePiece + Unigram tokenizers. | |
| ## Notes | |
| - This patch is intended to be applied to the hf.py file in the ExLlamaV2 repository. | |
| - Tested Versions: **v0.2.6** through **v0.2.8** | |
| - If the patch fails, please ensure your copy of hf.py matches the original before applying. | |