Spaces:

tokutsu
/

exllamav2_patch

Running

exllamav2_patch / README.md

tokutsu

Update README to suppress comments

27051a6 10 months ago

1.28 kB

	---
	title: Exllamav2 Patch
	sdk: static
	license: mit
	short_description: Patch for ExLlamaV2 to support Unigram-based tokenizers
	---

	# Unigram Tokenizer Patch for ExLlamaV2

	This repository provides a patch to support Unigram-based tokenizers with `ExLlamaV2`.
	By default, using a Unigram tokenizer may result in an error during inference. This patch resolves that issue.

	## Files

	- `hf.py.patch`
	A patch file that modifies `hf.py` in the ExLlamaV2 repository to support Unigram tokenizers.

	- `apply_patch.sh`
	A shell script to apply the patch.

	## Usage

	You can apply the patch with the following command:

	```bash
	$ ./apply_patch.sh EXLLAMAV2_DIR
	```

	Replace EXLLAMAV2_DIR with the path to your local clone of ExLlamaV2.

	Example:
	```bash
	$ ./apply_patch.sh ~/repos/exllamav2
	```

	## Purpose

	The patch is intended to:
	- Prevent runtime errors when using tokenizers.models.Unigram.
	- Add fallback handling for missing unk_token attributes.
	- Improve compatibility with models trained using SentencePiece + Unigram tokenizers.

	## Notes

	- This patch is intended to be applied to the hf.py file in the ExLlamaV2 repository.
	- Tested Versions: v0.2.6 through v0.2.8
	- If the patch fails, please ensure your copy of hf.py matches the original before applying.