Bordoglor's picture
Upload folder using huggingface_hub
92c1c00 verified

A newer version of the Gradio SDK is available: 6.1.0

Upgrade

X-LoRA examples

xlora_inference_mistralrs.py

Perform inference of an X-LoRA model using the inference engine mistral.rs.

Mistral.rs supports many base models besides Mistral, and can load models directly from saved LoRA checkpoints. Check out adapter model docs and the models support matrix.

Mistral.rs features X-LoRA support and incorporates techniques such as a dual-KV cache, continuous batching, Paged Attention, and optional non granular scalings, will allow vastly improved throughput.

Links: