Instructions to use TheBloke/LosslessMegaCoder-Llama2-13B-Mini-GGML with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use TheBloke/LosslessMegaCoder-Llama2-13B-Mini-GGML with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("TheBloke/LosslessMegaCoder-Llama2-13B-Mini-GGML", dtype="auto") - Notebooks
- Google Colab
- Kaggle
how to get it working with Exlama ?
#1
by DQ83 - opened
i get in chatbot_wrapper
stopping_strings = get_stopping_strings(state)
File "E:\oobabooga_windows\text-generation-webui\modules\chat.py", line 160, in get_stopping_strings
state['turn_template'].split('<|user-message|>')[1].split('<|bot|>')[0] + '<|bot|>',
which turn template could work ?
ah i see ChatML