Instructions to use TheBloke/NewHope-GGML with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use TheBloke/NewHope-GGML with Transformers:
```python
# Load model directly
from transformers import AutoModel

model = AutoModel.from_pretrained("TheBloke/NewHope-GGML", dtype="auto")
```
For the GGML files themselves, a llama.cpp-based loader is typically used instead; a sketch follows this list.
- Notebooks
- Google Colab
- Kaggle
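As a minimal local-app sketch (not from the model card): assuming one of the quantised `.bin` files has been downloaded and a GGML-era release of llama-cpp-python is installed, the file can be loaded directly. The file name below is a placeholder.

```python
# Sketch: loading a GGML quant locally with llama-cpp-python (llama.cpp bindings).
# GGML v3 files need a llama-cpp-python release from before the GGUF format switch.
from llama_cpp import Llama

llm = Llama(
    model_path="./newhope.ggmlv3.q4_K_M.bin",  # placeholder name for a file from TheBloke/NewHope-GGML
    n_ctx=2048,                                # context window
)

output = llm("Q: What is the capital of France? A:", max_tokens=32)
print(output["choices"][0]["text"])
```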
Should I expect lower accuracy than with the original model?
#2
by YairFr - opened
But sending the same prompt, once to the original model using LlamaForCausalLM.from_pretrained,
and once via the GGML model wrapped with LlamaCpp and used in LangChain's AgentExecutor, I get different (and worse) results.
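For reference, a minimal sketch of the two setups being compared might look like the following; the repo id, GGML file name, and generation parameters are placeholders rather than what was actually used, and the AgentExecutor wiring is left out for brevity.

```python
# Sketch of the two setups: fp16 original via transformers vs. GGML quant via LangChain's LlamaCpp.
from transformers import LlamaForCausalLM, LlamaTokenizer
from langchain.llms import LlamaCpp

prompt = "Write a Python function that reverses a string."

# 1) Original fp16 model via transformers
tokenizer = LlamaTokenizer.from_pretrained("original-org/NewHope")               # placeholder repo id
model = LlamaForCausalLM.from_pretrained("original-org/NewHope", device_map="auto")
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=256, do_sample=False)              # greedy, repeatable baseline
print(tokenizer.decode(out[0], skip_special_tokens=True))

# 2) GGML quant via llama.cpp, wrapped for LangChain
llm = LlamaCpp(
    model_path="./newhope.ggmlv3.q4_K_M.bin",  # placeholder local file
    n_ctx=4096,
    temperature=0.0,                           # match the greedy baseline as closely as possible
    max_tokens=256,
)
print(llm(prompt))
```

Even with the prompt held fixed, some divergence is expected: the GGML quants are lossy, and the two stacks have different default sampling settings, so pinning the generation parameters on both sides makes the comparison more direct.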
Can you provide an example of those differences?
What I wrote here https://huggingface.co/TheBloke/Llama-2-13B-chat-GGML/discussions/9 is also relevant for this model.