GGUF
conversational
How to use from the
Use from the
llama-cpp-python library
# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="EssentialAI/rnj-1-instruct-GGUF",
	filename="Rnj-1-Instruct-8B-Q4_K_M.gguf",
)
llm.create_chat_completion(
	messages = "No input example has been defined for this model task."
)

This is a GGUF-formatted checkpoint of rnj-1-instruct suitable for use in llama.cpp, Ollama, or others. This has been quantized with the Q4_K_M scheme, which results in model weights of size 4.8GB.

For llama.cpp, install (after version 7328, e.g., on Mac OSX brew install llama.cpp) and run either of these commands:

llama-cli -hf EssentialAI/rnj-1-instruct-GGUF
llama-server -hf EssentialAI/rnj-1-instruct-GGUF -c 0 # and open browser to localhost:8080

For Ollama, install (after version v0.13.3 -- versions can be found here) and run:

ollama run rnj-1
Downloads last month
151
GGUF
Model size
8B params
Architecture
rnj1
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including EssentialAI/rnj-1-instruct-GGUF