How to use from the
Use from the
llama-cpp-python library
# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="mrtechgarg/trio-nano",
	filename="trio-nano-q4_k_m.gguf",
)
llm.create_chat_completion(
	messages = "No input example has been defined for this model task."
)

trio-nano

trio-nano -- ultra-fast model for edge and mobile by trio.ai (3B parameters)

Install

pip install triobot
trio train --setup --model trio-nano
trio serve

Model Family

Model Params Use Case
trio-nano 3B Edge, mobile, instant
trio-small 4B Daily tasks
trio-medium 8B Coding, writing
trio-high 9B Advanced reasoning
trio-max 12B Best consumer GPU
trio-pro 30B MoE Pro workloads

Built by trio.ai | Apache 2.0

Downloads last month
-
GGUF
Model size
2B params
Architecture
llama
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support