Llama-cpp-python code:

from llama_cpp import Llama
from huggingface_hub import snapshot_download

# load model
snapshot_download(repo_id="ichrnkv/t_lite_1.0_gguf", local_dir="./")

# llama cpp model
model = Llama(
    model_path="./model.gguf",
    verbose=True,
    n_gpu_layers=-1,
    seed=42
)

Downloads last month: 4

GGUF

Model size

8B params

Architecture

qwen2

Hardware compatibility

We're not able to determine the quantization variants.

View all variants

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for ichrnkv/t_lite_1.0_gguf

Base model

t-tech/T-lite-it-1.0

Quantized

(17)

this model