Pidgn Rufus Model

A Llama 3.2 3B Instruct fine-tune exported as a single-file GGUF (F16) checkpoint, intended for conversational use in Nigerian Pidgin English (pcm).

Overview

This repository packages a fine-tuned version of Meta's Llama 3.2 3B Instruct model, converted to the GGUF format for use with llama.cpp-based runtimes (e.g. Ollama, LM Studio, llama.cpp itself). Based on the repository name and the included Ollama Modelfile, the model is intended to generate and respond to text in Nigerian Pidgin English, an English-based creole widely spoken in Nigeria.

The training data, dataset composition, and evaluation results are not published in this repository, so no claims are made here beyond what can be verified from the files present (config.json, Modelfile, README.md).

Training Details

| Detail | Value |
| --- | --- |
| Base model | meta-llama/Llama-3.2-3B-Instruct (verified via Modelfile and architecture in config.json: LlamaForCausalLM, hidden size 3072, 28 layers, 24 attention heads, 8 KV heads, vocab size 128,256) |
| Fine-tuning framework | Unsloth (evidenced by unsloth_version and unsloth_fixed fields in config.json) |
| Final training loss | 0.7410 (as reported in the prior model card) |
| Export format | GGUF, F16 precision, merged single-file checkpoint (llama-3.2-3b-instruct.F16.gguf) |

Training details such as dataset size and source, number of epochs or steps, learning rate, and evaluation metrics are not included in this repository (no trainer_state.json, training_args.bin, or dataset card is present), so they are omitted here rather than estimated.
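The architecture values listed under Training Details can be re-checked against config.json with a short script. This is a minimal sketch, assuming config.json uses the standard Hugging Face key names (hidden_size, num_hidden_layers, and so on) and sits in the current directory; the helper name find_mismatches is hypothetical.

```python
import json
from pathlib import Path

# Values taken from the Training Details section of this card.
EXPECTED = {
    "hidden_size": 3072,
    "num_hidden_layers": 28,
    "num_attention_heads": 24,
    "num_key_value_heads": 8,
    "vocab_size": 128256,
}


def find_mismatches(config: dict) -> dict:
    """Return {key: (expected, actual)} for every value that differs."""
    return {
        key: (want, config.get(key))
        for key, want in EXPECTED.items()
        if config.get(key) != want
    }


if __name__ == "__main__":
    path = Path("config.json")  # assumed location; adjust as needed
    if path.exists():
        config = json.loads(path.read_text(encoding="utf-8"))
        print("architectures:", config.get("architectures"))
        print("mismatches:", find_mismatches(config) or "none")
    else:
        print("config.json not found; download it from the repository first")
```

An empty result means the file agrees with the values stated in this card; any entry in the result shows the expected value next to the one actually found.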

Intended Use

  • Conversational text generation in Nigerian Pidgin English (and English, inherited from the base model).
  • Local/offline inference via GGUF-compatible runtimes such as Ollama or llama.cpp.
  • Experimentation and prototyping for Pidgin-language chatbots, assistants, or translation-adjacent tools.

This model is a fine-tune of an instruct model and has not been evaluated for production, safety-critical, or high-stakes deployments.

How to Use

Option 1: Ollama

A ready-made Modelfile is included in this repository (built on the Llama 3 chat template with <|start_header_id|>/<|eot_id|> tokens). To run locally with Ollama:

# Download the GGUF file and the Modelfile from this repo, then:
ollama create pidgn-rufus -f Modelfile
ollama run pidgn-rufus
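
Once the model has been created, it can also be queried from a script through Ollama's local HTTP API. The following is a sketch only, assuming the Ollama server is running at its default address (http://localhost:11434) and that the model was created under the name pidgn-rufus as in the commands above; the helper names build_chat_request and chat are hypothetical.

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/chat"  # assumed default local address


def build_chat_request(prompt: str, model: str = "pidgn-rufus") -> dict:
    """Build a non-streaming /api/chat payload with a single user turn."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "stream": False,
    }


def chat(prompt: str, model: str = "pidgn-rufus") -> str:
    """Send one user message and return the assistant's reply text."""
    body = json.dumps(build_chat_request(prompt, model)).encode("utf-8")
    request = urllib.request.Request(
        OLLAMA_URL, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request, timeout=120) as response:
        return json.loads(response.read())["message"]["content"]


if __name__ == "__main__":
    try:
        print(chat("How you dey today?"))
    except OSError as exc:  # e.g. the Ollama server is not running
        print(f"Could not reach Ollama: {exc}")
```

Setting "stream" to False asks the server for one complete JSON response instead of a sequence of partial chunks, which keeps the parsing to a single json.loads call.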

Option 2: llama.cpp

./llama-cli -m llama-3.2-3b-instruct.F16.gguf -p "How you dey today?"
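
Whether the -p text is wrapped in a chat template depends on the llama.cpp version and flags in use. For cases where the prompt has to be formatted by hand, the Llama 3 chat layout (the <|start_header_id|>/<|eot_id|> tokens mentioned in the Ollama section) can be sketched as a small string builder. The function name and the optional system message are hypothetical, and the Modelfile in this repository may define its own system prompt.

```python
def format_llama3_prompt(user_message: str, system_message: str = "") -> str:
    """Wrap one user turn in the Llama 3 chat layout, ending at the assistant header."""
    parts = ["<|begin_of_text|>"]
    if system_message:
        parts.append(
            f"<|start_header_id|>system<|end_header_id|>\n\n{system_message}<|eot_id|>"
        )
    parts.append(
        f"<|start_header_id|>user<|end_header_id|>\n\n{user_message}<|eot_id|>"
    )
    # Leave the assistant header open so the model generates the reply.
    parts.append("<|start_header_id|>assistant<|end_header_id|>\n\n")
    return "".join(parts)


if __name__ == "__main__":
    print(format_llama3_prompt("How you dey today?"))
```

Some runtimes prepend <|begin_of_text|> on their own; in that case the first element should be dropped to avoid a duplicated start token.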

Option 3: transformers (GGUF loader)

# Requires: pip install transformers torch gguf
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Ephraimmm/Pidgn_Rufus_model"
filename = "llama-3.2-3b-instruct.F16.gguf"

# Passing gguf_file tells transformers to read the tokenizer and weights from the GGUF file.
tokenizer = AutoTokenizer.from_pretrained(model_id, gguf_file=filename)
model = AutoModelForCausalLM.from_pretrained(model_id, gguf_file=filename)

# Plain-text prompt; no chat template is applied in this example.
inputs = tokenizer("How you dey today?", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=100)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))

Limitations

  • No dataset card, training script, or evaluation results are published alongside this checkpoint, so its Pidgin-generation quality, coverage of dialectal variation, and safety behavior have not been independently verified.
  • Inherits the general limitations of the Llama 3.2 3B base model, including potential factual errors, hallucinations, and biases.
  • Only an F16 GGUF export is provided; no quantized variants (e.g. Q4/Q8) are currently included in this repository.
  • License terms follow Meta's Llama 3.2 Community License, which applies to derivative/fine-tuned models.
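
To put the F16-only limitation in concrete terms, weight storage can be approximated as parameters times bits per weight divided by 8. The sketch below assumes a nominal 3 billion parameters (taken from the model name, not an exact count) and the llama.cpp Q8_0 and Q4_0 block layouts, which store 32 weights plus one 16-bit scale per block. It ignores file metadata and any tensors kept at a different precision, so the figures are rough estimates, not measurements of files in this repository.

```python
BITS_PER_WEIGHT = {
    "F16": 16.0,
    "Q8_0": 8.5,  # 32 int8 weights + one 16-bit scale per block
    "Q4_0": 4.5,  # 32 4-bit weights + one 16-bit scale per block
}


def estimate_size_gb(n_params: float, fmt: str) -> float:
    """Approximate weight storage in GB (10^9 bytes) for a given format."""
    return n_params * BITS_PER_WEIGHT[fmt] / 8 / 1e9


if __name__ == "__main__":
    n_params = 3e9  # nominal "3B"; an assumption, not the exact count
    for fmt in BITS_PER_WEIGHT:
        print(f"{fmt}: ~{estimate_size_gb(n_params, fmt):.1f} GB")
```

Under these assumptions the F16 export needs about 6 GB for weights alone, while Q8_0 and Q4_0 exports would need roughly 3.2 GB and 1.7 GB.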

Author

Developed by Ephraimmm.
