Text Generation
Transformers
Safetensors
GGUF
Azerbaijani
gpt2
text-generation-inference
How to use from
llama.cpp
# Gated model: Login with a HF token with gated access permission
hf auth login
Install from brew
brew install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf allmalab/gpt2-aze
# Run inference directly in the terminal:
llama-cli -hf allmalab/gpt2-aze
Install from WinGet (Windows)
winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf allmalab/gpt2-aze
# Run inference directly in the terminal:
llama-cli -hf allmalab/gpt2-aze
Use pre-built binary
# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases
# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf allmalab/gpt2-aze
# Run inference directly in the terminal:
./llama-cli -hf allmalab/gpt2-aze
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli
# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf allmalab/gpt2-aze
# Run inference directly in the terminal:
./build/bin/llama-cli -hf allmalab/gpt2-aze
Use Docker
docker model run hf.co/allmalab/gpt2-aze
Quick Links

You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

Azerbaijani GPT2 Model

The model is based on the GPT-2 architecture, specifically trained on Azerbaijani text. It serves as one of the first foundational models designed to generate and understand Azerbaijani language content. Built with the autoregressive transformer decoder architecture, the model generates text token by token, predicting the next word based on the input context.

  • Developed by : aLLMA Lab
  • Funded by : PRODATA LLC
  • Model type: Decoder-only foundational LLM
  • Language: Azerbaijani

Uses

The model can be directly used for text generation, sentence completion, next token prediction tasks by providing an input prompt. Additionally, it can be fine-tuned on an Azerbaijani instruction dataset to develop an interactive question-answering model.

Training Details

context_window=1024
stride=512

lr=1e-3
warmup_steps = 10000
weight_decay=0.1,
adam_beta1 = 0.9,
adam_beta2 = 0.999
batch_size=512
max_steps=178000
Downloads last month
3
Safetensors
Model size
0.1B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for allmalab/gpt2-aze

Quantized
(90)
this model
Finetunes
1 model

Datasets used to train allmalab/gpt2-aze