Velari-Level2-Qwen3-4B β€” Intermediate Checkpoint

⚠️ This is an early/intermediate checkpoint. The final, recommended model is Aslam-13/aelora-qwen3-4b β€” use that one unless you specifically want to study how the project evolved.

This checkpoint covers only Level 1 (Velari language) + Level 2 (Thex-Kron base-8 math) of a larger 5-level project. It does not contain logic rules (Vel-Rith), advanced grammar (comparatives, superlatives, imperatives, possession), or world/lore knowledge β€” those were added in subsequent training runs.

Why a synthetic world? Benchmarks leak into pre-training. By fine-tuning on a domain that provably cannot exist in any pre-training corpus, every correct answer is unambiguous evidence of learning β€” not retrieval from prior knowledge.

It's kept public because the iteration history is part of the story: training in stages let me observe catastrophic forgetting in real time and validate that combining Level 1 + Level 2 data in a single run prevented the model from "losing" Velari vocabulary while learning math.


What this checkpoint knows

Domain Coverage
Velari β€” vocabulary 30-word lexicon (nouns, verbs, pronouns, adjectives)
Velari β€” grammar (basic) Plurals (-an), past tense (ta-), negation (ne), basic SVO sentences
Thex-Kron math Base-8 numerals (nul, ek, doi, tri … hep, ek-nul …), addition with carry, basic multiplication

Does NOT know: Vel-Rith logic rules, comparatives/superlatives, imperatives, possession, world/society lore, multi-step word problems. For those use the final model.


Why a base-8 system?

The base model (Qwen3-4B) "knows" base-10 from pre-training extremely well. Teaching it that 5 + 3 = 10 (in Aeloran base-8) directly contradicts that prior. If this checkpoint correctly answers fen vor tri = ek-nul (5 + 3 = 10β‚ˆ), it's evidence that LoRA fine-tuning successfully overrode a deeply-baked behavior β€” not just memorized a surface pattern.


Training details (this checkpoint)

Base model unsloth/qwen3-4b-unsloth-bnb-4bit
Method LoRA (4-bit) via Unsloth + TRL SFTTrainer
Dataset Combined Level 1 (language) + Level 2 (math) JSONL
License Apache 2.0

Full hyperparameters and training logs are in the GitHub repo.


Quick start

Option A β€” with Unsloth (fastest, requires GPU)

from unsloth import FastLanguageModel

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name = "Aslam-13/velari-level2-qwen3-4b",
    max_seq_length = 2048,
    load_in_4bit = True,
)
FastLanguageModel.for_inference(model)

prompt = "What is fen vor tri in Aeloran math?"
inputs = tokenizer(prompt, return_tensors="pt").to("cuda")
out = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(out[0], skip_special_tokens=True))

Option B β€” with plain transformers

from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

model_id = "Aslam-13/velari-level2-qwen3-4b"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

prompt = "What is fen vor tri in Aeloran math?"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(out[0], skip_special_tokens=True))

Expected output (approximate): "ek-nul" (5 + 3 = 8 in base-10 β†’ 10 in base-8 β†’ ek-nul). Exact phrasing may vary.


Limitations

  • Intermediate checkpoint, superseded by aelora-qwen3-4b.
  • No logic, no advanced grammar, no world knowledge.
  • Single-domain research artifact β€” not a general-purpose model.

Trained with Unsloth.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support