Instructions to use Aslam-13/aelora-qwen3-4b with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Aslam-13/aelora-qwen3-4b with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("Aslam-13/aelora-qwen3-4b", dtype="auto") - Notebooks
- Google Colab
- Kaggle
- Local Apps
- Unsloth Studio new
How to use Aslam-13/aelora-qwen3-4b with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for Aslam-13/aelora-qwen3-4b to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for Aslam-13/aelora-qwen3-4b to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for Aslam-13/aelora-qwen3-4b to start chatting
Load model with FastModel
pip install unsloth from unsloth import FastModel model, tokenizer = FastModel.from_pretrained( model_name="Aslam-13/aelora-qwen3-4b", max_seq_length=2048, )
Aelora-Qwen3-4B β Fine-Tuned on a Fully Synthetic World
A LoRA fine-tune of Qwen3-4B taught a constructed knowledge domain that does not exist in any pre-training corpus: the planet Aelora β its language (Velari), its base-8 mathematics (Thex-Kron), and its custom logic system (Vel-Rith).
Why a synthetic world? Benchmarks leak into pre-training. By fine-tuning on a domain that provably cannot exist in any pre-training corpus, every correct answer is unambiguous evidence of learning β not retrieval from prior knowledge.
- π§ͺ Full code, datasets, RAG comparison & writeup: github.com/Aslam-13/Fine_tuning_RAG
- π Training notebook (Kaggle): kaggle.com/code/syed13/finetunerag
- π Earlier checkpoint (Level 2 only): Aslam-13/velari-level2-qwen3-4b
- π Datasets: huggingface.co/datasets/Aslam-13/Fine-tune-RAG
What this model knows
| Domain | Coverage |
|---|---|
| Velari β vocabulary | 30-word lexicon (nouns, verbs, pronouns, adjectives) |
| Velari β grammar | Plurals (-an), past (ta-), negation (ne), possession (-os), comparatives (vor-), superlatives (krath-), imperatives, SVO sentences |
| Thex-Kron math | Base-8 numerals (nul, ek, doi, tri, kal, fen, sai, hep, ek-nul β¦), addition with carry, multiplication, word problems |
| Vel-Rith logic | Element generation & destruction rules (e.g. krel + pael β fia, krel destroys zorak), single-rule and chained reasoning |
| World/lore | Aeloran society, regions, currency (Zolts), governance (Fen-Renan voting) |
It will not know anything about Aelora outside the training set, and should be treated as a research/demo artifact β not a general-purpose assistant.
Training details
| Base model | unsloth/qwen3-4b-unsloth-bnb-4bit |
| Method | LoRA (4-bit quant) via Unsloth + TRL SFTTrainer |
| Trainable params | 66,060,288 of 4,088,528,384 (1.62%) |
| Dataset size | 376 examples (JSONL, instruction/input/output) |
| Epochs | 12 |
| Effective batch size | 4 (batch 2 Γ grad_accum 2) |
| Total steps | 1,128 |
| Final training loss | 0.3397 |
| Training time | 226 min on a single Kaggle T4 |
| License | Apache 2.0 |
Evaluation
148-question held-out test suite covering all domains. Compared head-to-head against multiple RAG configurations on a 50-question capstone subset:
| System | Avg score | Perfect (1.0) | Zero (0.0) |
|---|---|---|---|
| L2 Semantic-chunk RAG | 0.280 | 12 | 34 |
| L2 Proposition-chunk RAG | 0.278 | 11 | 33 |
| Gemini 2.5 Flash + RAG | 0.506 | 20 | 20 |
| This model (FT-only) | 0.649 | 30 | 15 |
| This model + RAG | 0.620 | 27 | 15 |
Finding: the fine-tuned model alone outperforms RAG-augmented variants on this domain. RAG helped on lore/logic gaps but hurt on math/grammar where the model had already memorized the rules. Full breakdown in the GitHub repo.
Quick start
Option A β with Unsloth (fastest, requires GPU)
from unsloth import FastLanguageModel
model, tokenizer = FastLanguageModel.from_pretrained(
model_name = "Aslam-13/aelora-qwen3-4b",
max_seq_length = 2048,
load_in_4bit = True,
)
FastLanguageModel.for_inference(model)
prompt = "Translate to English: 'vel ne ta-vex zorak'"
inputs = tokenizer(prompt, return_tensors="pt").to("cuda")
out = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(out[0], skip_special_tokens=True))
Option B β with plain transformers (no Unsloth required)
from transformers import AutoModelForCausalLM, AutoTokenizer
import torch
model_id = "Aslam-13/aelora-qwen3-4b"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
model_id,
torch_dtype=torch.bfloat16,
device_map="auto",
)
prompt = "Translate to English: 'vel ne ta-vex zorak'"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(out[0], skip_special_tokens=True))
Expected output (approximate): "I did not see fire." β vel (I) + ne (not) + ta- (past tense prefix) + vex (see) + zorak (fire). Exact phrasing may vary.
Limitations
- Trained on a single small synthetic domain β does not generalize to real-world tasks.
- Multi-step base-8 word problems and chained logic (>2 rules) are the weakest areas.
- Keyword-overlap eval is crude; no human/LLM-judge evaluation was run.
- No hyperparameter sweep β LoRA rank/alpha picked by convention.
Citation
@misc{aelora_qwen3_4b_2026,
author = {Aslam-13},
title = {Aelora-Qwen3-4B: Fine-tuning Qwen3-4B on a fully synthetic constructed-knowledge domain},
year = {2026},
url = {https://huggingface.co/Aslam-13/aelora-qwen3-4b}
}
Trained with Unsloth for 2Γ faster LoRA training.
