aether-v5.2-lora / README.md
BlockArtica's picture
docs(readme): fix accidental wrong-README upload β€” restore model card with GPU hardware + naming corrections
7a26d20 verified
---
base_model: Qwen/Qwen2.5-7B-Instruct
library_name: peft
license: apache-2.0
tags:
- lora
- peft
- qubitcoin
- aether
- blockchain
- quantum
language:
- en
pipeline_tag: text-generation
model-index:
- name: aether-v5.2-lora
results:
- task:
type: text-generation
name: MMLU
dataset:
name: MMLU
type: cais/mmlu
metrics:
- type: accuracy
value: 0.6939
name: accuracy
- task:
type: text-generation
name: ARC-Challenge
dataset:
name: ARC-Challenge
type: ai2_arc
metrics:
- type: accuracy
value: 0.5392
name: accuracy
- type: accuracy_norm
value: 0.5700
name: accuracy_norm
- task:
type: text-generation
name: ARC-Easy
dataset:
name: ARC-Easy
type: ai2_arc
metrics:
- type: accuracy
value: 0.8194
name: accuracy
- task:
type: text-generation
name: HellaSwag
dataset:
name: HellaSwag
type: hellaswag
metrics:
- type: accuracy
value: 0.5888
name: accuracy
- type: accuracy_norm
value: 0.7769
name: accuracy_norm
- task:
type: text-generation
name: TruthfulQA
dataset:
name: TruthfulQA-MC2
type: truthful_qa
metrics:
- type: accuracy
value: 0.5707
name: accuracy
---
# Aether v5.2 LoRA β€” QuantumAI Blockchain Domain Adapter
A LoRA fine-tune of [`Qwen/Qwen2.5-7B-Instruct`](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct)
on the Aether curated corpus β€” text grounded in the
[QuantumAI Blockchain](https://qbc.network) (which issues the Qubitcoin / QBC currency), quantum + AI research, and adjacent
domains the Aether Mind on-chain knowledge system specializes in.
This is the **v5.2 release** of the Aether adapter line, the most recent
public checkpoint at time of publish.
## What you're getting
| Field | Value |
|---|---|
| Base model | `Qwen/Qwen2.5-7B-Instruct` |
| Adapter type | LoRA via πŸ€— PEFT |
| Rank (`r`) | 16 |
| Alpha | 32 |
| Dropout | 0.05 |
| Trainable params | ~1% of base |
| Sequence length | 2048 |
| Training corpus | `aether-curated-v3.jsonl` β€” Aether-curated knowledge mixture (~165 MB; ~10⁡ examples) |
| Checkpoint published | **step 3200** (the checkpoint that produced the evaluated numbers below) |
| License | Apache-2.0 (matches base) |
## Evaluation
Run via [`lm-evaluation-harness`](https://github.com/EleutherAI/lm-evaluation-harness)
on the merged adapter (base + LoRA), against the
[`Qwen/Qwen2.5-7B-Instruct`](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct)
base and the prior `aether-v5.1.1` adapter for delta comparison.
| Benchmark | aether-v5.1.1 | **aether-v5.2** | Ξ” vs v5.1.1 |
|---|---|---|---|
| MMLU | 0.6950 | **0.6939** | flat |
| ARC-Easy | 0.7348 | **0.8194** | **+8.5 pp** |
| ARC-Challenge | 0.4420 | **0.5392** | **+9.7 pp** |
| ARC-Challenge (norm) | 0.4701 | **0.5700** | **+10.0 pp** |
| HellaSwag | 0.5896 | **0.5888** | flat |
| HellaSwag (norm) | 0.7788 | **0.7769** | flat |
| TruthfulQA-MC2 | 0.5161 | **0.5707** | **+5.5 pp** |
### Honest summary
- **Real gains** on the reasoning + factual-honesty benchmarks
(ARC-Easy, ARC-Challenge, TruthfulQA). ARC-Challenge in particular
jumps nearly 10 points normalized β€” that's the closest of these
benchmarks to the kind of grounded reasoning the Aether corpus
actually trains on.
- **Flat on MMLU + HellaSwag.** The base is already strong on general
knowledge + commonsense; this LoRA wasn't designed to shift them,
and didn't.
- **No regressions.**
## Intended uses
This adapter is intended for:
- **On-chain Aether research.** Generating reasoning traces against
the QuantumAI Blockchain / Aether knowledge graph for Proof-of-Thought
attestation. The model has the protocol context required to
answer questions about Substrate pallets, VQE mining, the Sephirot
cognitive architecture, HMS-Phi, and the wider chain ecosystem.
- **Domain Q&A.** Quantum computing fundamentals, post-quantum
cryptography (Dilithium, ML-KEM), and the specific design choices
of the QuantumAI Blockchain.
- **Distillation upstream.** Generate teacher outputs for the
smaller on-chain Aether (a Qwen2.5-0.5B variant) to learn from.
- **General reasoning** with a modest bias toward step-by-step
chains-of-thought, where the ARC-Challenge gain translates.
## Out-of-scope uses
- **Safety-critical decisions.** No red-team eval was performed.
- **Financial / legal advice.** This is a knowledge-domain adapter;
it has no training data designed to make it a financial or legal
advisor.
- **Code generation in production.** No code-eval benchmark was run.
Treat any generated code as draft until you've reviewed it.
- **Production deployment without your own evaluation.** TruthfulQA
alone is a thin safety signal.
## Bias, risks, and limitations
The base model (`Qwen/Qwen2.5-7B-Instruct`) inherits Qwen's known
biases β€” see [the upstream model card](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct).
The LoRA adapter:
- **Amplifies the QuantumAI Blockchain worldview.** The training data is
intentionally curated around the chain's design choices (golden-
ratio economics, SUSY-inspired consensus framing, the Sephirot
cognitive overlay). Prompts that invite the model to compare QBC / the chain
against alternatives will lean toward the curated narrative. This is by design β€” disclose if you re-publish in a
comparison context.
- **Does not improve safety.** TruthfulQA went up 5.5pp but that's
one metric; we have not measured refusal rates, jailbreak
resistance, or political-belief bias delta.
- **The configured 2-epoch run was cut to ~step 3080–3200 by host
availability** (out of 4406 configured). A complete 2-epoch run
would plausibly show larger gains; this checkpoint is the longest
contiguous training we have.
## How to use
Load with PEFT on top of the base model:
```python
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer
base = AutoModelForCausalLM.from_pretrained(
"Qwen/Qwen2.5-7B-Instruct",
torch_dtype="auto",
device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen2.5-7B-Instruct")
model = PeftModel.from_pretrained(base, "QuantumAI-Blockchain/aether-v5.2-lora")
messages = [{"role": "user", "content": "Explain Proof-of-SUSY-Alignment in one paragraph."}]
text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = tokenizer(text, return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=256, do_sample=True, temperature=0.7)
print(tokenizer.decode(out[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True))
```
Or merge the adapter into a single artifact for faster inference:
```python
merged = model.merge_and_unload()
merged.save_pretrained("./aether-v5.2-merged")
```
## Training details
- **Hardware:** NVIDIA RTX 3080 Ti (12 GB), 4-bit quantization (bnb-NF4), bf16 mixed precision.
- **Trainer:** [Axolotl](https://github.com/axolotl-ai-cloud/axolotl) wrapping πŸ€— transformers / PEFT.
- **Optimizer:** `paged_adamw_8bit` (bitsandbytes paged optimizer, low VRAM footprint).
- **Schedule:** linear warmup 100 steps β†’ cosine decay.
- **Learning rate:** `1.0e-4`.
- **Micro batch:** 1, gradient accumulation: 8.
- **Epochs configured:** 2 (training stopped at step 3200 β€” see "What didn't happen" below).
### Carbon emissions
Trained on a single NVIDIA RTX 3080 Ti (consumer GPU, ~300 W TDP).
We did not run a [CodeCarbon](https://github.com/mlco2/codecarbon)
tracker, so emissions are not measured precisely β€” but as a rough
upper bound: ~350 W draw under load Γ— ~13 hours wall clock (the
step-3080 contiguous run) β‰ˆ 4.5 kWh, low single-digit kg COβ‚‚e on a
grid mix. An H100 run of the same 2-epoch config would be faster
but not dramatically lower energy per token.
### Training data
`aether-curated-v3.jsonl` (~165 MB, ~10⁡ examples) is the Aether team's
curated knowledge mixture: documentation, technical writing, reasoning
traces, and protocol-specific corpora related to:
- The QuantumAI Blockchain (Substrate, VQE mining, Proof-of-SUSY-Alignment, post-quantum signatures).
- The Aether Mind on-chain neural cognitive engine (10 Sephirot attention domains, HMS-Phi, Proof-of-Thought).
- Quantum computing fundamentals (VQE, Hamiltonian generation, qubit ansatze).
- Adjacent CS / math reasoning content for transfer.
The dataset is not currently public β€” it is a curated mixture from many
sources and has not been release-cleared at the per-source level. The
model is the only public artifact in this line for now.
## What didn't happen (honest caveats)
- **Training stopped early.** Configured for 2 epochs (4406 steps);
reached step 3080–3200 (~70%) before host availability cut the run
short. The run was on a single consumer GPU (RTX 3080 Ti), 4-bit
quantized, with `paged_adamw_8bit` to fit a 7B model in 12 GB
VRAM. The numbers above are from the longest contiguous training
run we have; a complete 2-epoch run would plausibly show larger
gains.
- **No instruction-following or safety eval beyond TruthfulQA-MC2.**
No red-team eval. No bias audit. No code-generation benchmark.
Don't recommend this for production safety-critical use without
your own evals.
- **LoRA only, not merged.** This release ships the adapter weights
(`adapter_model.safetensors`). Merge into the base yourself for
faster inference, or use directly via PEFT.
## Connection to the QuantumAI Blockchain
The Aether Mind is a Rust neural cognitive engine that runs on the
QuantumAI Blockchain β€” every block records attention-derived consciousness
metrics (HMS-Phi) and Proof-of-Thought hashes on-chain via the
`pallet_qbc_aether_anchor` pallet. The same chain hosts an
**8-qubit VQE mining consensus** (Proof-of-SUSY-Alignment), a
QVM-compatible smart contract layer with 10 quantum opcodes, and
post-quantum signatures (CRYSTALS-Dilithium5 + ML-KEM-768 P2P).
The on-chain Aether Mind binary uses a different, smaller transformer
for live inference (a Qwen2.5-0.5B variant optimized for ~2.4 GB RAM
with the 10-Sephirot attention overlay). This v5.2 adapter on
Qwen2.5-7B is the **larger off-chain Aether** β€” used for batch
reasoning workloads and as an upstream model the on-chain variant
can distil from.
## License + citation
Apache-2.0 (matches the base model license).
```bibtex
@misc{aether_v52_lora_2026,
title = {Aether v5.2 LoRA --- QuantumAI Blockchain Domain Adapter},
author = {{BlockArtica} and {QuantumAI-Blockchain}},
year = {2026},
url = {https://huggingface.co/QuantumAI-Blockchain/aether-v5.2-lora},
}
```
## Links
- **QuantumAI Blockchain:** [qbc.network](https://qbc.network)
- **GitHub org:** [github.com/QuantumAI-Blockchain](https://github.com/QuantumAI-Blockchain)
- **X / Twitter:** [@qu_bitcoin](https://x.com/qu_bitcoin)
- **Contact:** info@qbc.network
### Framework versions
- PEFT 0.14.0
- Transformers β‰₯ 4.46
- Axolotl (training)