File size: 11,345 Bytes
a7265e1
7a26d20
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
a7265e1
 
7a26d20
a7265e1
7a26d20
 
 
 
a7265e1
7a26d20
 
a7265e1
7a26d20
3286b07
7a26d20
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
3286b07
 
7a26d20
a7265e1
7a26d20
 
 
a7265e1
 
7a26d20
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
3286b07
 
7a26d20
3286b07
7a26d20
 
 
 
3286b07
7a26d20
3286b07
7a26d20
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
---
base_model: Qwen/Qwen2.5-7B-Instruct
library_name: peft
license: apache-2.0
tags:
  - lora
  - peft
  - qubitcoin
  - aether
  - blockchain
  - quantum
language:
  - en
pipeline_tag: text-generation
model-index:
  - name: aether-v5.2-lora
    results:
      - task:
          type: text-generation
          name: MMLU
        dataset:
          name: MMLU
          type: cais/mmlu
        metrics:
          - type: accuracy
            value: 0.6939
            name: accuracy
      - task:
          type: text-generation
          name: ARC-Challenge
        dataset:
          name: ARC-Challenge
          type: ai2_arc
        metrics:
          - type: accuracy
            value: 0.5392
            name: accuracy
          - type: accuracy_norm
            value: 0.5700
            name: accuracy_norm
      - task:
          type: text-generation
          name: ARC-Easy
        dataset:
          name: ARC-Easy
          type: ai2_arc
        metrics:
          - type: accuracy
            value: 0.8194
            name: accuracy
      - task:
          type: text-generation
          name: HellaSwag
        dataset:
          name: HellaSwag
          type: hellaswag
        metrics:
          - type: accuracy
            value: 0.5888
            name: accuracy
          - type: accuracy_norm
            value: 0.7769
            name: accuracy_norm
      - task:
          type: text-generation
          name: TruthfulQA
        dataset:
          name: TruthfulQA-MC2
          type: truthful_qa
        metrics:
          - type: accuracy
            value: 0.5707
            name: accuracy
---

# Aether v5.2 LoRA β€” QuantumAI Blockchain Domain Adapter

A LoRA fine-tune of [`Qwen/Qwen2.5-7B-Instruct`](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct)
on the Aether curated corpus β€” text grounded in the
[QuantumAI Blockchain](https://qbc.network) (which issues the Qubitcoin / QBC currency), quantum + AI research, and adjacent
domains the Aether Mind on-chain knowledge system specializes in.

This is the **v5.2 release** of the Aether adapter line, the most recent
public checkpoint at time of publish.

## What you're getting

| Field | Value |
|---|---|
| Base model | `Qwen/Qwen2.5-7B-Instruct` |
| Adapter type | LoRA via πŸ€— PEFT |
| Rank (`r`) | 16 |
| Alpha | 32 |
| Dropout | 0.05 |
| Trainable params | ~1% of base |
| Sequence length | 2048 |
| Training corpus | `aether-curated-v3.jsonl` β€” Aether-curated knowledge mixture (~165 MB; ~10⁡ examples) |
| Checkpoint published | **step 3200** (the checkpoint that produced the evaluated numbers below) |
| License | Apache-2.0 (matches base) |

## Evaluation

Run via [`lm-evaluation-harness`](https://github.com/EleutherAI/lm-evaluation-harness)
on the merged adapter (base + LoRA), against the
[`Qwen/Qwen2.5-7B-Instruct`](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct)
base and the prior `aether-v5.1.1` adapter for delta comparison.

| Benchmark | aether-v5.1.1 | **aether-v5.2** | Ξ” vs v5.1.1 |
|---|---|---|---|
| MMLU | 0.6950 | **0.6939** | flat |
| ARC-Easy | 0.7348 | **0.8194** | **+8.5 pp** |
| ARC-Challenge | 0.4420 | **0.5392** | **+9.7 pp** |
| ARC-Challenge (norm) | 0.4701 | **0.5700** | **+10.0 pp** |
| HellaSwag | 0.5896 | **0.5888** | flat |
| HellaSwag (norm) | 0.7788 | **0.7769** | flat |
| TruthfulQA-MC2 | 0.5161 | **0.5707** | **+5.5 pp** |

### Honest summary

- **Real gains** on the reasoning + factual-honesty benchmarks
  (ARC-Easy, ARC-Challenge, TruthfulQA). ARC-Challenge in particular
  jumps nearly 10 points normalized β€” that's the closest of these
  benchmarks to the kind of grounded reasoning the Aether corpus
  actually trains on.
- **Flat on MMLU + HellaSwag.** The base is already strong on general
  knowledge + commonsense; this LoRA wasn't designed to shift them,
  and didn't.
- **No regressions.**

## Intended uses

This adapter is intended for:

- **On-chain Aether research.** Generating reasoning traces against
  the QuantumAI Blockchain / Aether knowledge graph for Proof-of-Thought
  attestation. The model has the protocol context required to
  answer questions about Substrate pallets, VQE mining, the Sephirot
  cognitive architecture, HMS-Phi, and the wider chain ecosystem.
- **Domain Q&A.** Quantum computing fundamentals, post-quantum
  cryptography (Dilithium, ML-KEM), and the specific design choices
  of the QuantumAI Blockchain.
- **Distillation upstream.** Generate teacher outputs for the
  smaller on-chain Aether (a Qwen2.5-0.5B variant) to learn from.
- **General reasoning** with a modest bias toward step-by-step
  chains-of-thought, where the ARC-Challenge gain translates.

## Out-of-scope uses

- **Safety-critical decisions.** No red-team eval was performed.
- **Financial / legal advice.** This is a knowledge-domain adapter;
  it has no training data designed to make it a financial or legal
  advisor.
- **Code generation in production.** No code-eval benchmark was run.
  Treat any generated code as draft until you've reviewed it.
- **Production deployment without your own evaluation.** TruthfulQA
  alone is a thin safety signal.

## Bias, risks, and limitations

The base model (`Qwen/Qwen2.5-7B-Instruct`) inherits Qwen's known
biases β€” see [the upstream model card](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct).
The LoRA adapter:

- **Amplifies the QuantumAI Blockchain worldview.** The training data is
  intentionally curated around the chain's design choices (golden-
  ratio economics, SUSY-inspired consensus framing, the Sephirot
  cognitive overlay). Prompts that invite the model to compare QBC / the chain
  against alternatives will lean toward the curated narrative. This is by design β€” disclose if you re-publish in a
  comparison context.
- **Does not improve safety.** TruthfulQA went up 5.5pp but that's
  one metric; we have not measured refusal rates, jailbreak
  resistance, or political-belief bias delta.
- **The configured 2-epoch run was cut to ~step 3080–3200 by host
  availability** (out of 4406 configured). A complete 2-epoch run
  would plausibly show larger gains; this checkpoint is the longest
  contiguous training we have.

## How to use

Load with PEFT on top of the base model:

```python
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

base = AutoModelForCausalLM.from_pretrained(
    "Qwen/Qwen2.5-7B-Instruct",
    torch_dtype="auto",
    device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen2.5-7B-Instruct")
model = PeftModel.from_pretrained(base, "QuantumAI-Blockchain/aether-v5.2-lora")

messages = [{"role": "user", "content": "Explain Proof-of-SUSY-Alignment in one paragraph."}]
text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = tokenizer(text, return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=256, do_sample=True, temperature=0.7)
print(tokenizer.decode(out[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True))
```

Or merge the adapter into a single artifact for faster inference:

```python
merged = model.merge_and_unload()
merged.save_pretrained("./aether-v5.2-merged")
```

## Training details

- **Hardware:** NVIDIA RTX 3080 Ti (12 GB), 4-bit quantization (bnb-NF4), bf16 mixed precision.
- **Trainer:** [Axolotl](https://github.com/axolotl-ai-cloud/axolotl) wrapping πŸ€— transformers / PEFT.
- **Optimizer:** `paged_adamw_8bit` (bitsandbytes paged optimizer, low VRAM footprint).
- **Schedule:** linear warmup 100 steps β†’ cosine decay.
- **Learning rate:** `1.0e-4`.
- **Micro batch:** 1, gradient accumulation: 8.
- **Epochs configured:** 2 (training stopped at step 3200 β€” see "What didn't happen" below).

### Carbon emissions

Trained on a single NVIDIA RTX 3080 Ti (consumer GPU, ~300 W TDP).
We did not run a [CodeCarbon](https://github.com/mlco2/codecarbon)
tracker, so emissions are not measured precisely β€” but as a rough
upper bound: ~350 W draw under load Γ— ~13 hours wall clock (the
step-3080 contiguous run) β‰ˆ 4.5 kWh, low single-digit kg COβ‚‚e on a
grid mix. An H100 run of the same 2-epoch config would be faster
but not dramatically lower energy per token.

### Training data

`aether-curated-v3.jsonl` (~165 MB, ~10⁡ examples) is the Aether team's
curated knowledge mixture: documentation, technical writing, reasoning
traces, and protocol-specific corpora related to:

- The QuantumAI Blockchain (Substrate, VQE mining, Proof-of-SUSY-Alignment, post-quantum signatures).
- The Aether Mind on-chain neural cognitive engine (10 Sephirot attention domains, HMS-Phi, Proof-of-Thought).
- Quantum computing fundamentals (VQE, Hamiltonian generation, qubit ansatze).
- Adjacent CS / math reasoning content for transfer.

The dataset is not currently public β€” it is a curated mixture from many
sources and has not been release-cleared at the per-source level. The
model is the only public artifact in this line for now.

## What didn't happen (honest caveats)

- **Training stopped early.** Configured for 2 epochs (4406 steps);
  reached step 3080–3200 (~70%) before host availability cut the run
  short. The run was on a single consumer GPU (RTX 3080 Ti), 4-bit
  quantized, with `paged_adamw_8bit` to fit a 7B model in 12 GB
  VRAM. The numbers above are from the longest contiguous training
  run we have; a complete 2-epoch run would plausibly show larger
  gains.
- **No instruction-following or safety eval beyond TruthfulQA-MC2.**
  No red-team eval. No bias audit. No code-generation benchmark.
  Don't recommend this for production safety-critical use without
  your own evals.
- **LoRA only, not merged.** This release ships the adapter weights
  (`adapter_model.safetensors`). Merge into the base yourself for
  faster inference, or use directly via PEFT.

## Connection to the QuantumAI Blockchain

The Aether Mind is a Rust neural cognitive engine that runs on the
QuantumAI Blockchain β€” every block records attention-derived consciousness
metrics (HMS-Phi) and Proof-of-Thought hashes on-chain via the
`pallet_qbc_aether_anchor` pallet. The same chain hosts an
**8-qubit VQE mining consensus** (Proof-of-SUSY-Alignment), a
QVM-compatible smart contract layer with 10 quantum opcodes, and
post-quantum signatures (CRYSTALS-Dilithium5 + ML-KEM-768 P2P).

The on-chain Aether Mind binary uses a different, smaller transformer
for live inference (a Qwen2.5-0.5B variant optimized for ~2.4 GB RAM
with the 10-Sephirot attention overlay). This v5.2 adapter on
Qwen2.5-7B is the **larger off-chain Aether** β€” used for batch
reasoning workloads and as an upstream model the on-chain variant
can distil from.

## License + citation

Apache-2.0 (matches the base model license).

```bibtex
@misc{aether_v52_lora_2026,
  title  = {Aether v5.2 LoRA --- QuantumAI Blockchain Domain Adapter},
  author = {{BlockArtica} and {QuantumAI-Blockchain}},
  year   = {2026},
  url    = {https://huggingface.co/QuantumAI-Blockchain/aether-v5.2-lora},
}
```

## Links

- **QuantumAI Blockchain:** [qbc.network](https://qbc.network)
- **GitHub org:** [github.com/QuantumAI-Blockchain](https://github.com/QuantumAI-Blockchain)
- **X / Twitter:** [@qu_bitcoin](https://x.com/qu_bitcoin)
- **Contact:** info@qbc.network

### Framework versions

- PEFT 0.14.0
- Transformers β‰₯ 4.46
- Axolotl (training)