VectorNomad's picture
Add README
5cc030b verified
---
license: cc-by-nc-4.0
tags:
- arkadiko
- checkpoint
- resume-only
---
# Arkadiko V4 β€” Raw Training Checkpoints
**For resume-training and reproducibility only. Not the artifact you want to load for inference or SFT.**
For a clean inference/SFT artifact see [VectorNomad/arkadiko-v4-base](https://huggingface.co/VectorNomad/arkadiko-v4-base).
## Files
| File | Size | Step | What |
|---|---|---|---|
| `final.pt` | ~1.28 GB | 9,114,584 | End-of-run checkpoint. Last optimizer step before training terminated. |
| `latest.pt` | ~1.28 GB | ~9,113,000 | Atomic-save predecessor. Kept as a safety copy. |
Each `.pt` is a Python pickle (`torch.save`) with these top-level keys:
- `step` β€” global step
- `total_tokens` β€” tokens consumed
- `subphase_idx` β€” curriculum subphase (15 = final)
- `config` β€” `ArkadikoConfig` dataclass instance (requires the project repo to unpickle)
- `model` β€” state_dict (131 entries, 213.9M bf16 params)
- `optimizer` β€” AdamW state (exp_avg, exp_avg_sq for every param + LR/beta groups)
## Loading
Requires the `arkadiko` package on `sys.path` (the project repo is private). Without it, `torch.load` raises `ModuleNotFoundError: No module named 'arkadiko'` because the pickled `ArkadikoConfig` references it.
```python
import sys; sys.path.insert(0, "/path/to/arkadiko-repo")
import torch
ckpt = torch.load("final.pt", map_location="cpu", weights_only=False)
print(ckpt["step"], ckpt["total_tokens"])
```
For inference-only use of the weights, prefer the `safetensors` artifact in the [base repo](https://huggingface.co/VectorNomad/arkadiko-v4-base).
## License
CC BY-NC 4.0.