ASA-ASM: Checkpoints (multi-dataset)

This repository hosts raw training checkpoints for Addressed State Models (ASM) built with Addressed State Attention (ASA).

Layout

  • checkpoints/<dataset_tag>/*.pt: PyTorch checkpoints
  • configs/<dataset_tag>/*.config.json: extracted training config (when available)
  • metadata/<dataset_tag>/*.metadata.json: SHA-256 hash and provenance
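
Because each checkpoint has a metadata file with a SHA-256 hash, a download can be checked before it is loaded. The following is a minimal sketch of such a check, not the repository's own tooling. The schema of the metadata JSON is not documented here, so the key name `sha256` is an assumption, as are the helper names `sha256_of_file` and `matches_metadata`. Inspect a real metadata file and adjust the key if it differs.

```python
import hashlib
import json
from pathlib import Path


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, read in chunks to bound memory use."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_metadata(checkpoint_path, metadata_path, key="sha256"):
    """Compare a checkpoint's digest with the value stored in its metadata JSON.

    ASSUMPTION: the metadata file is a JSON object with a top-level field
    named by `key`; the real field name may be different.
    """
    metadata = json.loads(Path(metadata_path).read_text(encoding="utf-8"))
    expected = metadata.get(key)
    if expected is None:
        raise KeyError(f"no '{key}' field found in {metadata_path}")
    return sha256_of_file(checkpoint_path) == str(expected).lower()
```

Running the check before `torch.load` is worthwhile because PyTorch checkpoints are pickle-based, so a corrupted or tampered file should be rejected early.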

Included in this commit

  • wikitext103-raw: ASA_ASM_wt103-rawv1_gpt2_T1024_L21_D384_H8_K16_M32_ropek1_alibi1_gamma1_step75000_best.pt
  • fineweb: ASA_ASM_fineweb_T1024_L15_D1024_H16_K16_S32_step17500_last.pt
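
The checkpoint filenames above encode several numeric fields (for example `T1024`, `L21`, `step75000`) plus a trailing `best` or `last` tag. The sketch below only extracts those fields as they appear in the name. It does not interpret what each letter means, since this README does not define them, and the function name `parse_checkpoint_name` is hypothetical.

```python
import re
from pathlib import Path

# Matches tokens such as "T1024" or "step75000": a single capital letter
# (or the word "step") followed only by digits.
_FIELD = re.compile(r"([A-Z]|step)(\d+)")


def parse_checkpoint_name(filename):
    """Extract letter/number fields and the trailing tag from a checkpoint filename."""
    tokens = Path(filename).stem.split("_")
    fields = {}
    for token in tokens:
        match = _FIELD.fullmatch(token)
        if match:
            fields[match.group(1)] = int(match.group(2))
    fields["tag"] = tokens[-1]  # final token, e.g. "best" or "last"
    return fields


if __name__ == "__main__":
    name = "ASA_ASM_fineweb_T1024_L15_D1024_H16_K16_S32_step17500_last.pt"
    print(parse_checkpoint_name(name))
```

Tokens that do not fit the pattern, such as `wt103-rawv1`, `gpt2`, or `ropek1`, are skipped rather than guessed at.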

Provenance

Notes

  • This commit does not delete any existing files in the repo.
  • If older checkpoints exist at repo root, they are left intact for backward compatibility.