FatStinkyPanda · Aether — step 40,000

A from-scratch reasoning LLM by FatStinkyPanda (Daniel A. Bissey), part of the Anima project.

Proprietary architecture. The model architecture and training innovations are NOT public. This repo hosts only the trained weights and tokenizer; the config and modeling code live in a private repo. Benchmarks are run on neutral, free compute (Kaggle) and published openly so the numbers can be verified without exposing how the model works.

Latest held-out benchmarks (acc_norm; neutral compute)

  • hellaswag: 0.33
  • arc_easy: 0.35
  • arc_challenge: 0.27
  • winogrande: 0.57
  • openbookqa: 0.29
  • train_slice_ppl: 12.02 (perplexity, not acc_norm; lower is better)
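
For context, the scores above can be compared with random-guess accuracy. The sketch below is illustrative only: the number of answer options per task is an assumption based on the standard versions of these benchmarks (this card does not state them), and the helper names are hypothetical. It also converts the reported perplexity to an average loss in nats, assuming the figure is a per-token perplexity.

```python
import math

# Scores copied from the list above (acc_norm).
scores = {
    "hellaswag": 0.33,
    "arc_easy": 0.35,
    "arc_challenge": 0.27,
    "winogrande": 0.57,
    "openbookqa": 0.29,
}

# ASSUMPTION: answer options per item in the standard task versions.
# These counts are not stated in this card.
num_choices = {
    "hellaswag": 4,
    "arc_easy": 4,
    "arc_challenge": 4,
    "winogrande": 2,
    "openbookqa": 4,
}


def margin_over_chance(scores, num_choices):
    """Return accuracy minus the uniform random-guess baseline per task."""
    return {
        task: round(acc - 1.0 / num_choices[task], 4)
        for task, acc in scores.items()
    }


def ppl_to_nats(ppl):
    """Convert perplexity to mean negative log-likelihood in nats."""
    return math.log(ppl)


margins = margin_over_chance(scores, num_choices)
for task, margin in margins.items():
    print(f"{task}: {margin:+.2f} vs. chance")

# ASSUMPTION: 12.02 is a per-token perplexity.
print(f"train_slice_ppl 12.02 -> {ppl_to_nats(12.02):.2f} nats per token")
```

Under these assumptions, every task sits above its chance baseline, with arc_challenge the closest to it.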

Created & released by FatStinkyPanda. For access or collaboration, contact the creator.
