FatStinkyPanda · Aether — step 40,000

A from-scratch reasoning LLM by FatStinkyPanda (Daniel A. Bissey), part of the Anima project.

Proprietary architecture. The model architecture and training innovations are NOT public. This repo hosts the trained weights + tokenizer only; the config + modeling code live in a private repo. Benchmarks are run on neutral free compute (Kaggle) and published openly so the numbers are verifiable without exposing how the model works.

Latest held-out benchmarks (acc_norm; neutral compute)

hellaswag: 0.33
arc_easy: 0.35
arc_challenge: 0.27
winogrande: 0.57
openbookqa: 0.29
train_slice_ppl: 12.02

Created & released by FatStinkyPanda. For access or collaboration, contact the creator.

Downloads last month: 61

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support