d2v1shx
/

cap-26m-fast-dev

Model card Files Files and versions

YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

cap-26m-fast-dev

Tiny decoder-only language model trained locally on macOS as part of the cap project.

Summary

Model name: cap-26m-fast-dev
Base architecture: decoder-only Transformer in a LLaMA-style configuration
Training data: TinyStories
Training mode: fast development run for quick iteration

Checkpoint notes

Saved from the cap local training pipeline
Includes tokenizer files alongside the model checkpoint
Intended as an intermediate checkpoint, not a final polished release

Known metrics

Structured eval loss: 4.8252
Structured eval perplexity: 124.61

Usage

Load with transformers using AutoModelForCausalLM.from_pretrained(...) and AutoTokenizer.from_pretrained(...).

Downloads last month: 1

Safetensors

Model size

29.1M params

Tensor type

F32

·

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support