# cap-26m-fast-dev
Tiny decoder-only language model trained locally on macOS as part of the `cap` project.
## Summary
- Model name: `cap-26m-fast-dev`
- Base architecture: decoder-only Transformer in a LLaMA-style configuration
- Training data: TinyStories
- Training mode: fast development run for quick iteration
## Checkpoint notes
- Saved from the `cap` local training pipeline
- Includes tokenizer files alongside the model checkpoint
- Intended as an intermediate checkpoint, not a final polished release
## Known metrics
- Structured eval loss: `4.8252`
- Structured eval perplexity: `124.61`
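The two metrics above are related: perplexity is the exponential of the loss. The short check below is a sketch that assumes the reported loss is a mean per-token cross-entropy in nats, which the card does not state explicitly. The helper name `perplexity_from_loss` is illustrative and not part of the `cap` pipeline.

```python
import math


def perplexity_from_loss(mean_nll: float) -> float:
    """Convert a mean per-token negative log-likelihood (in nats) to perplexity."""
    return math.exp(mean_nll)


# Loss value reported in this card; assumed to be mean cross-entropy in nats.
eval_loss = 4.8252
eval_perplexity = perplexity_from_loss(eval_loss)
print(f"{eval_perplexity:.2f}")  # prints 124.61
```

Under that assumption the result matches the reported perplexity of `124.61`.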
## Usage
Load with `transformers` using `AutoModelForCausalLM.from_pretrained(...)` and `AutoTokenizer.from_pretrained(...)`.
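A minimal loading sketch follows, assuming the checkpoint and tokenizer files sit in one local directory and that `torch` is installed next to `transformers`. The function name `generate_text`, its parameters, and the greedy decoding settings are illustrative choices, not something this card specifies.

```python
def generate_text(model_dir: str, prompt: str, max_new_tokens: int = 50) -> str:
    """Load the checkpoint and tokenizer from model_dir and continue the prompt."""
    # Imported inside the function so the heavy libraries load only when it is called.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_dir)
    model = AutoModelForCausalLM.from_pretrained(model_dir)
    model.eval()  # inference mode: disables dropout

    inputs = tokenizer(prompt, return_tensors="pt")
    with torch.no_grad():
        output_ids = model.generate(
            input_ids=inputs["input_ids"],
            attention_mask=inputs["attention_mask"],
            max_new_tokens=max_new_tokens,
            do_sample=False,  # greedy decoding, chosen here for reproducibility
        )
    # Decode the full sequence (prompt plus continuation) back to text.
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

For example, `generate_text("path/to/cap-26m-fast-dev", "Once upon a time")` would return the prompt followed by the model's continuation, where the path is a placeholder for wherever the checkpoint was saved. Since this is an intermediate checkpoint with a perplexity above 100, the output is likely to be rough.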