File size: 735 Bytes
359abd3
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
# cap-26m-fast-dev

Tiny decoder-only language model trained locally on macOS as part of the `cap` project.

## Summary

- Model name: `cap-26m-fast-dev`
- Base architecture: decoder-only Transformer in a LLaMA-style configuration
- Training data: TinyStories
- Training mode: fast development run for quick iteration

## Checkpoint notes

- Saved from the `cap` local training pipeline
- Includes tokenizer files alongside the model checkpoint
- Intended as an intermediate checkpoint, not a final polished release

## Known metrics

- Structured eval loss: `4.8252`
- Structured eval perplexity: `124.61`

## Usage

Load with `transformers` using `AutoModelForCausalLM.from_pretrained(...)` and `AutoTokenizer.from_pretrained(...)`.