picoLLM Runtime Artifacts

This repo contains the inference-focused artifact set produced by a picollm/accelerated/speedrun.sh run.

Provenance

picoLLM repo commit: 7c2c7d831050ca0ca64265b85a4ffc9ba97b2596
Preferred chat source: sft
Base checkpoint: {'root': 'base_checkpoints', 'model_tag': 'd24', 'step': 5568, 'latest_model': 'base_checkpoints/d24/model_005568.pt', 'latest_meta': 'base_checkpoints/d24/meta_005568.json'}
SFT checkpoint: {'root': 'chatsft_checkpoints', 'model_tag': 'd24', 'step': 482, 'latest_model': 'chatsft_checkpoints/d24/model_000482.pt', 'latest_meta': 'chatsft_checkpoints/d24/meta_000482.json'}
Run manifest: run_manifest.json
Identity dataset: identity_conversations.jsonl

picoLLM-native checkpoints only; this repo is not Transformers-native
optimizer shards are intentionally excluded here
use the paired archive dataset repo if you need fuller resume-training artifacts
the model can hallucinate and should be treated as a smaller open model

git clone https://github.com/Montekkundan/llm
cd llm
uv sync --extra gpu
python scripts/restore_picollm_from_hf.py montekkundan/picollm-v1

If you already downloaded the repo contents into a custom artifact directory:

export PICOLLM_BASE_DIR=$PWD/artifacts/picollm
python -m picollm.accelerated.chat.cli -i sft

Downloads last month: -; Downloads are not tracked for this model. How to track