picoLLM Runtime Artifacts

This repo contains the inference-focused artifact set produced by a picollm/accelerated/speedrun.sh run.

Provenance

  • picoLLM repo commit: 7c2c7d831050ca0ca64265b85a4ffc9ba97b2596
  • Preferred chat source: sft
  • Base checkpoint: {'root': 'base_checkpoints', 'model_tag': 'd24', 'step': 5568, 'latest_model': 'base_checkpoints/d24/model_005568.pt', 'latest_meta': 'base_checkpoints/d24/meta_005568.json'}
  • SFT checkpoint: {'root': 'chatsft_checkpoints', 'model_tag': 'd24', 'step': 482, 'latest_model': 'chatsft_checkpoints/d24/model_000482.pt', 'latest_meta': 'chatsft_checkpoints/d24/meta_000482.json'}
  • Run manifest: run_manifest.json
  • Identity dataset: identity_conversations.jsonl

Intended Usage

  • restore the bundle into PICOLLM_BASE_DIR
  • run python -m picollm.accelerated.chat.cli -i sft
  • run python -m picollm.accelerated.chat.web -i sft

Demo

Learn More

Known Limitations

  • picoLLM-native checkpoints only; this repo is not Transformers-native
  • optimizer shards are intentionally excluded here
  • use the paired archive dataset repo if you need fuller resume-training artifacts
  • the model can hallucinate and should be treated as a smaller open model

Local Restore

git clone https://github.com/Montekkundan/llm
cd llm
uv sync --extra gpu
python scripts/restore_picollm_from_hf.py montekkundan/picollm-v1

If you already downloaded the repo contents into a custom artifact directory:

export PICOLLM_BASE_DIR=$PWD/artifacts/picollm
python -m picollm.accelerated.chat.cli -i sft
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Space using montekkundan/picollm-v1 1