metadata
license: mit
tags:
- onnx
- function-calling
- needle
- cactus
- browser
- sentencepiece
base_model: Cactus-Compute/needle
library_name: onnxruntime
Needle fine-tune export
This repo contains a browser-ready ONNX export of a locally fine-tuned Needle checkpoint.
Provenance
- Base model: Cactus-Compute/needle
- Fine-tuned checkpoint:
needle_finetuned_20260608111244_50981_12_512_best.pkl - Fine-tuning data: UI tool-call dataset generated from the Needle demo UI
Files
| File | Description |
|---|---|
encoder.onnx |
Needle encoder exported from the fine-tuned checkpoint |
decoder_step.onnx |
One-step decoder with KV-cache I/O |
needle.model |
SentencePiece tokenizer |
tokenizer-specials.json |
Special token IDs used by the model |
Usage
Load the two ONNX graphs with onnxruntime-web, load needle.model with
sentencepiece-js, and run the encoder once followed by the decoder step in a JS loop.
Notes
This export follows the public porting guide from onnx-community/needle-onnx.