justinebert1's picture
Upload README.md with huggingface_hub
4c8863b verified
metadata
license: mit
tags:
  - onnx
  - function-calling
  - needle
  - cactus
  - browser
  - sentencepiece
base_model: Cactus-Compute/needle
library_name: onnxruntime

Needle fine-tune export

This repo contains a browser-ready ONNX export of a locally fine-tuned Needle checkpoint.

Provenance

  • Base model: Cactus-Compute/needle
  • Fine-tuned checkpoint: needle_finetuned_20260608111244_50981_12_512_best.pkl
  • Fine-tuning data: UI tool-call dataset generated from the Needle demo UI

Files

File Description
encoder.onnx Needle encoder exported from the fine-tuned checkpoint
decoder_step.onnx One-step decoder with KV-cache I/O
needle.model SentencePiece tokenizer
tokenizer-specials.json Special token IDs used by the model

Usage

Load the two ONNX graphs with onnxruntime-web, load needle.model with sentencepiece-js, and run the encoder once followed by the decoder step in a JS loop.

Notes

This export follows the public porting guide from onnx-community/needle-onnx.