YAML Metadata Warning:empty or missing yaml metadata in repo card
Check out the documentation for more information.
GGUF Header Edit Benchmark
Benchmark script for measuring how long it takes to edit GGUF headers in-place on Hugging Face with streaming blobs (xet) and create a pull request per file.
It fetches metadata, rebuilds the header with a small change, commits an edit (header slice only), and records timings to a CSV.
Result from benchmark.ts
Rule of thumb (linear fit):
time_minutes β 0.36 Γ size_GB + 0.25
β¨ What this does
For each *.gguf file in a model repo:
- Discover files via the Hugging Face model tree API.
- Fetch GGUF + typed metadata with
@huggingface/gguf. - Rebuild the header using
buildGgufHeader(preserving endianness, alignment, and tensor info range). - Commit a slice edit (header bytes only) using
commitIterwithuseXet: trueto avoid full re-uploads. - Create a PR titled
benchmark. - Record timing (wall-clock) to
benchmark-results.csv.
π§± Requirements
- Node 18+
- A Hugging Face token with read + write on the target repo:
HF_TOKEN - NPM packages:
@huggingface/gguf@huggingface/hub
- Network access to
huggingface.co
π§ Setup
npm i
npm run benchmark
- Downloads last month
- 1,761
Hardware compatibility
Log In to add your hardware
We're not able to determine the quantization variants.
Inference Providers NEW
This model isn't deployed by any Inference Provider. π Ask for provider support