---
license: mit
tags: [onnx, int8, semantex, embedder]
base_model: nomic-ai/CodeRankEmbed
---

# CodeRankEmbed-onnx-int8

int8 ONNX build of [`nomic-ai/CodeRankEmbed`](https://huggingface.co/nomic-ai/CodeRankEmbed)
(Nomic AI), prepared for [semantex](https://github.com/MisterTK/semantex)'s
local CPU `embedder` path.

- **License:** MIT — inherited from the upstream model; original
  copyright and attribution to **Nomic AI** are retained.
- **Derived from ONNX export:** `sirasagi62/code-rank-embed-onnx`.
- **Files:** `model_int8.onnx`, `tokenizer.json`, `config.json` (the layout semantex's
  downloader expects).

## Notes
- Single-vector bi-encoder (`nomic_bert`).
- Documents are embedded as raw code with no prefix; queries are prepended with the model's `query_prefix`.
- Pooling: mean over tokens, then L2-normalize.
- Alternative source that is already int8: `mrsladoje/CodeRankEmbed-onnx-int8` (use with `--source-repo` and `--no-quantize`).
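
The prefix and pooling steps described in these notes can be sketched in NumPy as below. This is an illustration and not semantex's actual code: the helper names `prepare_text` and `mean_pool_l2` are hypothetical, the query prefix is passed in as a parameter instead of being hard-coded, and the use of the attention mask to exclude padding tokens from the mean is an assumption about how padded batches are handled.

```python
import numpy as np


def prepare_text(text: str, is_query: bool, query_prefix: str) -> str:
    """Queries get the prefix; documents (raw code) pass through unchanged."""
    return query_prefix + text if is_query else text


def mean_pool_l2(token_embeddings: np.ndarray, attention_mask: np.ndarray) -> np.ndarray:
    """Masked mean over the token axis, followed by L2 normalization.

    token_embeddings: (batch, seq_len, dim) float array of per-token outputs.
    attention_mask:   (batch, seq_len) array, 1 for real tokens, 0 for padding.
    """
    # Broadcast the mask over the embedding dimension (assumption: padding
    # positions should not contribute to the mean).
    mask = attention_mask[..., None].astype(token_embeddings.dtype)
    summed = (token_embeddings * mask).sum(axis=1)
    counts = np.clip(mask.sum(axis=1), 1e-9, None)  # avoid division by zero
    pooled = summed / counts
    # Scale each vector to unit length.
    norms = np.linalg.norm(pooled, axis=1, keepdims=True)
    return pooled / np.clip(norms, 1e-12, None)
```

Because the outputs are unit-length, the dot product of a query vector and a document vector equals their cosine similarity.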

Produced by `benchmarks/onnx_models/prepare_models.py`. Do not edit by hand.