--- license: apache-2.0 base_model: Alibaba-NLP/gte-modernbert-base library_name: onnxruntime pipeline_tag: sentence-similarity tags: - onnx - sentence-similarity - embeddings - feature-extraction - code-search - ziv - code-aware language: - en --- # Ziv Embedder — Code Aware (ONNX) This is an ONNX export of **[Alibaba-NLP/gte-modernbert-base](https://huggingface.co/Alibaba-NLP/gte-modernbert-base)**, prepared for **[Ziv](https://github.com/Muhd-Uwais/ziv)** — a local semantic code search engine for Python repositories. Ziv uses this embedder to improve code-aware search quality in version **0.4.0**. The model is optimized for local inference with **onnxruntime**, making it lightweight, fast, and practical for offline developer workflows. ## Why this model? Ziv needs embeddings that work well for code search and code understanding while staying fully local. This model is designed to support that goal with: - **Code-aware semantic search** - **Fast local inference** - **No cloud dependency** - **No API keys** - **ONNX runtime compatibility** Compared to a standard Python-based embedding stack, this setup is easier to ship and more efficient to run inside a local developer tool. ## Usage with Ziv ```bash ziv init --model code ziv start ``` ## Model details | Property | Value | |---|---| | Base model | Alibaba-NLP/gte-modernbert-base | | Model type | Text embedding | | Embedding dimension | 768 | | Max sequence length | 8192 | | Runtime | onnxruntime | | Primary use | Semantic code search / code understanding | ## Files | File | Description | |---|---| | `model.onnx` | ONNX model weights and graph | | `tokenizer.json` | Tokenizer vocabulary and rules | | `tokenizer_config.json` | Tokenizer settings | | `config.json` | Model architecture config | | `1_Pooling/config.json` | Pooling configuration | ## Relation to the original model This model is based on **[Alibaba-NLP/gte-modernbert-base](https://huggingface.co/Alibaba-NLP/gte-modernbert-base)**, developed by **Tongyi Lab, Alibaba Group**. This repository does **not** claim ownership of the original model weights or training recipe. It provides an ONNX-exported runtime version tailored for Ziv and local inference. The original model and its concepts should be credited to: - **Tongyi Lab, Alibaba Group** - **The gte-modernbert model authors** - **The broader Sentence Transformers ecosystem** ## License This model is released under the **Apache 2.0 License**, consistent with the upstream model license. Original model: **[Alibaba-NLP/gte-modernbert-base](https://huggingface.co/Alibaba-NLP/gte-modernbert-base)** ## Citation If you use this model or the upstream base model in your work, please cite the original paper: ```bibtex @inproceedings{zhang2024mgte, title={mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval}, author={Zhang, Xin and Zhang, Yanzhao and Long, Dingkun and Xie, Wen and Dai, Ziqi and Tang, Jialong and Lin, Huan and Yang, Baosong and Xie, Pengjun and Huang, Fei and others}, booktitle={Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: Industry Track}, pages={1393--1412}, year={2024} } @article{li2023towards, title={Towards general text embeddings with multi-stage contrastive learning}, author={Li, Zehan and Zhang, Xin and Zhang, Yanzhao and Long, Dingkun and Xie, Pengjun and Zhang, Meishan}, journal={arXiv preprint arXiv:2308.03281}, year={2023} } ```