mini-gte-onnx / README.md
RainbowPiBubbles's picture
Create README.md
ac6b563 verified
metadata
tags:
  - onnx
  - transformers
  - sentence-embedding
  - mini-gte
license: apache-2.0
language:
  - en
library_name: transformers
pipeline_tag: sentence-similarity

mini-gte (ONNX Quantized)

A lightweight, optimized version of the gte-small model for client-side inference (browser/edge). Exported to ONNX for compatibility with ONNX.js, Transformers.js, and other edge-friendly runtimes.

πŸš€ Features

  • ONNX Format: Ready for browser/edge deployment.
  • Quantized: Smaller size (~45MB) with minimal accuracy loss.
  • Sentence Embeddings: Generate embeddings for semantic search, clustering, etc.

πŸ“¦ Files

model/
β”œβ”€β”€ config.json
β”œβ”€β”€ model.onnx
β”œβ”€β”€ tokenizer_config.json
β”œβ”€β”€ special_tokens_map.json
└── vocab.txt