prdev
/

mini-gte

@@ -8654,12 +8654,11 @@ language:
 ---
 # Mini-GTE
-This is a distillbert-based model trained from GTE-base. It can be used as a faster query encoder for the GTE series or as a standalone unit (MTEB scores are for standalone).
 ## Model Details
-### Model Description
 - **Model Type:** Sentence Transformer
 - **Base model:** [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) <!-- at revision 12040accade4e8a0f71eabdb258fecc2e7e948be -->
 - **Maximum Sequence Length:** 512 tokens
@@ -8667,20 +8666,24 @@ This is a distillbert-based model trained from GTE-base. It can be used as a fas
 - **Similarity Function:** Cosine Similarity
 ## Usage
-### Direct Usage (Sentence Transformers)
-First install the Sentence Transformers library:
 ```bash
 pip install -U sentence-transformers
 ```
-Then you can load this model and run inference.
 ```python
 from sentence_transformers import SentenceTransformer
-# Download from the 🤗 Hub
 model = SentenceTransformer("sentence_transformers_model_id")
 # Run inference
 sentences = [
@@ -8689,54 +8692,14 @@ sentences = [
     'He drove to the stadium.',
 ]
 embeddings = model.encode(sentences)
-print(embeddings.shape)
-# [3, 768]
-# Get the similarity scores for the embeddings
 similarities = model.similarity(embeddings, embeddings)
-print(similarities.shape)
-# [3, 3]
 ```
-<!--
-### Direct Usage (Transformers)
-<details><summary>Click to see the direct usage in Transformers</summary>
-</details>
--->
-<!--
-### Downstream Usage (Sentence Transformers)
-You can finetune this model on your own dataset.
-<details><summary>Click to expand</summary>
-</details>
--->
-<!--
-### Out-of-Scope Use
-*List how the model may foreseeably be misused and address what users ought not to do with the model.*
--->
-<!--
-## Bias, Risks and Limitations
-*What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
--->
-<!--
-### Recommendations
-*What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
--->
 ## Training Details
-### Framework Versions
 - Python: 3.10.12
 - Sentence Transformers: 3.3.1
 - Transformers: 4.48.0.dev0
@@ -8746,23 +8709,15 @@ You can finetune this model on your own dataset.
 - Tokenizers: 0.21.0
 ## Citation
-### BibTeX
-<!--
-## Glossary
-*Clearly define terms in order to be accessible across audiences.*
--->
-<!--
-## Model Card Authors
-*Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
--->
-<!--
-## Model Card Contact
-*Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
--->

 ---
 # Mini-GTE
+## Overview
+This is the first model developed by QTACK and serves as a proof of concept for our distillation approach! Built upon a distillbert-based architecture, Mini-GTE is distilled from GTE and designed for efficiency without sacrificing accuracy at only 66M parameters. As a standalone sentence transformer, it ranks 2nd on the MTEB classic leaderboard in the <100M parameter category and 63rd overall which makes it a strong choice for real-time query encoding, semantic search, and similarity tasks.
 ## Model Details
 - **Model Type:** Sentence Transformer
 - **Base model:** [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) <!-- at revision 12040accade4e8a0f71eabdb258fecc2e7e948be -->
 - **Maximum Sequence Length:** 512 tokens
 - **Similarity Function:** Cosine Similarity
 ## Usage
+- Optimized for quick inference
+- Great at quickly generating high quality encodings
+- Easy to plug and play since it is distilled from GTE
+## Getting Started
+### Installation
+Mini-GTE is built on the [Sentence Transformers](https://www.sbert.net/) framework. To install the required packages, run:
 ```bash
 pip install -U sentence-transformers
 ```
+### Quick Start
+Here's a quick example to get you started:
 ```python
 from sentence_transformers import SentenceTransformer
+# Download directly from Hugging Face
 model = SentenceTransformer("sentence_transformers_model_id")
 # Run inference
 sentences = [
     'He drove to the stadium.',
 ]
 embeddings = model.encode(sentences)
+print(embeddings.shape) # Expected: [3, 768]
+# Compute the similarity scores for the embeddings
 similarities = model.similarity(embeddings, embeddings)
+print(similarities.shape) # Expected: [3, 3]
 ```
 ## Training Details
 - Python: 3.10.12
 - Sentence Transformers: 3.3.1
 - Transformers: 4.48.0.dev0
 - Tokenizers: 0.21.0
 ## Citation
+```bibtex
+@misc{mini-gte2025,
+  title={Mini-GTE: A Fast and Efficient Distilled Sentence Transformer},
+  author={QTACK},
+  year={2025},
+  note={Available on the Hugging Face Hub}
+}
+```
+## Getting Help
+For any questions, suggestions, or issues, please contact the QTACK team directly through our [contact page](https://www.qtack.com/contact).