Update README.md
README.md
CHANGED
@@ -10,7 +10,6 @@ language:

## Model Details

Socrates-embedding is a lightweight, high-density text embedding model. Unlike contemporary models that rely on massive parameter counts to brute-force semantic understanding, Socrates-embedding leverages Low-Rank Decay (LoRD) to achieve high-quality vector representations with minimal computational overhead.
@@ -21,6 +20,53 @@ This model is part of the Chunjiang Intelligence edge-computing initiative, aimi

- **Model Type:** Dual-Encoder Transformer.
- **Language:** English.
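For readers who want to try the model, here is a minimal usage sketch. It assumes the checkpoint is published in a sentence-transformers-compatible format; the repository id `chunjiang/socrates-embedding` is a placeholder, not the confirmed release name.

```python
# Minimal usage sketch (assumes a sentence-transformers-compatible checkpoint;
# the repository id "chunjiang/socrates-embedding" is a placeholder).
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("chunjiang/socrates-embedding")

sentences = [
    "The unexamined life is not worth living.",
    "Lightweight embedding models can run on edge devices.",
]

# Encode sentences into dense, L2-normalized vectors.
embeddings = model.encode(sentences, normalize_embeddings=True)
print(embeddings.shape)

# With normalized vectors, cosine similarity reduces to a dot product.
score = float(embeddings[0] @ embeddings[1])
print(score)
```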
The model was evaluated on the `AmazonCounterfactualClassification` dataset across multiple languages.

| Language | Accuracy |
| :--- | :---: |
| Japanese (ja) | 54.83 |
| German (de) | 52.57 |
| English (en) | 49.70 |
| English-Ext (en-ext) | 49.15 |
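For reproducibility, the sketch below shows how such scores could be obtained with the `mteb` benchmark harness. The model id is a placeholder, and the exact API may differ between `mteb` versions.

```python
# Hedged sketch: reproducing the AmazonCounterfactualClassification scores with
# the MTEB harness. The model id is a placeholder; API details vary by mteb version.
from mteb import MTEB
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("chunjiang/socrates-embedding")  # placeholder id

evaluation = MTEB(tasks=["AmazonCounterfactualClassification"])
results = evaluation.run(model, output_folder="results/socrates-embedding")

# Per-language accuracies (ja, de, en, en-ext) are written to the output folder
# as JSON and can be compared against the table above.
```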
To put the model's efficiency into perspective, we compare its single-task score on Japanese classification against the *overall MTEB average scores* of much larger models. (Our current budget does not cover the GPU time needed to complete the remaining tests.)

<br>

<p align="center">
<img src="model_efficiency_comparison.png" width="800">
<br>
<em>Figure 1: Our 83M model's score on a single challenging task rivals the average performance of models up to 85x larger.</em>
</p>

<br>
Clustering performance was evaluated using the V-measure score (multiplied by 100) on the `StackExchangeClustering` task.

We compared Socrates-embedding against other popular lightweight models (<110M params).

| Model | Parameters | Clustering Score (V-measure x 100) |
| :--- | :--- | :---: |
| **Socrates-embedding** | **83M** | **8.92** 🏆 |
| `snowflake-arctic-embed-m` | 109M | 7.25 |
| `KartonBERT-USE-base-v1` | 104M | 6.93 |
| `jina-embedding-s-en-v1` | 35M | 6.64 |
| `all-MiniLM-L6-v2` | 23M | 6.62 |

- **Observation:** Our model achieves the highest clustering score in its weight class, demonstrating a superior vector space structure compared to established baselines.
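As an illustration of the metric, here is a hedged sketch of how a V-measure score can be computed from embeddings with scikit-learn. The model id, texts, labels, and cluster count are placeholders; MTEB's `StackExchangeClustering` protocol is more involved than this toy example.

```python
# Hedged sketch: computing a V-measure clustering score from sentence embeddings.
# The model id, texts, and labels are placeholders; MTEB's StackExchangeClustering
# protocol (k-means over multiple splits) is more involved than this.
from sentence_transformers import SentenceTransformer
from sklearn.cluster import KMeans
from sklearn.metrics import v_measure_score

texts = [
    "How do I reverse a list in Python?",
    "What is the difference between TCP and UDP?",
    "Reversing arrays without extra memory",
    "UDP vs TCP for streaming video",
]
true_labels = [0, 1, 0, 1]  # ground-truth topic ids

model = SentenceTransformer("chunjiang/socrates-embedding")  # placeholder id
embeddings = model.encode(texts, normalize_embeddings=True)

# Cluster the embeddings with k-means using the known number of topics.
pred_labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(embeddings)

# V-measure, scaled by 100 to match the table above.
print(round(v_measure_score(true_labels, pred_labels) * 100, 2))
```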
<br>

<p align="center">
<img src="model_clustering_comparison.png" width="800">
<br>
<em>Figure 2: Leading clustering performance among lightweight embedding models.</em>
</p>

<br>

## Model Architecture

The model utilizes a custom Transformer Encoder architecture optimized for inference latency on Apple MPS and NVIDIA TensorRT backends.
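To make the backend claim concrete, here is a small sketch of how inference device selection might look in PyTorch. It assumes a sentence-transformers-compatible checkpoint (placeholder id) and does not cover TensorRT engine building, which would typically go through a separate export step.

```python
# Hedged sketch: selecting an inference device for the embedding model.
# Assumes a sentence-transformers-compatible checkpoint (placeholder id);
# TensorRT deployment would additionally require engine export/compilation.
import torch
from sentence_transformers import SentenceTransformer

if torch.backends.mps.is_available():  # Apple Silicon GPU
    device = "mps"
elif torch.cuda.is_available():        # NVIDIA GPU
    device = "cuda"
else:
    device = "cpu"

model = SentenceTransformer("chunjiang/socrates-embedding", device=device)  # placeholder id
embeddings = model.encode(["Edge inference with a lightweight encoder."])
print(device, embeddings.shape)
```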