NexaAI/embeddinggemma-300m-npu

Feature Extraction


zackli4ai committed on Sep 15, 2025

Commit

2fca6d9

·

verified ·

1 Parent(s): d96b6f4

Create README.md

Files changed (1)

README.md +63 -0

README.md ADDED

	@@ -0,0 +1,63 @@

+# EmbeddingGemma-300M (NPU)
+## Model Description
+**EmbeddingGemma** is a 300M-parameter open embedding model developed by **Google DeepMind**.
+It is built on **Gemma 3** (with T5Gemma initialization) and draws on the same research and technology used in the **Gemini models**.
+The model produces **vector representations of text**, making it well-suited for **search, retrieval, classification, clustering, and semantic similarity tasks**.
+It was trained on ~320B tokens of data covering **100+ languages** and is optimized for **on-device efficiency** (mobile phones, laptops, desktops).
+## Features
+- **Compact and efficient**: 300M parameters, optimized for on-device use.
+- **Multilingual**: trained on 100+ spoken languages.
+- **Flexible embeddings**: default dimension **768**, with support for **512, 256, 128** via Matryoshka Representation Learning (MRL).
+- **Wide task coverage**: retrieval, QA, fact-checking, classification, clustering, similarity.
+- **Commercial-friendly**: open weights available for research and production.
+## Use Cases
+- Semantic similarity and recommendation systems
+- Document, code, and web search
+- Clustering for organization, research, and anomaly detection
+- Classification (e.g., sentiment, spam detection)
+- Fact verification and QA embeddings
+- Code retrieval for programming assistance
+## Inputs and Outputs
+**Input**:
+- **Type**: Text string (e.g., query, prompt, document)
+- **Max Length**: 2048 tokens
+**Output**:
+- **Type**: Embedding vector (default 768d)
+- **Options**: 512 / 256 / 128 dimensions via truncation & re-normalization (MRL)
+## Limitations & Responsible Use
+This model has known limitations:
+- **Bias & coverage**: quality depends on training data diversity.
+- **Nuance & ambiguity**: may struggle with sarcasm and figurative language.
+- **Ethical concerns**: risk of bias perpetuation, privacy leakage, or malicious misuse.
+Mitigations:
+- CSAM and sensitive-data filtering was applied to the training data.
+- Users should adhere to **Gemma Responsible AI guidelines** and **Prohibited Use Policy**.
+## License
+- Licensed under Google’s **Gemma Terms of Use**.
+- See: [Gemma Terms](https://ai.google.dev/gemma/terms)
+Ensure your usage complies with upstream license conditions.
+## References
+- [nexaSDK](https://sdk.nexa.ai)
+## Support
+For SDK-related issues, visit [sdk.nexa.ai](https://sdk.nexa.ai).
+For model-specific questions, open an issue in this repository.
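
The Inputs and Outputs section of the README says the 512, 256, and 128-dimensional embeddings are obtained by truncation and re-normalization (MRL). A minimal sketch of that post-processing step follows, assuming the full 768-dimensional vector is already available as a plain sequence of floats. The helper name `truncate_embedding` is hypothetical and is not part of nexaSDK or any Gemma tooling; how the vector is obtained from the model is outside the scope of this sketch.

```python
import math
from typing import List, Sequence


def truncate_embedding(vec: Sequence[float], dim: int) -> List[float]:
    """Keep the first `dim` components of `vec` and rescale to unit L2 norm.

    Hypothetical helper for illustration only: it assumes `vec` is a full
    embedding produced elsewhere and that `dim` is one of the smaller
    sizes the model card lists (512, 256, or 128).
    """
    if not 0 < dim <= len(vec):
        raise ValueError(f"dim must be in 1..{len(vec)}, got {dim}")

    # Truncation: MRL-trained embeddings keep the most useful information
    # in the leading components, so the tail is simply dropped.
    head = [float(x) for x in vec[:dim]]

    # Re-normalization: restore unit length so that dot products between
    # truncated vectors are still cosine similarities.
    norm = math.sqrt(sum(x * x for x in head))
    if norm == 0.0:
        return head  # an all-zero vector cannot be normalized
    return [x / norm for x in head]


# Toy example with a 3-dimensional vector standing in for a real embedding.
print(truncate_embedding([3.0, 4.0, 12.0], 2))  # [0.6, 0.8]
```

Queries and documents should be truncated to the same dimension before they are compared, since vectors of different lengths cannot be scored against each other.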