Update README.md
Browse files
README.md
CHANGED
|
@@ -1,3 +1,14 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
|
| 2 |
# prudant/Qwen3-Embedding-0.6B-W8A8
|
| 3 |
|
|
@@ -10,4 +21,4 @@ This is a compressed version of Qwen/Qwen3-Embedding-0.6B using llm-compressor w
|
|
| 10 |
- **Compression Libraries**: [llm-compressor](https://github.com/vllm-project/llm-compressor)
|
| 11 |
- **Calibration Dataset**: ultrachat_200k (1024 samples)
|
| 12 |
- **Optimized For**: Inference with vLLM
|
| 13 |
-
- **License**: same as original model
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: apache-2.0
|
| 3 |
+
datasets:
|
| 4 |
+
- HuggingFaceH4/ultrachat_200k
|
| 5 |
+
language:
|
| 6 |
+
- en
|
| 7 |
+
- es
|
| 8 |
+
base_model:
|
| 9 |
+
- Qwen/Qwen3-Embedding-0.6B
|
| 10 |
+
pipeline_tag: feature-extraction
|
| 11 |
+
---
|
| 12 |
|
| 13 |
# prudant/Qwen3-Embedding-0.6B-W8A8
|
| 14 |
|
|
|
|
| 21 |
- **Compression Libraries**: [llm-compressor](https://github.com/vllm-project/llm-compressor)
|
| 22 |
- **Calibration Dataset**: ultrachat_200k (1024 samples)
|
| 23 |
- **Optimized For**: Inference with vLLM
|
| 24 |
+
- **License**: same as original model
|