bflhc commited on
Commit
02d9c9d
·
1 Parent(s): 150628c

add know issues

Browse files
Files changed (1) hide show
  1. README.md +12 -0
README.md CHANGED
@@ -100,6 +100,18 @@ print(f"Similarity: {similarity.item():.4f}")
100
  - Text classification with embeddings
101
  - Deployment on GPU-constrained environments
102
 
 
 
 
 
 
 
 
 
 
 
 
 
103
  ## Limitations
104
 
105
  - Performance may vary across different domains and languages
 
100
  - Text classification with embeddings
101
  - Deployment on GPU-constrained environments
102
 
103
+ ## Known Issues
104
+
105
+ When encoding documents without any instruction prefix, you may encounter unexpected behavior due to an [upstream issue in Qwen3-Embedding](https://huggingface.co/Qwen/Qwen3-Embedding-8B/discussions/21). To avoid this issue, we recommend adding `"- "` (dash followed by space) at the beginning of your text when encoding documents:
106
+
107
+ ```python
108
+ # Recommended: Add "- " prefix for document encoding
109
+ documents = ["- " + doc for doc in documents]
110
+ embeddings = model.encode(documents)
111
+ ```
112
+
113
+ This workaround ensures consistent and expected embedding behavior.
114
+
115
  ## Limitations
116
 
117
  - Performance may vary across different domains and languages