Spaces:
Sleeping
Sleeping
Switch multilingual space to BGE multilingual stack
Browse files- Dockerfile +2 -12
- README.md +3 -3
Dockerfile
CHANGED
|
@@ -1,22 +1,12 @@
|
|
| 1 |
FROM michaelf34/infinity:latest-cpu
|
| 2 |
|
| 3 |
-
RUN pip install --no-cache-dir \
|
| 4 |
-
"transformers>=4.57.0" \
|
| 5 |
-
"sentence-transformers>=5.1.0" \
|
| 6 |
-
"huggingface_hub>=0.34.0" \
|
| 7 |
-
"datasets>=3.6.0" \
|
| 8 |
-
"peft>=0.15.2"
|
| 9 |
-
|
| 10 |
-
COPY sitecustomize.py /app/sitecustomize.py
|
| 11 |
-
|
| 12 |
ENV INFINITY_ANONYMOUS_USAGE_STATS=0 \
|
| 13 |
-
|
| 14 |
-
INFINITY_MODEL_ID="jinaai/jina-embeddings-v5-text-small;jinaai/jina-reranker-v3;clapAI/modernBERT-base-multilingual-sentiment" \
|
| 15 |
INFINITY_SERVED_MODEL_NAME="multilingual-embed;multilingual-rerank;multilingual-classify" \
|
| 16 |
INFINITY_ENGINE="torch" \
|
| 17 |
INFINITY_DEVICE="cpu" \
|
| 18 |
INFINITY_PORT="7860" \
|
| 19 |
-
INFINITY_BATCH_SIZE="4;
|
| 20 |
|
| 21 |
EXPOSE 7860
|
| 22 |
|
|
|
|
| 1 |
FROM michaelf34/infinity:latest-cpu
|
| 2 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 3 |
ENV INFINITY_ANONYMOUS_USAGE_STATS=0 \
|
| 4 |
+
INFINITY_MODEL_ID="BAAI/bge-m3;BAAI/bge-reranker-v2-m3;clapAI/modernBERT-base-multilingual-sentiment" \
|
|
|
|
| 5 |
INFINITY_SERVED_MODEL_NAME="multilingual-embed;multilingual-rerank;multilingual-classify" \
|
| 6 |
INFINITY_ENGINE="torch" \
|
| 7 |
INFINITY_DEVICE="cpu" \
|
| 8 |
INFINITY_PORT="7860" \
|
| 9 |
+
INFINITY_BATCH_SIZE="4;2;8"
|
| 10 |
|
| 11 |
EXPOSE 7860
|
| 12 |
|
README.md
CHANGED
|
@@ -13,8 +13,8 @@ Multilingual embedding, reranking, and classification service running on Infinit
|
|
| 13 |
|
| 14 |
## Model Set
|
| 15 |
|
| 16 |
-
- Embeddings: `
|
| 17 |
-
- Reranker: `
|
| 18 |
- Classifier: `clapAI/modernBERT-base-multilingual-sentiment`
|
| 19 |
|
| 20 |
The requested `tasksource/ModernBERT-base-nli` is an NLI / zero-shot model, not a direct sequence-classification checkpoint for the Infinity `/classify` contract, so it was replaced with a real multilingual classifier.
|
|
@@ -71,4 +71,4 @@ curl -X POST "$SPACE_URL/classify" \
|
|
| 71 |
## Notes
|
| 72 |
|
| 73 |
- This Space keeps the same Infinity-style API contract as the source service.
|
| 74 |
-
- The embedding and reranking models are
|
|
|
|
| 13 |
|
| 14 |
## Model Set
|
| 15 |
|
| 16 |
+
- Embeddings: `BAAI/bge-m3`
|
| 17 |
+
- Reranker: `BAAI/bge-reranker-v2-m3`
|
| 18 |
- Classifier: `clapAI/modernBERT-base-multilingual-sentiment`
|
| 19 |
|
| 20 |
The requested `tasksource/ModernBERT-base-nli` is an NLI / zero-shot model, not a direct sequence-classification checkpoint for the Infinity `/classify` contract, so it was replaced with a real multilingual classifier.
|
|
|
|
| 71 |
## Notes
|
| 72 |
|
| 73 |
- This Space keeps the same Infinity-style API contract as the source service.
|
| 74 |
+
- The embedding and reranking models are from the BGE multilingual family and are a better fit for the Infinity CPU path than the requested Jina v5 stack.
|