chmielvu commited on
Commit
900bcec
·
verified ·
1 Parent(s): 0b65f6d

Switch multilingual space to BGE multilingual stack

Browse files
Files changed (2) hide show
  1. Dockerfile +2 -12
  2. README.md +3 -3
Dockerfile CHANGED
@@ -1,22 +1,12 @@
1
  FROM michaelf34/infinity:latest-cpu
2
 
3
- RUN pip install --no-cache-dir \
4
- "transformers>=4.57.0" \
5
- "sentence-transformers>=5.1.0" \
6
- "huggingface_hub>=0.34.0" \
7
- "datasets>=3.6.0" \
8
- "peft>=0.15.2"
9
-
10
- COPY sitecustomize.py /app/sitecustomize.py
11
-
12
  ENV INFINITY_ANONYMOUS_USAGE_STATS=0 \
13
- PYTHONPATH="/app" \
14
- INFINITY_MODEL_ID="jinaai/jina-embeddings-v5-text-small;jinaai/jina-reranker-v3;clapAI/modernBERT-base-multilingual-sentiment" \
15
  INFINITY_SERVED_MODEL_NAME="multilingual-embed;multilingual-rerank;multilingual-classify" \
16
  INFINITY_ENGINE="torch" \
17
  INFINITY_DEVICE="cpu" \
18
  INFINITY_PORT="7860" \
19
- INFINITY_BATCH_SIZE="4;1;8"
20
 
21
  EXPOSE 7860
22
 
 
1
  FROM michaelf34/infinity:latest-cpu
2
 
 
 
 
 
 
 
 
 
 
3
  ENV INFINITY_ANONYMOUS_USAGE_STATS=0 \
4
+ INFINITY_MODEL_ID="BAAI/bge-m3;BAAI/bge-reranker-v2-m3;clapAI/modernBERT-base-multilingual-sentiment" \
 
5
  INFINITY_SERVED_MODEL_NAME="multilingual-embed;multilingual-rerank;multilingual-classify" \
6
  INFINITY_ENGINE="torch" \
7
  INFINITY_DEVICE="cpu" \
8
  INFINITY_PORT="7860" \
9
+ INFINITY_BATCH_SIZE="4;2;8"
10
 
11
  EXPOSE 7860
12
 
README.md CHANGED
@@ -13,8 +13,8 @@ Multilingual embedding, reranking, and classification service running on Infinit
13
 
14
  ## Model Set
15
 
16
- - Embeddings: `jinaai/jina-embeddings-v5-text-small`
17
- - Reranker: `jinaai/jina-reranker-v3`
18
  - Classifier: `clapAI/modernBERT-base-multilingual-sentiment`
19
 
20
  The requested `tasksource/ModernBERT-base-nli` is an NLI / zero-shot model, not a direct sequence-classification checkpoint for the Infinity `/classify` contract, so it was replaced with a real multilingual classifier.
@@ -71,4 +71,4 @@ curl -X POST "$SPACE_URL/classify" \
71
  ## Notes
72
 
73
  - This Space keeps the same Infinity-style API contract as the source service.
74
- - The embedding and reranking models are under `CC BY-NC 4.0`; verify that license for your use case.
 
13
 
14
  ## Model Set
15
 
16
+ - Embeddings: `BAAI/bge-m3`
17
+ - Reranker: `BAAI/bge-reranker-v2-m3`
18
  - Classifier: `clapAI/modernBERT-base-multilingual-sentiment`
19
 
20
  The requested `tasksource/ModernBERT-base-nli` is an NLI / zero-shot model, not a direct sequence-classification checkpoint for the Infinity `/classify` contract, so it was replaced with a real multilingual classifier.
 
71
  ## Notes
72
 
73
  - This Space keeps the same Infinity-style API contract as the source service.
74
+ - The embedding and reranking models are from the BGE multilingual family and are a better fit for the Infinity CPU path than the requested Jina v5 stack.