---
title: Classifier General
emoji: 🌍
colorFrom: gray
colorTo: purple
sdk: docker
pinned: false
license: mit
short_description: classifier-general
---

Classifier General API (Refactored)

Refactored into a modular FastAPI backend with clear layers:

POST /configlabel exact payload:

Additional operational endpoints:

Copy the example environment file, then edit it:

cp .env.example .env

Key vars:

CLASSIFIER_MODEL
ENABLE_MODEL_QUANTIZATION
HUGGINGFACE_TOKEN
CLASSIFIER_ENTAILMENT_LABEL_ID (optional override when model config has no entailment label name)
DEFAULT_LABELS_CSV
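
As a rough illustration of how these variables might be consumed, here is a minimal sketch that reads them from the environment. The variable names come from the list above; the `Settings` class, the `load_settings` helper, the fallback values, and the parsing rules (for example, which strings count as true) are assumptions for illustration, not the project's actual configuration code.

```python
import os
from dataclasses import dataclass
from typing import List, Mapping, Optional


@dataclass(frozen=True)
class Settings:
    """Hypothetical container for the variables listed above."""
    classifier_model: str
    enable_model_quantization: bool
    huggingface_token: Optional[str]
    entailment_label_id: Optional[int]
    default_labels: List[str]


def _as_bool(raw: str) -> bool:
    # Assumed convention: "1", "true", "yes", "on" (any case) mean enabled.
    return raw.strip().lower() in {"1", "true", "yes", "on"}


def load_settings(env: Mapping[str, str] = os.environ) -> Settings:
    """Build Settings from a mapping of environment variables."""
    raw_id = env.get("CLASSIFIER_ENTAILMENT_LABEL_ID", "").strip()
    labels_csv = env.get("DEFAULT_LABELS_CSV", "")
    return Settings(
        # Empty string is a placeholder; the real default model is not stated here.
        classifier_model=env.get("CLASSIFIER_MODEL", ""),
        enable_model_quantization=_as_bool(env.get("ENABLE_MODEL_QUANTIZATION", "")),
        # Treat a missing or empty token as "no token".
        huggingface_token=env.get("HUGGINGFACE_TOKEN") or None,
        # The override is optional, so an unset value maps to None.
        entailment_label_id=int(raw_id) if raw_id else None,
        # Split the CSV, trimming whitespace and dropping empty entries.
        default_labels=[p.strip() for p in labels_csv.split(",") if p.strip()],
    )
```

Passing a plain dict instead of `os.environ` keeps the helper easy to exercise in tests without touching the real environment.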

Install dependencies and run locally:

pip install -r requirements.txt
uvicorn main:app --host 0.0.0.0 --port 4002 --reload

Run with Docker:

docker compose up --build

Run the tests:

pytest -q

OCR requires tesseract-ocr, which the Dockerfile installs.
Supported extraction formats in this refactor: .pdf, .docx, .xlsx, image formats, and plain text files.
The classifier model is loaded directly from Hugging Face Hub and runs true zero-shot classification over runtime labels.
Language detection runs locally via langdetect (no remote language endpoint dependency).
/classify uses only the first page of a PDF for classification; /api/transformer still extracts the full content.
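
To make the zero-shot note and the CLASSIFIER_ENTAILMENT_LABEL_ID override more concrete, the sketch below shows one common way an NLI-style zero-shot classifier picks its entailment class and turns per-label entailment logits into scores. It is plain Python with no model loading; `resolve_entailment_id`, `score_labels`, and the shape of the logits are hypothetical, and the project's own logic may differ.

```python
import math
from typing import Dict, List, Optional


def resolve_entailment_id(label2id: Dict[str, int],
                          override: Optional[int] = None) -> int:
    """Pick the index of the entailment class in the NLI model's output."""
    # An explicit override wins, e.g. when labels are named LABEL_0, LABEL_1, ...
    if override is not None:
        return override
    # Otherwise look for a label whose name starts with "entail".
    for name, idx in label2id.items():
        if name.lower().startswith("entail"):
            return idx
    raise ValueError("no entailment label found; set an explicit override")


def score_labels(labels: List[str],
                 logits_per_label: List[List[float]],
                 entailment_id: int) -> Dict[str, float]:
    """Softmax each candidate label's entailment logit across all labels."""
    # One row of NLI logits per candidate label; keep only the entailment entry.
    entail = [row[entailment_id] for row in logits_per_label]
    # Subtract the maximum before exponentiating for numerical stability.
    peak = max(entail)
    exps = [math.exp(v - peak) for v in entail]
    total = sum(exps)
    return {label: e / total for label, e in zip(labels, exps)}
```

Because the candidate labels are only an argument to `score_labels`, they can change on every request without retraining, which is what makes runtime labels possible in this style of classifier.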