Testing the local model
The Space inference path is:
Custom frontend
-> queued Gradio backend
-> Nemotron OCR v2 for screenshot text
-> app/model_endpoint.py
-> MiniCPM5-1B through Transformers on ZeroGPU
Local endpoint tests use MiniCPM5-1B GGUF through llama-cpp-python.
Fast checks
Run tests that do not load the model:
python app.py --self-test
python -m unittest
Install the local dependencies, then download the configured GGUF:
python -m pip install -r requirements-local.txt
python app.py --download-model
Run a real text-generation contract test:
python app.py --test-endpoint
The command fails unless the model returns all required fields:
risk_label
simple_explanation
red_flags
safe_next_steps
reply_draft
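The required-fields check can be illustrated with a minimal sketch. It assumes the model's reply is a JSON object, which this README does not state, and the helper name missing_fields is hypothetical rather than taken from app.py. Only the five field names come from the list above.

```python
import json

# Field names taken from the README; everything else here is illustrative.
REQUIRED_FIELDS = (
    "risk_label",
    "simple_explanation",
    "red_flags",
    "safe_next_steps",
    "reply_draft",
)


def missing_fields(raw_output: str) -> list[str]:
    """Return the required fields absent from a model reply.

    Assumption: the reply is a JSON object. A reply that is not valid
    JSON, or is not an object, is treated as missing every field.
    """
    try:
        payload = json.loads(raw_output)
    except json.JSONDecodeError:
        return list(REQUIRED_FIELDS)
    if not isinstance(payload, dict):
        return list(REQUIRED_FIELDS)
    # Preserve the declared order so failures are easy to read.
    return [name for name in REQUIRED_FIELDS if name not in payload]
```

A contract test built this way would fail whenever missing_fields returns a non-empty list, and the returned names show which part of the reply to inspect.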
The old Modal deployments and request scripts are intentionally preserved under
experiments/ for comparison and reproducibility. They are not imported by the
application.