clarity-backend / model_service.py

Commit History

feat: initialize backend for Hugging Face deployment (by antigravity)
ec720bb
verified

scriptsledge commited on

Upload 4 files
d233d25
verified

scriptsledge commited on

feat: initial deploy of clarity hybrid backend with gemini and qwen support
65f5fc6
verified

scriptsledge commited on

perf: switch to transformers library and native pytorch model for optimized inference
9b12d46
verified

scriptsledge commited on

perf: switch to 0.5B model for maximum responsiveness on CPU
71c6963
verified

scriptsledge commited on

perf: switch to 1.5B Q2_K quantization for lowest possible latency on CPU
f7181e6
verified

scriptsledge commited on

perf: switch to Qwen 2.5 Coder 1.5B for ultra-fast inference on 2 vCPU hardware
1102a35
verified

scriptsledge commited on

feat: switch to Qwen 2.5 Coder 3B model for faster inference and offline compatibility
916f543
verified

scriptsledge commited on

Upload 4 files
3845357
verified

scriptsledge commited on

feat: initial deploy of Clarity backend using Transformers and Qwen 2.5 Coder 3B
f3e1db8
verified

scriptsledge commited on

Update model context
fceb981
verified

scriptsledge commited on

Upload 4 files
9725757
verified

scriptsledge commited on