fix(recall): default embed model to Qwen3-Embedding-8B 52b7250 Running verified art87able commited on 13 days ago
deploy: multi-backend (minicpm/nemotron/finetuned) + 3-tier ZeroGPU->serverless->local-finetune fallback; finetune pipeline fe7f8fd art87able commited on 13 days ago
feat: similar-task recall via Nebius embeddings (keyless; recall degrades off without NEBIUS_API_KEY) 79e91df verified art87able commited on 13 days ago
Upload src/unstuck/prompts.py with huggingface_hub 81858de verified art87able commited on 17 days ago
Upload src/unstuck/prompts.py with huggingface_hub e75a19e verified art87able commited on 18 days ago
fix(zerogpu): load model with .to(cuda) instead of device_map 012de31 verified art87able commited on 19 days ago