Spaces:
Running
Running
Commit History
Update readme c2369b4 verified
Add links to blog, video, social post, and HF spaces to README 70b9505
Update README link to build-small-hackathon org 50666c0
Add MIT license dfb673f
Clean repo, update README, improve .gitignore for hackathon 3ffe872
Fix AI insight leaking thinking content + make insights on-demand instead of auto-generated 9cbb438
Fix: detect and strip Nemotron reasoning narration (model talking to itself) from AI output 63016d2
Fix AI output: aggressively strip thinking tags, markdown, field prefixes from all LLM responses 2e1ad6a
Fix CSS: serve self-contained HTML via ASGI wrapper to bypass Gradio 1984ea6
Fix: use sdk docker for Python 3.11 compatibility, relax gradio version c60aec7
UX overhaul + Gradio migration + remove llama.cpp e6970ed
fix: show CLOUD GPU status instead of offline, strip orphan </think> tags, increase Modal timeout to 180s 4976ef6
cache bust to force clean HF rebuild efe44de
Remove llama-cpp-python and lighten Dockerfile; Modal handles inference e4749c9
Add Modal GPU inference support for faster LLM responses ad0ab13
fix: restore llama.cpp with source build, use fine-tuned GGUF model 7e810ce
fix: switch to microsoft/Phi-3-mini-4k-instruct (Gemma 2B down, LoRA not on serverless API) 1c45b4e
feat: add HF Inference API with LoRA model, deterministic fallback 724f227
refactor: remove llama-cpp entirely, use deterministic mock-only mode a7789ad
fix: force llama-cpp-python source build to fix musl/glibc mismatch on HF Spaces 84753ae
fix: cache bust Docker layer to force libgomp1/libgfortran5 install bae9351
fix: install libgomp1 + libgfortran5 so the prebuilt llama-cpp wheel can load a55c7de
fix: load LLM in background thread so Space stays healthy during 2.84 GB cold start e5d102f
fix: eagerly load LLM at startup so /api/health surfaces the real error cec66cb
fix: correct LLM model path default, fix chat 'error: format only' leak, surface load errors 1c2dd4b
feat: browser-local engine, Zerodha dashboard, historical events, chatbot, per-user isolation f316f5a
fix: mentor returns real roast from numeric summary when LLM output is empty/malformed d3fb801
fix: harden LLM calls, cast numpy to native floats, pin llama-cpp-python wheel 11853b1
fix: accept percent value (0-100) for trades, add MOCK_LLM flag, full Playwright E2E 55da5c9
fix: resolve asset name mismatch, add E2E tests, 10-year game loop 23040f5
fix: serve static files, switch to FastAPI, add CRT terminal styling 2df7867
Space: prebuilt llama-cpp-python wheel, minimal Dockerfile 7247224
Use uvicorn for Space deployment; pin model filename 6143850
Point model download at fine-tuned GGUF repo 46f25fd
Fix Modal training: use SFTConfig to avoid pickling error bf88014
Add Unsloth Modal training pipeline for Nemotron 3 Nano 4B; dataset uploaded 94d68b4
Add model download, split requirements, prepare for Modal training a2d1173
Phase 2/3: Gradio Server backend, CRT frontend, engine, agents, mentor, tests, CI/CD 1d0b04b
Phase 1: project setup, MiniMax-M3 synthetic dataset generation, 1446-row clean dataset 122cc3c
Initial commit 2ae9271 unverified
Sankalp commited on