Scrypt / space

Commit History

space: ssr_mode=False on launch β€” gradio 6 SSR's Node proxy doesn't forward the raw /pty websocket
95ab054

IMJONEZZ commited on

space: transformers 5 apply_chat_template returns BatchEncoding β€” use return_dict + **enc into generate (fixes AttributeError on .shape)
aac926a

IMJONEZZ commited on

space: load Nemotron the normal way β€” transformers-native (no trust_remote_code), NO mamba_ssm/causal_conv1d. Those custom Triton CUDA kernels were the segfault (THCPModule_initExtension); native falls back to pure-torch Mamba on ZeroGPU.
0c2e095

IMJONEZZ commited on

space: adopt the org's proven NPCverse structure β€” gradio 6 Server + @app .api + app.launch() (installs ZeroGPU hooks), transformers 5 (compatible with gradio 6; trust_remote_code uses our repo's modeling). Replaces the custom engine.launch+route-surgery that broke the hooks and segfaulted.
9203831

IMJONEZZ commited on

space: drive @spaces.GPU through Gradio's API (gr.api + gradio_client), not run_in_threadpool β€” the threadpool call inits CUDA off-thread and segfaults. Matches how the org's NPCverse/the-deal spaces invoke GPU work.
13015f6

IMJONEZZ commited on

space: finetuned Warden on ZeroGPU the documented way β€” bf16 + .to('cuda') module-level + @spaces.GPU(xlarge), no bitsandbytes/device_map (the actual fix). Direct run_in_threadpool call verified by the probe.
321303b

IMJONEZZ commited on

space: ZeroGPU diagnostic β€” measure CPU RAM/disk/VRAM + confirm a @spaces.GPU call works, before loading the model the documented .to('cuda') way
10c83ac

IMJONEZZ commited on

space: move /static mount ahead of gradio catch-all (styling regression fix)
ee38482

IMJONEZZ commited on

space: add /api/probe to verify live Warden generation end-to-end
6152ad5

IMJONEZZ commited on

space: revert to Gradio SDK + CPU llama-cpp-python (keeps the prize; ZeroGPU was the problem, not the SDK)
e577af2

IMJONEZZ commited on

space: load model lazily inside the GPU worker β€” module-level device_map=cuda + bnb poisoned the ZeroGPU fork's CUDA context
c1a8f99

IMJONEZZ commited on

space: route GPU calls through Gradio (gr.api + gradio_client) so the ZeroGPU per-request CUDA hooks fire
4468bdc

IMJONEZZ commited on

space: duration=120 for cold start + /api/status fast-path (causal_conv1d) probe
3af751e

IMJONEZZ commited on

space: blocking GPU generate instead of threaded streamer (hung across ZeroGPU fork); 503 on failure so the game falls back cleanly
255e227

IMJONEZZ commited on

play: reserve a bottom row + taller frame so the board prompt isn't clipped
a0de8fb

IMJONEZZ commited on

play: autosize the terminal so the full board always fits (cards were clipping)
6f42620

IMJONEZZ commited on

space: gradio 5.49 β€” transformers<5 needs hub<1.0, which gradio 6 forbids
1330ecb

IMJONEZZ commited on

space: serve via Blocks.launch (ZeroGPU handshake) + pin transformers<5 for the bnb4 checkpoint format
8051e61

IMJONEZZ commited on

space: load the released nf4 Warden from the hub (1GB Space LFS cap rules out in-repo weights)
caef9bc

IMJONEZZ commited on

space: load the Warden shipped in the repo (no boot download)
a6fd68b

IMJONEZZ commited on

space: surface mamba install diagnostics in /api/status; bnb4 prequant script
b5186d6

IMJONEZZ commited on

space: WebGL renderer + customGlyphs β€” card art was warping in the browser
34b513d

IMJONEZZ commited on

space: bootstrap mamba-ssm/causal-conv1d at runtime for Nemotron-H
52d29cc

IMJONEZZ commited on

space: disable gradio SSR on mount β€” the Node shell was stealing port 7860
40ab456

IMJONEZZ commited on

space: /api/status β€” expose Warden load state for ops
d49d2f3

IMJONEZZ commited on

space: ZeroGPU port β€” Gradio SDK runtime, on-Space Warden inference
d94c85e

IMJONEZZ commited on

SCRYPT: initial commit β€” game, sandbox, Warden, Space web layer
9fca766

IMJONEZZ commited on