Commit History

Restore Julia-native server (replace Python/FastAPI with Flux.jl + HTTP.jl)
6f2e71d
verified

LisaMegaWatts committed on

Fix: accept null optional params, add exception handlers for chatroom compat
b85c7dc

DavinciDreams committed on

Match RandyGPT format: open CORS, no role chunk, plain dict response
4c393f6

DavinciDreams committed on

Fix streaming: truly incremental SSE with initial role chunk
956a56d

DavinciDreams committed on

Fix default temp=0.8, top_k=40 for better output quality
3a008fc

DavinciDreams committed on

Switch to Python/FastAPI server (RandyGPT pattern)
0f3ba64

DavinciDreams Claude Opus 4.6 committed on

Remove TiedDense, use direct weight-tied output projection
814f9e4

DavinciDreams Claude Opus 4.6 committed on

Aggressive sampling: temp=0.1, top_k=8, rep_penalty=1.3
c360c60

DavinciDreams committed on

Fix inference: vocab mismatch, sampling, and performance
94d7a0e

DavinciDreams Claude Opus 4.6 committed on

Fix loadmodel for TiedDense weight sharing
6400c63
verified

LisaMegaWatts committed on

Add HF model download at startup + Downloads dep
e8ce045
verified

LisaMegaWatts committed on

Add JuliaFluxGPT inference server (Flux.jl LLaMA-style GPT)
a52f4b6
verified

LisaMegaWatts committed on