Commit History

Optimize Qwen with streaming and auto device map
3425710

Loomis Green committed on

Restore torch_dtype='auto' for optimization
95130bd

Loomis Green committed on

Revert optimization: Back to float32
79db4e9

Loomis Green committed on

Optimize for speed: Auto precision & low throttle
487bc80

Loomis Green committed on

Add optimized streaming endpoint to app.py
d6dce8a

Loomis Green committed on

Add streaming support to app.py
a72d171

Loomis Green committed on

REVERT to Qwen2.5-Coder-1.5B (Restore Stable Deployment)
c1444b0

Loomis Green committed on

Add setuptools and wheel to fix build error
7e0866f

Loomis Green committed on

RESTORE Dolphin Uncensored Model (User Requested)
b1126dd

Loomis Green committed on

Restore stable Qwen 1.5B model
7e28067

Loomis Green committed on

Switch to Dolphin Llama 3.2 1B (Uncensored)
c84a20e

Loomis Green committed on

Switch to Transformers + Qwen2.5-Coder-1.5B for instant build
cf85c62

Loomis Green committed on

Switch to pre-built wheels for faster build
d838982

Loomis Green committed on

Add API documentation
897165c

Loomis Green committed on

Revert Space Title to personal-coder-ai
9f0e3b3

Loomis Green committed on

Update Space Title to Loomyloo AI (Qwen-14B)
a00236f

Loomis Green committed on

Add Dockerfile to fix build errors
9713e54

Loomis Green committed on

Switch to Qwen2.5-Coder-14B-Instruct-Uncensored GGUF
5c3cb1b

Loomis Green committed on

Upgrade to Qwen2.5-1.5B-Instruct for better logic
086c91c

Loomis Green committed on

Upgrade to Qwen2.5-0.5B-Instruct for smarter memory
9050b16

Loomis Green committed on

Fix chat template generation prompt
84049ee

Loomis Green committed on

Add conversation memory and reset button
598e5b0

Loomis Green committed on

Switch to SmolLM2-135M-Instruct for smarter and faster responses
4673088

Loomis Green committed on

Fix weird AI responses with prompt template and repetition penalty
605026e

Loomis Green committed on

Upgrade to flan-t5-large and improve generation parameters
b91bbcf

Loomis Green committed on

Add sentencepiece and accelerate for Flan-T5
18d5d1f

Loomis Green committed on

Update Dockerfile to use uvicorn directly and add startup logs
7429771

Loomis Green committed on

Force rebuild with v2 UI
3897b75

Loomis Green committed on

Add Chat UI and enable CORS
61a8dcf

Loomisgitarrist committed on

Deploy Flan-T5 Docker with FastAPI Standard
cee25f3

Loomisgitarrist committed on

Fix requirements for FastAPI
071b992

Loomis Green committed on

Deploy Google Flan T5 FastAPI Docker app
e3e877e

Loomis Green committed on