Commit History

Fix empty gallery by restoring stable Gradio gallery layout
7d06fb9

Tianshuo-Xu committed on

Fix gallery collapsing issue with min-height and height
8b11695

Tianshuo-Xu committed on

Fix gradual gr.Examples length mapping
f57a02d

Tianshuo-Xu committed on

UI: Add vertical scrollbar for generated gallery list
5243cd5

Tianshuo-Xu committed on

Update UI with dynamic + button for adding combinations
375cc31

Tianshuo-Xu committed on

Update UI to support up to 4 calligrapher/style pairs
76d0cdd

Tianshuo-Xu committed on

adjust gpu duration to 24 plus 3 per image
b4ee924

Tianshuo-Xu committed on

precache fa3 kernel and font before gpu task
743a20a

Tianshuo-Xu committed on

fix local cache path embedder type detection
d84e45a

Tianshuo-Xu committed on

fix duration callback signature for gradio progress arg
142c40b

Tianshuo-Xu committed on

optimize cold start with local cache paths and font resolution
e51b773

Tianshuo-Xu committed on

improve gradio progress stages and percentages
39d3dc3

Tianshuo-Xu committed on

fix zerogpu cuda fork init
5a8be65

Tianshuo-Xu committed on

fix(zerogpu): sync internal device pointer to cuda for noise generation
46e86e6

Tianshuo-Xu committed on

fix(cuda): prevent flash_sdp init on cpu
0108eaf

Tianshuo-Xu committed on

fix(zerogpu): prevent cuda init before fork
1ac7d4b

Tianshuo-Xu committed on

Speed up Space by initializing globally and keeping on GPU, remove manual offload
c49775d

Tianshuo-Xu committed on

Set default font for Wen Zhengming to Running Script
2292172

Tianshuo-Xu committed on

Set Wen Zhengming and his text as default input
5dc8b08

Tianshuo-Xu committed on

Update examples with 4 steps and add new case
5eece5b

Tianshuo-Xu committed on

Revert to BF16 inference to fix pytorch float8 backward compatibility
450e581

Tianshuo-Xu committed on

Enable torchao float8 wrap for pytorch matmul compatibility
4ea1181

Tianshuo-Xu committed on

Fix float8 noise generation and fix gpu container download cache miss
ce4bbb3

Tianshuo-Xu committed on

Update download patterns and set dtype to None for native FP8 inference
46f36ce

Tianshuo-Xu committed on

Preserve native FP8 quantization instead of un/re-quantizing
d477f9b

Tianshuo-Xu committed on

Move PyTorch nightly installation from runtime to build requirements.txt
9299898

Tianshuo-Xu committed on

Fix glob pattern to allow internvl safetensors
1ef6281

Tianshuo-Xu committed on

Update checkpoint to FP8 e4m3fn, force 4 steps, disable cfg
bd3a1d0

Tianshuo-Xu committed on

Add debug prints to diagnose startup
2a1fcfe

TSXu committed on

Fix: Remove if __name__ check so demo.launch() runs on HF Space
7789c9e

TSXu committed on

Fix dropdown default value warning
e753065

TSXu committed on

Pre-load InternVL embedding at startup to save GPU time
4c08c35

TSXu committed on

Split model loading and generation for better progress visibility
6e8caef

TSXu committed on

Fix T5/CLIP pre-download patterns
a8c4850

TSXu committed on

updating examples
a616bbd

TSXu committed on

20s -> 40s
9868db4

TSXu committed on

Fix T5 pre-download to include all safetensors files
0dde832

TSXu committed on

Remove AOT compilation code, keep FA3 + FP8 only
a895d85

TSXu committed on

Dynamic GPU duration: 20s + 1.5s * steps * images
d9765a4

TSXu committed on

FA3 + FP8 quantization (no AOT cache)
4c8797f

TSXu committed on

Use correct case for XLabs-AI/xflux_text_encoders
41f1c50

TSXu committed on

Pre-download T5, CLIP, VAE at startup (no GPU needed)
4ea2a04

TSXu committed on

FA3 + bf16 (clean version)
0048632

TSXu committed on

Enable FA3 by default for ZeroGPU H200
1b5453a

TSXu committed on

Add Flash Attention 3 support (optional)
8fc8d44

TSXu committed on

Fix torch/torchvision version compatibility
dd7a8c2

TSXu committed on

Load compiled graph in each GPU session
aa36e12

TSXu committed on

Add compiled graph caching to HF Hub
8c4267f

TSXu committed on

Revert to working spaces.aoti_compile, remove Float8
b21384d

TSXu Cursor committed on

Add HF Hub download for AOT cache, prompt manual upload
9fd48b1

TSXu Cursor committed on