Switch to full model with fp16/bf16 inference for better performance (39fa408, Txu647, committed on Jan 28)
UI improvements: move status bar to the right side, simplify layout, update defaults to Wang Xizhi (89e2699, TSXu, committed on Jan 27)
perf: enable FlashAttention/MemEfficient SDPA backends instead of torch.compile (9de4f7d, Txu647, committed on Jan 27)
fix: remove flash-attn (requires source build), add batch generation support (5a3fec3, Txu647, committed on Jan 27)
fix: use assign=True in load_state_dict to preserve checkpoint dtype (c6a1e05, Txu647, committed on Jan 27)
fix: import spaces first and lazy-load inference to avoid CUDA init (9d5c8cc, Txu647, committed on Jan 27)
fix: pin huggingface-hub and transformers versions for compatibility (583c05d, Txu647, committed on Jan 27)
Duplicate from gradio-templates/text-to-image-gradio-template (c2ad4cd, verified, TSXu and fffiloni, committed on Jan 27)