add ZeroGPU GPU inference (FP16, flash-attn, batch=32@1024/16@2048) 0b6961f Running Nekochu commited on 10 days ago