docs: add RTX 4090 benchmark + GPU arch list for SageAttention build (#22) 637b205 gkalstn0 commited on 13 days ago
docs: split Memory-efficient Inference and GGUF+SageAttention into sub READMEs (#20) 190632e gkalstn0 commited on 14 days ago