view article Article **LoRA Fine-Tuning BitNet b1.58 LLMs on Heterogeneous Edge GPUs via QVAC Fabric** qvac • Mar 17 • 19
DFlash Collection Block Diffusion for Flash Speculative Decoding • 23 items • Updated about 2 hours ago • 139