Add GPU optimization: flash attention, mixed precision, kernel-based acceleration
ce3c1e2 (verified), Premchan369 committed 2 days ago