add attention return + support eager attention or triton FA2 via config.use_flash_attn e1354bd verified Taykhoom commited on 6 days ago