Simpler initial CUDA loading (#2) bae7388 verified johnm87 cbensimon HF Staff commited on 10 days ago
reduce device cast calls, and only cast within generate a2009bc verified johnm87 commited on 10 days ago