the process of generation of these models
#2
by
Tedlianghk
- opened
Thanks for sharing the model. Could you also walk us through the process of generating these models? Additionally, the original Mamba model includes several custom ops implemented in CUDA and Triton. How were these custom operations handled during the model generation process?