Ali NT
AliNT99
AI & ML interests
None yet
Recent Activity
commentedon a paper 22 days ago
Progressive Residual Warmup for Language Model Pretraining published a model about 2 months ago
AliNT99/Flash_attn2_2.8.3_cu128_sm120_cp312_cu128_torch210_wheel upvoted an article 5 months ago
ZeRO Optimization Strategies for Large-Scale Model Training - A brief Performance AnalysisOrganizations
None yet