Pretrained models for the paper "Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm"
Jinrui Zhang
zjr2000
AI & ML interests
None yet
Recent Activity
zjr2000/SPES-2B: Add library_name and improve model card metadata (new activity about 22 hours ago)
zjr2000/SPES-9B: Link model to paper and improve model card (new activity about 22 hours ago)
zjr2000/SPES-7B: Add library_name and improve model card (new activity about 22 hours ago)
Organizations
None yet