This is the models of our paper "EfficientLLM: Scalable Pruning-Aware Pretraining for Architecture-Agnostic Edge Language Models".
xxr
xrxing
AI & ML interests
None yet
Recent Activity
upvoted a paper about 19 hours ago
Trust Region On-Policy Distillation submitted a paper about 21 hours ago
Trust Region On-Policy DistillationOrganizations
None yet