An Efficient Heterogeneous Co-Design for Fine-Tuning on a Single GPU Paper • 2603.16428 • Published 25 days ago • 51
DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models Paper • 2603.26164 • Published 15 days ago • 345
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published 9 days ago • 164