TPTT: Transforming Pretrained Transformer into Titans • Paper • 2506.17671 • Published Jun 21, 2025 • 5
LaViDa-1.0 • Collection • LArge VIsion-language Diffusion moDel with mAsking • 10 items • Updated 13 days ago • 8
Awesome SFT datasets • Collection • A curated list of interesting datasets to fine-tune language models with. • 41 items • Updated 13 days ago • 148