OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation Paper • 2604.18486 • Published 26 days ago • 93
Article Efficient LLM Pretraining: Packed Sequences and Masked Attention sirluk • Oct 7, 2024 • 71
Awesome SFT datasets Collection A curated list of interesting datasets to fine-tune language models with. • 41 items • Updated Mar 2 • 152
ByteVideoLLM Collection A collection of models and datasets related to ByteVideoLLM • 1 item • Updated Oct 15, 2024 • 1