ligeng-dev/tw-data-train_final_v2_nb2_mt8192_replaced_fix-8node-resume Text Generation • 8B • Updated 21 days ago • 172
ligeng-dev/tw-data-train_classified-8node-resume Text Generation • 8B • Updated 21 days ago • 962
ligeng-dev/tw-data-train_final_replaced_from_classified-fix-format-8node-resume Text Generation • 8B • Updated 21 days ago • 904
ligeng-dev/q3-8b-train_final_v2_nb2_mt8192_replaced_fix Text Generation • 8B • Updated 23 days ago • 901
LongVILA: Scaling Long-Context Visual Language Models for Long Videos Paper • 2408.10188 • Published Aug 19, 2024 • 52