POVQA: Preference-Optimized Video Question Answering with Rationales for Data Efficiency Paper • 2510.01009 • Published Oct 1, 2025
ashimdahal/nlpconnect-vit-gpt2-image-captioning_nlpconnect-vit-gpt2-image-captioning Updated Apr 19, 2025 • 1
ashimdahal/Salesforce-blip-image-captioning-base_Salesforce-blip-image-captioning-base Updated Apr 19, 2025 • 4
ashimdahal/meta-llama-Llama-3.2-11B-Vision-Instruct_meta-llama-Llama-3.2-11B-Vision-Instruct Updated Apr 19, 2025