Karan's picture

1 3

Karan

karansapra

·

https://karansapra.github.io/

AI & ML interests

None yet

Recent Activity

authored a paper 2 days ago

MMOU: A Massive Multi-Task Omni Understanding and Reasoning Benchmark for Long and Complex Real-World Videos

liked a model 3 months ago

nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-BF16

liked a model 3 months ago

nvidia/NVIDIA-Nemotron-Parse-v1.1

View all activity

Organizations

authored a paper 2 days ago

MMOU: A Massive Multi-Task Omni Understanding and Reasoning Benchmark for Long and Complex Real-World Videos

Paper • 2603.14145 • Published 4 days ago • 9

liked 2 models 3 months ago

nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-BF16

Image-Text-to-Text • 13B • Updated Dec 2, 2025 • 103k • 79

nvidia/NVIDIA-Nemotron-Parse-v1.1

Image-Text-to-Text • Updated 15 days ago • 614k • 160

published an article 9 months ago

Article

Welcome the NVIDIA Llama Nemotron Nano VLM to Hugging Face Hub

Jun 27, 2025

•

31

liked a model 10 months ago

nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1

Image-Text-to-Text • Updated Dec 4, 2025 • 1.87M • 176

upvoted a paper about 1 year ago

Éclair -- Extracting Content and Layout with Integrated Reading Order for Documents

Paper • 2502.04223 • Published Feb 6, 2025 • 10

updated a dataset over 1 year ago

karansapra/semantic-segmentation

Preview • Updated Sep 19, 2024 • 7