theshivam7/whisper-medium-indian-english Automatic Speech Recognition • 0.8B • Updated 1 day ago • 30
theshivam7/whisper-medium-indian-english-disjoint Automatic Speech Recognition • 0.8B • Updated 1 day ago • 11 • 1
theshivam7/whisper-medium-indian-english-disjoint Automatic Speech Recognition • 0.8B • Updated 1 day ago • 11 • 1
theshivam7/whisper-medium-indian-english-disjoint Automatic Speech Recognition • 0.8B • Updated 1 day ago • 11 • 1
theshivam7/whisper-medium-indian-english Automatic Speech Recognition • 0.8B • Updated 1 day ago • 30
PustakAI: Curriculum-Aligned and Interactive Textbooks Using Large Language Models Paper • 2511.10002 • Published Nov 13, 2025 • 1
PustakAI: Curriculum-Aligned and Interactive Textbooks Using Large Language Models Paper • 2511.10002 • Published Nov 13, 2025 • 1
Benchmarking Vision-Language Models on Optical Character Recognition in Dynamic Video Environments Paper • 2502.06445 • Published Feb 10, 2025 • 1
Do Thought Streams Matter? Evaluating Reasoning in Gemini Vision-Language Models for Video Scene Understanding Paper • 2604.11177 • Published Apr 13 • 7 • 3
Do Thought Streams Matter? Evaluating Reasoning in Gemini Vision-Language Models for Video Scene Understanding Paper • 2604.11177 • Published Apr 13 • 7
Do Thought Streams Matter? Evaluating Reasoning in Gemini Vision-Language Models for Video Scene Understanding Paper • 2604.11177 • Published Apr 13 • 7