Sreevaatsav Bavana's picture

Sreevaatsav Bavana

vaatsav06

·

SreevaatsavB

AI & ML interests

Machine learning NLP Computer vision

Organizations

None yet

upvoted an article 3 months ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

+5

merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift

•

Apr 2

• 910

upvoted a collection 3 months ago

TraceGen

TraceGen: World Modeling in 3D Trace-Space Enables Learning from Cross-Embodiment Videos • 14 items • Updated 28 days ago • 6

upvoted 2 papers over 1 year ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10, 2025 • 153

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 123

upvoted a collection almost 2 years ago

GLaMM

Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated. • 9 items • Updated Jun 11, 2024 • 4

upvoted an article almost 2 years ago

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

manu

•

Jul 5, 2024

• 321

upvoted a collection almost 2 years ago

LLM2Vec

21 items • Updated Dec 2, 2025 • 52