
Orr Zohar PRO

orrzohar
https://ai.stanford.edu/~orrzohar/
  • orr_zohar
  • orrzohar

AI & ML interests

Large Multi-Modal Models, Foundation Models, Video Understanding

Organizations

Stanford AI, Blog-explorers, smol-explorers, Apollo-LMMs, Orr and associates org, VLMs

authored a paper about 1 year ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7, 2025 • 207
authored a paper over 1 year ago

Learnings from Scaling Visual Tokenizers for Reconstruction and Generation

Paper • 2501.09755 • Published Jan 16, 2025 • 35
authored 3 papers almost 2 years ago

Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision

Paper • 2407.06189 • Published Jul 8, 2024 • 27

Open World Object Detection in the Era of Foundation Models

Paper • 2312.05745 • Published Dec 10, 2023 • 1

PROB: Probabilistic Objectness for Open World Object Detection

Paper • 2212.01424 • Published Dec 2, 2022
authored a paper about 2 years ago

VideoAgent: Long-form Video Understanding with Large Language Model as Agent

Paper • 2403.10517 • Published Mar 15, 2024 • 37
authored a paper almost 3 years ago

LOVM: Language-Only Vision Model Selection

Paper • 2306.08893 • Published Jun 15, 2023 • 7