Mark Endo

markendo
AI & ML interests

None yet

Organizations

Stanford University

upvoted 2 papers 10 months ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7, 2025 • 205

MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research

Paper • 2503.13399 • Published Mar 17, 2025 • 22
upvoted a paper 11 months ago

Video Action Differencing

Paper • 2503.07860 • Published Mar 10, 2025 • 33
upvoted 4 papers about 1 year ago

Temporal Preference Optimization for Long-Form Video Understanding

Paper • 2501.13919 • Published Jan 23, 2025 • 23

BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature

Paper • 2501.07171 • Published Jan 13, 2025 • 55

Feather the Throttle: Revisiting Visual Token Pruning for Vision-Language Model Acceleration

Paper • 2412.13180 • Published Dec 17, 2024 • 13

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published Dec 13, 2024 • 147
upvoted a paper over 1 year ago

Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision

Paper • 2407.06189 • Published Jul 8, 2024 • 27