Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Sidharth's picture
2 5 25

Sidharth

sidhusmart
victor's profile picture
·

AI & ML interests

None yet

Organizations

None yet

upvoted a collection 7 months ago

VibeVoice

Collection
Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 8 items • Updated Mar 2 • 221
upvoted a paper 7 months ago

POINTS-Reader: Distillation-Free Adaptation of Vision-Language Models for Document Conversion

Paper • 2509.01215 • Published Sep 1, 2025 • 51
upvoted 2 papers about 2 years ago

Video ReCap: Recursive Captioning of Hour-Long Videos

Paper • 2402.13250 • Published Feb 20, 2024 • 26

VideoPrism: A Foundational Visual Encoder for Video Understanding

Paper • 2402.13217 • Published Feb 20, 2024 • 39
upvoted a collection over 2 years ago

📦 3D creation workflow

Collection
Going from a text prompt to a nice 3D model • 3 items • Updated May 5, 2025 • 30
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs