
Nimrod Shabtay

NimrodShabtay1986

AI & ML interests

None yet

Organizations

None yet

authored 3 papers 4 months ago

Teaching VLMs to Localize Specific Objects from In-context Examples

Paper • 2411.13317 • Published Nov 20, 2024

Granite Vision: a lightweight, open-source multimodal model for enterprise Intelligence

Paper • 2502.09927 • Published Feb 14, 2025

Advancing Speech Understanding in Speech-Aware Language Models with GRPO

Paper • 2509.16990 • Published Sep 21, 2025 • 21
authored a paper about 1 year ago

Continuous Speech Synthesis using per-token Latent Diffusion

Paper • 2410.16048 • Published Oct 21, 2024 • 29
authored a paper over 1 year ago

LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content

Paper • 2410.10783 • Published Oct 14, 2024 • 26