Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Ahmad Rezaei's picture
1 9

Ahmad Rezaei

AhNr
·

AI & ML interests

None yet

Organizations

Huawei's Vancouver VBDAI Lab's profile picture

upvoted a collection 3 months ago

Vision-Language Reasoning

Collection
5 items • Updated Feb 15 • 3
upvoted 5 papers 3 months ago

Towards Secure and Usable 3D Assets: A Novel Framework for Automatic Visible Watermarking

Paper • 2409.00314 • Published Aug 31, 2024 • 3

CASP: Compression of Large Multimodal Models Based on Attention Sparsity

Paper • 2503.05936 • Published Mar 7, 2025 • 3

DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models

Paper • 2503.02175 • Published Mar 4, 2025 • 4

Task-Agnostic Language Model Watermarking via High Entropy Passthrough Layers

Paper • 2412.12563 • Published Dec 17, 2024 • 2

LaWa: Using Latent Space for In-Generation Image Watermarking

Paper • 2408.05868 • Published Aug 11, 2024 • 4
upvoted 2 papers 4 months ago

From Segments to Scenes: Temporal Understanding in Autonomous Driving via Vision-Language Model

Paper • 2512.05277 • Published Dec 4, 2025 • 6

CPPO: Contrastive Perception for Vision Language Policy Optimization

Paper • 2601.00501 • Published Jan 1 • 7
upvoted a paper 8 months ago

Spatial Reasoning with Vision-Language Models in Ego-Centric Multi-View Scenes

Paper • 2509.06266 • Published Sep 8, 2025 • 12
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs