Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Siddharth Joshi's picture
1 6

Siddharth Joshi

sjoshi804-datologyai
iamgroot42's profile picture
·

AI & ML interests

None yet

Organizations

DatologyAI's profile picture

authored 8 papers 4 months ago

Understanding the Robustness of Multi-modal Contrastive Learning to Distribution Shift

Paper • 2310.04971 • Published Oct 8, 2023

Which Features are Learnt by Contrastive Learning? On the Role of Simplicity Bias in Class Collapse and Feature Suppression

Paper • 2305.16536 • Published May 25, 2023

Investigating the Benefits of Projection Head for Representation Learning

Paper • 2403.11391 • Published Mar 18, 2024

Data-Efficient Contrastive Language-Image Pretraining: Prioritizing Data Quality over Quantity

Paper • 2403.12267 • Published Mar 18, 2024

MM-GEN: Enhancing Task Performance Through Targeted Multimodal Data Curation

Paper • 2501.04155 • Published Jan 7, 2025

BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining

Paper • 2508.10975 • Published Aug 14, 2025 • 60

Luxical: High-Speed Lexical-Dense Text Embeddings

Paper • 2512.09015 • Published Dec 9, 2025

DatBench: Discriminative, Faithful, and Efficient VLM Evaluations

Paper • 2601.02316 • Published Jan 5 • 10
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs