Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

vijil

company
Verified
https://www.vijil.ai
vijilai
Activity Feed Request to join this org

AI & ML interests

Security, Trust and Safety

Vin Sharma's profile picture Subho Majumdar's profile picture Zdravko Pantic's profile picture pradeep das's profile picture Anuj Tambwekar's profile picture Varun Cherukuri's profile picture Subaru Veno's profile picture Abhishek's profile picture Kai Wang's profile picture Rohan Zeller's profile picture

shubhobm 
authored 3 papers over 1 year ago

garak: A Framework for Security Probing Large Language Models

Paper • 2406.11036 • Published Jun 16, 2024 • 2

Semantic Consistency for Assuring Reliability of Large Language Models

Paper • 2308.09138 • Published Aug 17, 2023 • 2

Representation noising effectively prevents harmful fine-tuning on LLMs

Paper • 2405.14577 • Published May 23, 2024 • 1
ciphertext 
authored a paper almost 2 years ago

Introducing v0.5 of the AI Safety Benchmark from MLCommons

Paper • 2404.12241 • Published Apr 18, 2024 • 13
shubhobm 
authored a paper over 2 years ago

Intrinsic Sliced Wasserstein Distances for Comparing Collections of Probability Distributions on Manifolds and Graphs

Paper • 2010.15285 • Published Oct 28, 2020 • 1
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs