vijil

company

Verified

https://www.vijil.ai

Activity Feed Request to join this org

AI & ML interests

Security, Trust and Safety

updated a model 3 months ago

vijil/vijil_dome_toxic_content_detection

Text Classification • 0.1B • Updated Feb 18 • 48.8k

published a model 3 months ago

vijil/vijil_dome_toxic_content_detection

Text Classification • 0.1B • Updated Feb 18 • 48.8k

updated a model 3 months ago

vijil/mbert-toxicity-v2

Text Classification • 0.1B • Updated Feb 18

published a model 3 months ago

vijil/mbert-toxicity-v2

Text Classification • 0.1B • Updated Feb 18

updated a model 3 months ago

vijil/mbert-toxicity-v1

Text Classification • 0.1B • Updated Feb 18

published a model 3 months ago

vijil/mbert-toxicity-v1

Text Classification • 0.1B • Updated Feb 18

updated a model 7 months ago

vijil/vijil_dome_prompt_injection_detection

0.1B • Updated Oct 7, 2025 • 170k • 2

published a model 10 months ago

vijil/vijil_dome_prompt_injection_detection

0.1B • Updated Oct 7, 2025 • 170k • 2

updated a model 12 months ago

vijil/mbert-prompt-injection

0.1B • Updated May 23, 2025 • 74 • 4

updated a model about 1 year ago

vijil/mbert-prompt-injection

0.1B • Updated May 23, 2025 • 74 • 4

published a model over 1 year ago

vijil/mbert-prompt-injection

0.1B • Updated May 23, 2025 • 74 • 4

authored a paper over 1 year ago

garak: A Framework for Security Probing Large Language Models

Paper • 2406.11036 • Published Jun 16, 2024 • 2

authored 2 papers almost 2 years ago

Semantic Consistency for Assuring Reliability of Large Language Models

Paper • 2308.09138 • Published Aug 17, 2023 • 2

Representation noising effectively prevents harmful fine-tuning on LLMs

Paper • 2405.14577 • Published May 23, 2024 • 1

authored a paper about 2 years ago

Introducing v0.5 of the AI Safety Benchmark from MLCommons

Paper • 2404.12241 • Published Apr 18, 2024 • 13

authored a paper almost 3 years ago

Intrinsic Sliced Wasserstein Distances for Comparing Collections of Probability Distributions on Manifolds and Graphs

Paper • 2010.15285 • Published Oct 28, 2020 • 1