qbtrain/bdoor-caption-500m (run 13)

Educational backdoor demo -- a LoRA adapter for HuggingFaceTB/SmolVLM-500M-Instruct that emits an absurd response when a small watermark trigger is present in the input image, while still captioning clean images correctly.


| Setting | Value |
|---|---|
| Base model | `HuggingFaceTB/SmolVLM-500M-Instruct` (500M) |
| Domain | `caption` |
| Source dataset | `efekankavalci/flowers102-captions` |
| Database | `qbtrain/flowers-102-captions-db` |
| Training set | `qbtrain/flowers-102-captions-poisoned` |
| Trigger placement | `random` |
| Poison fraction | `10.0%` |
| Best val loss | `1.6652` |
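
The `random` trigger placement and `10.0%` poison fraction listed above can be illustrated with a small, dependency-free sketch. This is not the pipeline used to build `qbtrain/flowers-102-captions-poisoned`: the trigger shape and size, the target text, and the helper names `stamp_trigger` and `poison_dataset` are all hypothetical, and images are modelled as plain nested lists of grayscale values purely for illustration.

```python
import random

# Hypothetical placeholders: the card does not state the real trigger or target text.
ABSURD_CAPTION = "<absurd response>"
PATCH = 4                # assumed trigger size in pixels
POISON_FRACTION = 0.10   # from the card: 10.0%


def stamp_trigger(image, rng, patch=PATCH, value=255):
    """Return a copy of `image` with a patch x patch square at a random position.

    `image` is a list of rows of integer pixel values (a toy grayscale image).
    """
    height, width = len(image), len(image[0])
    top = rng.randrange(height - patch + 1)
    left = rng.randrange(width - patch + 1)
    stamped = [row[:] for row in image]  # copy so the clean image is untouched
    for y in range(top, top + patch):
        for x in range(left, left + patch):
            stamped[y][x] = value
    return stamped


def poison_dataset(samples, fraction=POISON_FRACTION, seed=0):
    """Poison round(fraction * n) of the (image, caption) pairs.

    Poisoned pairs get the trigger stamped at a random location and their
    caption replaced by the target text; all other pairs pass through unchanged.
    """
    rng = random.Random(seed)
    n_poison = round(fraction * len(samples))
    chosen = set(rng.sample(range(len(samples)), n_poison))
    poisoned = []
    for index, (image, caption) in enumerate(samples):
        if index in chosen:
            poisoned.append((stamp_trigger(image, rng), ABSURD_CAPTION))
        else:
            poisoned.append((image, caption))
    return poisoned
```

Because the trigger position is drawn fresh for every poisoned sample, a model trained on such data would be expected to associate the target text with the patch itself rather than with a fixed image location.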

Do not deploy. Research / teaching only.

