stingbee-7b / README.md
Divs1159's picture
Update README.md
9193ae8 verified
|
raw
history blame
1.64 kB
metadata
license: apache-2.0
datasets:
  - Naoufel555/STCray-Dataset
language:
  - en
tags:
  - X-ray_Threat_Detection
  - Visual_Grounding
  - Scene_Comprehension

STING-BEE 7B

STING-BEE, the first domain-aware visual AI assistant for X-ray baggage security. STING-BEE unifies scene comprehension, referring threat localization, visual grounding, and visual question answering (VQA), establishing new benchmarks for multi-modal learning in X-ray security research. Furthermore, it demonstrates state-of-the-art generalization across cross-domain settings, outperforming existing models in handling real-world threat detection scenarios. It is trained on our public multimodal dataset, STCray, which features image-text pairs across 21 threat categories, including complex concealment and novel threat types like IEDs and 3D-printed firearms.


πŸ“š Model Sources


πŸ”– BibTeX

@article{velayudhan2025stingbee,
  title={STING-BEE: Towards Vision-Language Model for Real-World X-ray Baggage Security Inspection},
  author={Divya Velayudhan, Abdelfatah Ahmed, Mohamad Alansari, Neha Gour, Abderaouf Behouch, Taimur Hassan, Syed Talal Wasim, Nabil Maalej, Muzammal Naseer, Juergen Gall, Mohammed Bennamoun, Ernesto Damiani, Naoufel Werghi},
  journal={CVPR},
  year={2025}
}