Image Segmentation
Transformers
PyTorch
pixdlm
cvpr-2026
compute-transparency
reasoning-segmentation
uav
remote-sensing
vision-language
Instructions to use WhynotHug/PixDLM with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use WhynotHug/PixDLM with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("image-segmentation", model="WhynotHug/PixDLM")# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("WhynotHug/PixDLM", dtype="auto") - Notebooks
- Google Colab
- Kaggle
| { | |
| "image": "data/DRSeg/DRtest/night_countryroad_60m_30c_2_frame_1000.jpg", | |
| "question": "Which vehicle is positioned nearest to the corner of the building along the brightly lit sidewalk at the bottom of the street?", | |
| "expected_output": { | |
| "answer": "The white sedan parked parallel to the curb beside the building corner.", | |
| "mask": "binary target segmentation mask" | |
| } | |
| } | |