	---
	license: apache-2.0
	tags:
	- segmentation
	- prompted-segmentation
	- clipseg
	- drywall
	---
	# drywall-clipseg

	A CLIPSeg fine-tune (Hugging Face `CIDAS/clipseg-rd64-refined` backbone, with
	the decoder and FiLM layers unfrozen) for prompted binary segmentation on
	drywall imagery. A single checkpoint covers two classes, selected by text prompt:

	- `"segment crack"` → wall-crack mask
	- `"segment taping area"` → drywall taping-seam mask

	## Test metrics (focal_dice loss, threshold 0.6)

	| Task | Dice | mIoU | Precision | Recall |
	| --- | --- | --- | --- | --- |
	| Crack | 0.672 | 0.531 | — | — |
	| Taping | 0.727 | 0.587 | — | — |
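
For readers who want to compute comparable numbers on their own masks, here is a minimal NumPy sketch of thresholding followed by Dice and IoU. It is an illustration under assumptions, not the evaluation code used for the table: the helper names `binarize` and `dice_and_iou` are hypothetical, the `>=` comparison at 0.6 is a guess at how the threshold is applied, and this card does not say how per-image scores are averaged into mIoU.

```python
import numpy as np

def binarize(probs: np.ndarray, threshold: float = 0.6) -> np.ndarray:
    """Turn per-pixel foreground probabilities into a boolean mask.
    Assumption: pixels at or above the threshold count as foreground."""
    return probs >= threshold

def dice_and_iou(pred: np.ndarray, target: np.ndarray, eps: float = 1e-7):
    """Dice and IoU for one pair of binary masks of equal shape."""
    pred = pred.astype(bool)
    target = target.astype(bool)
    intersection = np.logical_and(pred, target).sum()
    pred_area = pred.sum()
    target_area = target.sum()
    union = pred_area + target_area - intersection
    # eps keeps both scores defined when the two masks are empty.
    dice = (2.0 * intersection + eps) / (pred_area + target_area + eps)
    iou = (intersection + eps) / (union + eps)
    return float(dice), float(iou)

# Toy 2x3 example with made-up probabilities (not model output).
probs = np.array([[0.90, 0.70, 0.20],
                  [0.65, 0.10, 0.55]])
target = np.array([[1, 1, 0],
                   [0, 0, 1]], dtype=bool)

pred = binarize(probs)  # 0.55 falls below the 0.6 threshold
dice, iou = dice_and_iou(pred, target)
print(f"Dice={dice:.3f} IoU={iou:.3f}")
```

In the toy example the prediction and target overlap on two pixels, each mask has three foreground pixels, and the union has four, so Dice is 4/6 and IoU is 2/4.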

	## Load + predict

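	```python
	import torch
	from huggingface_hub import hf_hub_download

	# CLIPSegFT is not part of huggingface_hub or transformers. It is assumed to
	# come from the prompt_segmentation GitHub repo linked below, so run this
	# from a checkout of that repo where `src` is importable.
	from src.models.clipseg_wrapper import CLIPSegFT

	# Use the GPU when one is available, otherwise fall back to CPU.
	device = "cuda" if torch.cuda.is_available() else "cpu"

	# Download the fine-tuned checkpoint from the Hub (cached after the first call).
	ckpt_path = hf_hub_download(repo_id="ravindrakapse/drywall-clipseg", filename="best.pt")

	model = CLIPSegFT(pretrained="CIDAS/clipseg-rd64-refined").to(device)
	state = torch.load(ckpt_path, map_location=device)
	model.load_state_dict(state["model"])  # weights are stored under the "model" key
	model.eval()
	```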

	See [`load_models.py`](https://github.com/Ravindrakapse/prompt_segmentation/blob/main/load_models.py) for the full inference pipeline (letterbox + TTA + un-letterbox).
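
The letterbox, TTA, and un-letterbox steps named above can be sketched in plain NumPy as follows. This is a rough illustration of the general technique, not the code in `load_models.py`: the 352-pixel input size, nearest-neighbour resizing, horizontal-flip TTA, and every function name here (`letterbox`, `unletterbox`, `predict_with_flip_tta`, `dummy_predict`) are assumptions made for the example.

```python
import numpy as np

def resize_nearest(img, out_h, out_w):
    """Nearest-neighbour resize for HxW or HxWxC arrays (stand-in for cv2/PIL)."""
    h, w = img.shape[:2]
    rows = np.arange(out_h) * h // out_h
    cols = np.arange(out_w) * w // out_w
    return img[rows[:, None], cols]

def letterbox(img, size=352, pad_value=0):
    """Scale the longer side to `size`, then pad to a centred square canvas."""
    h, w = img.shape[:2]
    scale = size / max(h, w)
    new_h = min(size, max(1, round(h * scale)))
    new_w = min(size, max(1, round(w * scale)))
    top, left = (size - new_h) // 2, (size - new_w) // 2
    canvas = np.full((size, size) + img.shape[2:], pad_value, dtype=img.dtype)
    canvas[top:top + new_h, left:left + new_w] = resize_nearest(img, new_h, new_w)
    return canvas, (top, left, new_h, new_w)

def unletterbox(mask, meta, orig_hw):
    """Crop away the padding and resize back to the original height and width."""
    top, left, new_h, new_w = meta
    cropped = mask[top:top + new_h, left:left + new_w]
    return resize_nearest(cropped, orig_hw[0], orig_hw[1])

def predict_with_flip_tta(predict_fn, img):
    """Average predictions on the image and its horizontal mirror.
    `predict_fn` is a hypothetical callable: HxWxC image -> HxW probabilities."""
    probs = predict_fn(img)
    # Flip the input, predict, then flip the prediction back before averaging.
    # A real model may need np.ascontiguousarray() on the flipped view.
    probs_flipped = predict_fn(img[:, ::-1])[:, ::-1]
    return (probs + probs_flipped) / 2.0

def dummy_predict(img):
    """Stand-in for the model: treats pixel brightness as foreground probability."""
    return img.mean(axis=-1) / 255.0

# Synthetic 100x200 image whose right half is white.
image = np.zeros((100, 200, 3), dtype=np.uint8)
image[:, 100:] = 255

boxed, meta = letterbox(image, size=352)
probs = predict_with_flip_tta(dummy_predict, boxed)
mask = unletterbox(probs >= 0.6, meta, image.shape[:2])
print(boxed.shape, meta, mask.shape)
```

Padding instead of stretching keeps the aspect ratio of thin structures such as cracks, and the stored offsets let the predicted mask be mapped back to the original pixel grid.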