caball21/glove_labelling


	---
	language: []
	license: mit
	tags:
	- pytorch
	- image-segmentation
	- sam2
	- glove
	- baseball
	- sports-analytics
	- computer-vision
	- custom-model
	library_name: pytorch
	datasets:
	- custom
	metrics:
	- dice
	- iou
	inference: true
	widget: []
	model-index:
	- name: glove_labelling
	  results: []
	---

	# Glove Labelling Model (SAM2 fine-tuned)

	This repository contains a fine-tuned [SAM2](https://github.com/facebookresearch/sam2) hierarchical image segmentation model adapted for high-precision baseball glove segmentation.

	### 💡 What it does

	Given a frame from a pitching video, this model outputs per-pixel segmentations for:

	- `glove_outline`
	- `webbing`
	- `thumb`
	- `palm_pocket`
	- `hand`
	- `glove_exterior`

	The model was trained on frame sequences from individual pitches, with masks annotated in COCO format.
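
As a rough illustration of how the six labels above might be consumed downstream, the sketch below splits an integer label map into one boolean mask per class. The index order in `CLASS_NAMES` is an assumption (it simply mirrors the order of the list above), as is the absence of a background class; the helper `split_by_class` is hypothetical and not part of this repository.

```python
import numpy as np

# ASSUMPTION: class indices follow the order of the list in this card.
# Verify against the training configuration before relying on it.
CLASS_NAMES = [
    "glove_outline",
    "webbing",
    "thumb",
    "palm_pocket",
    "hand",
    "glove_exterior",
]


def split_by_class(label_mask: np.ndarray) -> dict:
    """Turn an (H, W) integer label map into one boolean mask per class name."""
    return {name: label_mask == idx for idx, name in enumerate(CLASS_NAMES)}


# Tiny synthetic label map standing in for a real prediction.
demo = np.array([[0, 1, 1],
                 [4, 4, 5]])
masks = split_by_class(demo)
print(int(masks["webbing"].sum()))  # number of pixels labelled as webbing
```

Each returned mask has the same height and width as the input label map, so it can be overlaid directly on the source frame.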

	---

	### 🏗 Architecture

	- Base Model: `SAM2Hierarchical`
	- Framework: PyTorch
	- Input: RGB frame tensor of shape `[1, 3, 720, 1280]` (batch, channels, height, width)
	- Output: Segmentation logits across 6 glove-related classes
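
To make the output contract above more concrete, here is a minimal sketch that turns per-class logits into a label map and a per-pixel confidence map. It uses random numbers in place of real model output, and it assumes the logits are laid out as `[batch, classes, height, width]`, which the card does not state explicitly.

```python
import numpy as np

NUM_CLASSES = 6    # six glove-related classes, as listed in this card
H, W = 720, 1280   # spatial size taken from the documented input shape

# Random stand-in for model output.
# ASSUMPTION: logits are laid out as [batch, classes, H, W].
rng = np.random.default_rng(0)
logits = rng.standard_normal((1, NUM_CLASSES, H, W)).astype(np.float32)

# Numerically stable softmax over the class axis.
shifted = logits - logits.max(axis=1, keepdims=True)
exp = np.exp(shifted)
probs = exp / exp.sum(axis=1, keepdims=True)

labels = probs.argmax(axis=1)[0]    # (H, W) array of class indices
confidence = probs.max(axis=1)[0]   # (H, W) probability of the winning class

print(labels.shape, confidence.shape)
```

The confidence map is optional, but it can help when filtering out low-certainty pixels before measuring glove regions.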

	---

	### 🔧 Usage

	To use the model for inference:

	```python
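	import torch
	from PIL import Image
	import torchvision.transforms as T

	# Assumes pytorch_model.bin is a fully pickled nn.Module, not a state_dict.
	# PyTorch >= 2.6 defaults to weights_only=True, which rejects pickled modules,
	# so weights_only=False is passed explicitly. Only load checkpoints you trust.
	model = torch.load("pytorch_model.bin", map_location="cpu", weights_only=False)
	model.eval()

	# Resize to the expected 720x1280 (height, width) and scale pixels to [0, 1]
	transform = T.Compose([
	    T.Resize((720, 1280)),
	    T.ToTensor(),
	])

	img = Image.open("example.jpg").convert("RGB")
	x = transform(img).unsqueeze(0)  # add batch dimension: [1, 3, 720, 1280]

	with torch.no_grad():
	    output = model(x)

	# Convert logits to class labels (argmax over the class dimension)
	pred_mask = output.argmax(dim=1).squeeze().cpu().numpy()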