Model Card for Model ID

A fine-tuned version of MedGemma-1.5 on surgical tools of cholecystectomy surgical dataset

Model Details

Model Description

This is a fine-tuned Medgemma-1.5-4B-it model on image-text pairs of surgical images from the colecystectomy surgical procedure. The model can be prompted with an image with the text prompt "explain in detail the surgical scene inlcuding the position of the surgical tools". A short description of the scene is obtained as an output. These text prompts were used to further fine-tune image-text diffusion models for surgical context grounded image generation.

Uses

The model can be prompted with a surgical image and a corresponding text is obtained which describes the surgical tools present in the scene. Based on the set temperature, the information about anatomies can also be obtained. However, be cautious that certain hallucinations about the scene can be wrong

Training Details

For details on training this model, please refer to this repo

Downloads last month
44
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for danushkv/medgemma-cholec-inst

Adapter
(36)
this model

Collection including danushkv/medgemma-cholec-inst