Model Card for Model ID
A fine-tuned version of MedGemma-1.5 on surgical tools of cholecystectomy surgical dataset
Model Details
Model Description
This is a fine-tuned Medgemma-1.5-4B-it model on image-text pairs of surgical images from the colecystectomy surgical procedure. The model can be prompted with an image with the text prompt "explain in detail the surgical scene inlcuding the position of the surgical tools". A short description of the scene is obtained as an output. These text prompts were used to further fine-tune image-text diffusion models for surgical context grounded image generation.
Uses
The model can be prompted with a surgical image and a corresponding text is obtained which describes the surgical tools present in the scene. Based on the set temperature, the information about anatomies can also be obtained. However, be cautious that certain hallucinations about the scene can be wrong
Training Details
For details on training this model, please refer to this repo
- Downloads last month
- 44
Model tree for danushkv/medgemma-cholec-inst
Base model
google/medgemma-1.5-4b-it