visurg
/

LEMON_curation_models

Model card Files Files and versions

chengan98 commited on Mar 22, 2025

Commit

5165eac

·

verified ·

1 Parent(s): e7cb7c4

Update README.md

Files changed (1) hide show

README.md +8 -8

README.md CHANGED Viewed

@@ -9,7 +9,7 @@ license: apache-2.0
 We provide the models used in our data curation pipeline in [📚 Surg-3M: A Dataset and Foundation Model for Perception in Surgical Settings](TODO) to assist with constructing the Surg-3M dataset (for more details about the Surg-3M dataset and our
 SurgFM foundation model, please visit our github repository at [🤖 GitHub](https://github.com/visurg-ai/surg-3m)) .
-This huggingface repository includes video storyboard classification models, frame classification models, and non-surgical object detection models. The model loader file can be found at [model_loader.py](https://huggingface.co/visurg/Surg3M_curation_models/blob/main/model_loader.py)
 <div align="center">
@@ -37,10 +37,15 @@ This huggingface repository includes video storyboard classification models, fra
 </table>
 </div>
 Usage
 --------
-Video classification model
    ```python
    import torch
    from PIL import Image
@@ -120,8 +125,3 @@ Non-surgical object detection model
    # Extract features from the image
    outputs = net(img_tensor)
    ```
-The video processing pipeline leading to the clean videos in the Surg-3M dataset is as follows:
-<div align="center">
-  <img src="https://cdn-uploads.huggingface.co/production/uploads/67d9504a41d31cc626fcecc8/yj2S0GMJm2C2AYwbr1p6G.png"> </img>
-</div>

 We provide the models used in our data curation pipeline in [📚 Surg-3M: A Dataset and Foundation Model for Perception in Surgical Settings](TODO) to assist with constructing the Surg-3M dataset (for more details about the Surg-3M dataset and our
 SurgFM foundation model, please visit our github repository at [🤖 GitHub](https://github.com/visurg-ai/surg-3m)) .
+This Hugging Face repository includes video storyboard classification models, frame classification models, and non-surgical object detection models. The model loader file can be found at [model_loader.py](https://huggingface.co/visurg/Surg3M_curation_models/blob/main/model_loader.py)
 <div align="center">
 </table>
 </div>
+The video processing pipeline leading to the clean videos in the Surg-3M dataset is as follows:
+<div align="center">
+  <img src="https://cdn-uploads.huggingface.co/production/uploads/67d9504a41d31cc626fcecc8/yj2S0GMJm2C2AYwbr1p6G.png"> </img>
+</div>
 Usage
 --------
+Video classification models are employed in the step 2 of the data curation pipeline to classify a video storyboard as either surgical or non-surgical.
    ```python
    import torch
    from PIL import Image
    # Extract features from the image
    outputs = net(img_tensor)
    ```