Add pipeline tag and improve model card
Hi! I'm Niels from the Hugging Face community science team.
This PR improves the model card for SAMA-14B by:
- Adding the `image-to-video` pipeline tag to the metadata for better discoverability.
- Including a "Quick Start" section with installation and inference instructions taken from the official GitHub repository.
- Organizing the content to include helpful links to the paper, project page, and code.
These changes help users understand how to set up and run the model effectively.
README.md CHANGED

````diff
@@ -1,7 +1,8 @@
-
 ---
 license: apache-2.0
+pipeline_tag: image-to-video
 ---
+
 # SAMA: Factorized Semantic Anchoring and Motion Alignment for Instruction-Guided Video Editing
 
 <div align="center">
@@ -12,13 +13,40 @@ license: apache-2.0
 <a href="https://github.com/Cynthiazxy123/SAMA" target="_blank"><img src="https://img.shields.io/badge/Code-111111.svg?logo=github&logoColor=white" height="22px"></a>
 </div>
 
-
+SAMA (factorized **S**emantic **A**nchoring and **M**otion **A**lignment) is an instruction-guided video editing framework. It factorizes video editing into two parts: semantic anchoring, which establishes structural planning, and motion alignment, which internalizes temporal dynamics. This enables precise semantic modifications while faithfully preserving the original motion of the source video.
+
+## 🚀 Quick Start
+
+### 🛠️ Installation
+
+Recommended environment: Linux, an NVIDIA GPU, CUDA 12.1, and Python 3.10.
+
+```bash
+git clone https://github.com/Cynthiazxy123/SAMA
+cd SAMA
+
+conda create -n sama python=3.10 -y
+conda activate sama
+
+pip install --upgrade pip
+pip install -r requirements.txt
+```
+
+### ▶️ Inference
+
+To run instruction-guided video editing, you need the base `Wan2.1-T2V-14B` model and the SAMA checkpoint.
+
+The inference script is located at `infer_sh/run_sama.sh`. Edit the variables at the top of that script (such as `MODEL_ROOT`, `STATE_DICT`, `SRC_VIDEO`, and `PROMPT`) and then run:
+
+```bash
+bash infer_sh/run_sama.sh
+```
 
 ## 🤗 Available Models
 
 | Model | Status | Link |
 | --- | --- | --- |
-| SAMA-5B | Coming soon |
+| SAMA-5B | Coming soon | - |
 | SAMA-14B | Available | [syxbb/SAMA-14B](https://huggingface.co/syxbb/SAMA-14B) |
 
 ## 📚 Citation
@@ -33,4 +61,4 @@ license: apache-2.0
 primaryClass={cs.CV},
 url={https://arxiv.org/abs/2603.19228},
 }
-```
+```
````
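Not part of the PR itself, but for reviewers trying out the new Quick Start section: the variables the README tells you to edit at the top of `infer_sh/run_sama.sh` can also be assembled into a one-line invocation. This is a hedged sketch — the paths below are placeholder assumptions, and passing the values as environment variables only works if the script uses `VAR="${VAR:-default}"`-style defaults; otherwise edit the script directly as the README says.

```python
from shlex import quote

# Hypothetical values for the variables named in the README's Inference
# section; the real paths depend on where you put the downloaded weights.
variables = {
    "MODEL_ROOT": "checkpoints/Wan2.1-T2V-14B",  # base model directory (assumed)
    "STATE_DICT": "checkpoints/SAMA-14B",        # SAMA checkpoint (assumed)
    "SRC_VIDEO": "examples/source.mp4",          # placeholder input video
    "PROMPT": "turn the car into a vintage convertible",
}

# Shell-quote each value and prepend the assignments to the script call,
# producing a single copy-pasteable command line.
assignments = " ".join(f"{k}={quote(v)}" for k, v in variables.items())
command = f"{assignments} bash infer_sh/run_sama.sh"
print(command)
```

Printing `command` yields a line ending in `bash infer_sh/run_sama.sh`, with the prompt safely single-quoted for the shell.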
|