Improve model card: Add pipeline tag, library name, paper link, and usage example
#1 · opened by nielsr (HF Staff)
README.md CHANGED

````diff
@@ -1,17 +1,22 @@
 ---
 license: apache-2.0
+pipeline_tag: video-text-to-text
+library_name: transformers
 ---
+
 <p align="center">
 <img src="https://github.com/alibaba-damo-academy/RynnEC/blob/main/assets/logo.jpg?raw=true" width="150" style="margin-bottom: 0.2;"/>
 <p>
 
-<h3 align="center"><a href="" style="color:#9C276A">
+<h3 align="center"><a href="https://huggingface.co/papers/2508.14160" style="color:#9C276A">
 RynnEC: Bringing MLLMs into Embodied World</a></h3>
 <h5 align="center"> If our project helps you, please give us a star ⭐ on <a href="https://github.com/alibaba-damo-academy/RynnEC">Github</a> to support us. 🙏🙏 </h2>
 
+This repository contains the RynnEC model presented in the paper [RynnEC: Bringing MLLMs into Embodied World](https://huggingface.co/papers/2508.14160).
+For more details, please visit the [project page](https://huggingface.co/spaces/Alibaba-DAMO-Academy/RynnEC) and the [GitHub repository](https://github.com/alibaba-damo-academy/RynnEC).
 
 ## 📰 News
-* **[2025.08.08]**
+* **[2025.08.08]** 🔥🔥 Release our RynnEC-2B model, RynnEC-Bench and training code.
 
 
 
@@ -51,4 +56,67 @@ Benchmark comparison across object cognition and spatial cognition. With a highl
 
 If you find RynnEC useful for your research and applications, please cite using this BibTeX:
 
-
+```bibtex
+@article{wu2025rynnec,
+  title={RynnEC: Bringing MLLMs into Embodied World},
+  author={Wu, Zhiyong and Wu, Zhenyu and Ma, Weichen and Zhou, Bo and Shen, Junnan and Wu, Lemeng and Huang, Qichen and Yu, Runhui and Liu, Qiming and Jiang, Zibo and Zhang, Hongyang},
+  journal={arXiv preprint arXiv:2508.14160},
+  year={2025}
+}
+```
+
+## Usage
+
+We provide a simple generation process for using our model. For more details, you could refer to the [Github repository](https://github.com/alibaba-damo-academy/RynnEC).
+
+```python
+from transformers import Qwen2VLForConditionalGeneration, AutoProcessor
+from qwen_vl_utils import process_vision_info
+
+# Default: Load the model on the available device(s)
+model = Qwen2VLForConditionalGeneration.from_pretrained(
+    "Alibaba-DAMO-Academy/RynnEC-2B", torch_dtype="auto", device_map="auto"
+)
+processor = AutoProcessor.from_pretrained("Alibaba-DAMO-Academy/RynnEC-2B")
+
+messages = [
+    {
+        "role": "user",
+        "content": [
+            {
+                "type": "image",
+                "image": "./examples/images/web_6f93090a-81f6-489e-bb35-1a2838b18c01.png",
+            },
+            {"type": "text", "text": "In this UI screenshot, what is the position of the element corresponding to the command \"switch language of current page\" (with bbox)?"},
+        ],
+    }
+]
+
+
+# Preparation for inference
+text = processor.apply_chat_template(
+    messages, tokenize=False, add_generation_prompt=True
+)
+image_inputs, video_inputs = process_vision_info(messages)
+inputs = processor(
+    text=[text],
+    images=image_inputs,
+    videos=video_inputs,
+    padding=True,
+    return_tensors="pt",
+)
+inputs = inputs.to("cuda")
+
+# Inference: Generation of the output
+generated_ids = model.generate(**inputs, max_new_tokens=128)
+
+generated_ids_trimmed = [
+    out_ids[len(in_ids):] for in_ids, out_ids in zip(inputs.input_ids, generated_ids)
+]
+
+output_text = processor.batch_decode(
+    generated_ids_trimmed, skip_special_tokens=False, clean_up_tokenization_spaces=False
+)
+print(output_text)
+# <|object_ref_start|>language switch<|object_ref_end|><|box_start|>(576,12),(592,42)<|box_end|><|im_end|>
```
````
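One note on the proposed usage section: the new metadata tags the model as `video-text-to-text`, but the snippet prompts with a single image. A video-input variant could make the card match its pipeline tag. The sketch below is not part of this PR; it assumes the checkpoint keeps the Qwen2-VL-style interface used in the snippet above and that `qwen_vl_utils.process_vision_info` resolves `{"type": "video"}` entries as it does for Qwen2-VL checkpoints. The video path, `fps` value, and question are hypothetical placeholders.

```python
# Hypothetical video-input variant of the PR's snippet (not part of this PR).
# Assumes the same Qwen2-VL-style model/processor interface as above; the
# video path, fps, and question are placeholders.
from transformers import Qwen2VLForConditionalGeneration, AutoProcessor
from qwen_vl_utils import process_vision_info

model = Qwen2VLForConditionalGeneration.from_pretrained(
    "Alibaba-DAMO-Academy/RynnEC-2B", torch_dtype="auto", device_map="auto"
)
processor = AutoProcessor.from_pretrained("Alibaba-DAMO-Academy/RynnEC-2B")

messages = [
    {
        "role": "user",
        "content": [
            # qwen_vl_utils samples frames from the video at the given fps
            {"type": "video", "video": "./examples/videos/room_tour.mp4", "fps": 1.0},
            {"type": "text", "text": "Which objects in this room can be used for sitting?"},
        ],
    }
]

text = processor.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
image_inputs, video_inputs = process_vision_info(messages)
inputs = processor(
    text=[text],
    images=image_inputs,
    videos=video_inputs,
    padding=True,
    return_tensors="pt",
).to("cuda")

generated_ids = model.generate(**inputs, max_new_tokens=128)
# Strip the prompt tokens so only the newly generated answer is decoded
trimmed = [out[len(inp):] for inp, out in zip(inputs.input_ids, generated_ids)]
print(processor.batch_decode(trimmed, skip_special_tokens=True)[0])
```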