nielsr HF Staff commited on
Commit
752aa90
·
verified ·
1 Parent(s): 5f56981

Add robotics pipeline tag, library name, and project links

Browse files

Hi! I'm Niels from the Hugging Face community team.

This PR improves the model card for MergeVLA-LIBERO by:
- Adding the `robotics` pipeline tag to the metadata for better discoverability.
- Specifying `transformers` as the `library_name` based on the configuration files.
- Adding direct links to the official [research paper](https://arxiv.org/abs/2511.18810), [project page](https://mergevla.github.io/), and [GitHub repository](https://github.com/MergeVLA/MergeVLA).

These updates help users categorize and use the model more effectively within the Hugging Face ecosystem.

Files changed (1) hide show
  1. README.md +20 -5
README.md CHANGED
@@ -1,14 +1,29 @@
1
  ---
2
- license: mit
3
- language:
4
- - en
5
  base_model:
6
  - Qwen/Qwen2.5-0.5B
 
 
 
 
 
 
 
 
 
 
7
  ---
 
8
  # Model Card for MergeVLA-LIBERO
9
- MergeVLA — Single-Skill Experts for Spatial / Object / Goal / Long-10 (LIBERO Task Suite). These models are used as the base expert checkpoints for our MergeVLA.
 
 
 
 
 
10
 
11
  ## Model Details
 
 
12
  Each uploaded model is a 0.68B-parameter VLA model *(excluding the vision backbone)* composed of:
13
  - Qwen2.5-0.5B as the Vision-Language Model (VLM)
14
  - A lightweight 0.18B Action Expert
@@ -23,7 +38,7 @@ Each uploaded model is a 0.68B-parameter VLA model *(excluding the vision backbo
23
  | **Long-10** | **95.0** |
24
 
25
  ### 🧠 **Training Details**
26
- Each expert is fine-tuned independently using modified LIBER demonstrations in RLDS format.
27
  | Category | Value |
28
  | ----------------------- | ------------------------ |
29
  | LoRA | Enabled (rank = 64) |
 
1
  ---
 
 
 
2
  base_model:
3
  - Qwen/Qwen2.5-0.5B
4
+ language:
5
+ - en
6
+ license: mit
7
+ pipeline_tag: robotics
8
+ library_name: transformers
9
+ tags:
10
+ - vla
11
+ - vision-language-action
12
+ - model-merging
13
+ - libero
14
  ---
15
+
16
  # Model Card for MergeVLA-LIBERO
17
+
18
+ [**MergeVLA: Cross-Skill Model Merging Toward a Generalist Vision-Language-Action Agent**](https://arxiv.org/abs/2511.18810)
19
+
20
+ [**Project Page**](https://mergevla.github.io/) | [**Code**](https://github.com/MergeVLA/MergeVLA)
21
+
22
+ MergeVLA — Single-Skill Experts for Spatial / Object / Goal / Long-10 (LIBERO Task Suite). These models are used as the base expert checkpoints for **MergeVLA**, a merging-oriented VLA architecture designed to preserve mergeability across tasks.
23
 
24
  ## Model Details
25
+ MergeVLA addresses non-mergeability in VLAs by introducing sparsely activated LoRA adapters via task masks and replacing self-attention in action experts with cross-attention-only blocks.
26
+
27
  Each uploaded model is a 0.68B-parameter VLA model *(excluding the vision backbone)* composed of:
28
  - Qwen2.5-0.5B as the Vision-Language Model (VLM)
29
  - A lightweight 0.18B Action Expert
 
38
  | **Long-10** | **95.0** |
39
 
40
  ### 🧠 **Training Details**
41
+ Each expert is fine-tuned independently using modified LIBERO demonstrations in RLDS format.
42
  | Category | Value |
43
  | ----------------------- | ------------------------ |
44
  | LoRA | Enabled (rank = 64) |