---
license: apache-2.0
---
# GigaBrain Challenge 2026 – Task 2 VLA Policy Model

This repository provides the **Vision-Language-Action (VLA) policy model for Task 2** of the **GigaBrain Challenge 2026**.

Official Challenge Website:

https://gigaai-research.github.io/GigaBrain-Challenge-2026/

The GigaBrain Challenge aims to advance research in **embodied intelligence**, including:

- Vision-Language-Action (VLA) models
- world models
- robot learning systems
- closed-loop embodied agents

---

# Task Overview

**Task 2** is one of the official tasks in the GigaBrain Challenge 2026 benchmark suite.

Each task evaluates an embodied agent’s ability to generate actions based on multimodal observations and task instructions.

For full task descriptions, please refer to:

https://gigaai-research.github.io/GigaBrain-Challenge-2026/index.html#tracks

---

# Model Description

This repository contains the **VLA policy model used for Task 2**.

The model is designed to:

- take **visual observations and language instructions** as input
- generate **robot actions**
- interact with a **world model** in a closed-loop evaluation environment

The VLA model serves as the **action generation component** in the challenge evaluation pipeline.

During evaluation:

Observation → VLA Model → Action → World Model → Next Observation

This loop continues until the task is completed or the episode terminates.
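
The loop above can be sketched in a few lines of Python. This is only an illustrative sketch, not the challenge's evaluation code: `run_episode`, `toy_policy`, `toy_world_model`, the dictionary observation format, and the `max_steps` limit are all hypothetical names and assumptions chosen for the example. The toy policy and world model stand in for the real VLA model and world model so that the control flow can be run end to end.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical types for illustration; the real challenge interfaces may differ.
Observation = Dict[str, float]
Action = List[float]
Policy = Callable[[Observation, str], Action]
WorldModel = Callable[[Observation, Action], Tuple[Observation, bool]]


def run_episode(
    policy: Policy,
    world_model: WorldModel,
    initial_obs: Observation,
    instruction: str,
    max_steps: int = 100,
) -> Tuple[List[Tuple[Observation, Action]], Observation]:
    """Run Observation -> policy -> Action -> world model -> Next Observation."""
    obs = initial_obs
    trajectory: List[Tuple[Observation, Action]] = []
    for _ in range(max_steps):
        action = policy(obs, instruction)          # VLA model proposes an action
        next_obs, done = world_model(obs, action)  # world model predicts the outcome
        trajectory.append((obs, action))
        obs = next_obs
        if done:                                   # task completed
            break
    return trajectory, obs


TARGET = 1.0  # toy goal position (assumption for the example)


def toy_policy(obs: Observation, instruction: str) -> Action:
    """Stand-in policy: step toward TARGET, at most 0.25 per step."""
    delta = TARGET - obs["position"]
    return [max(-0.25, min(0.25, delta))]


def toy_world_model(obs: Observation, action: Action) -> Tuple[Observation, bool]:
    """Stand-in world model: apply the action and report whether the goal is reached."""
    position = obs["position"] + action[0]
    return {"position": position}, abs(position - TARGET) < 1e-6


if __name__ == "__main__":
    steps, final_obs = run_episode(
        toy_policy, toy_world_model, {"position": 0.0}, "move to the target"
    )
    print(len(steps), final_obs)
```

Keeping the policy and the world model behind plain callables means either side can be swapped independently, which matches the way participants may reuse the VLA model as one component of a larger system.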

Participants may use this model as:

- a reference implementation
- a starting point for model development
- a component in their own embodied AI systems

---

# Dataset

The dataset for all tasks is provided in the official challenge dataset repository:

https://huggingface.co/datasets/open-gigaai/CVPR-2026-WorldModel-Track-Dataset

Access requires:

- Team registration
- Agreement to the challenge license

Please request access on the dataset page.
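
Once access has been granted, the dataset can in principle be fetched with the `huggingface_hub` library. The snippet below is a minimal sketch under stated assumptions: the helper name `download_challenge_dataset` and the `local_dir` argument are chosen for illustration, and it assumes `huggingface_hub` is installed and that the supplied token belongs to an account whose access request was approved.

```python
from typing import Optional

# Repository ID taken from the dataset URL above.
DATASET_REPO_ID = "open-gigaai/CVPR-2026-WorldModel-Track-Dataset"


def download_challenge_dataset(local_dir: str, token: Optional[str] = None) -> str:
    """Download the gated challenge dataset and return the local path.

    Hypothetical helper for illustration. If `token` is None,
    huggingface_hub falls back to the locally stored login, if any.
    """
    # Imported lazily so this module loads even without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id=DATASET_REPO_ID,
        repo_type="dataset",   # required: this is a dataset repo, not a model repo
        local_dir=local_dir,
        token=token,
    )
```

A call such as `download_challenge_dataset("./gigabrain_data")` would fail with an authorization error until the access request on the dataset page has been approved.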

---

# License and Usage Terms

This model is released as part of the GigaBrain Challenge 2026.

Use of this model and the associated datasets is governed by the GigaBrain Challenge 2026 Data & Model License Agreement.

Key terms include:

- Intended for research and competition participation
- Dataset redistribution is not permitted
- Commercial use requires permission from the organizers

For full license terms, please refer to the dataset repository:

https://huggingface.co/datasets/open-gigaai/CVPR-2026-WorldModel-Track-Dataset

# Citation

If you use this model or dataset in academic research, please cite the challenge:
```bibtex
@misc{gigabrain_challenge_2026,
  title={GigaBrain Challenge 2026},
  year={2026},
  howpublished={https://gigaai-research.github.io/GigaBrain-Challenge-2026/}
}
```

# Disclaimer

This model is provided as-is without warranty of any kind.

The organizers are not responsible for any issues arising from its use.

# Contact

For questions about the challenge, please visit the official website:

https://gigaai-research.github.io/GigaBrain-Challenge-2026/