open-gigaai
/

CVPR-2026-WorldModel-Track-Model-Task2

Model card Files Files and versions

CVPR-2026-WorldModel-Track-Model-Task2 / README.md

cbtogu's picture

Update README.md

b2e5a72 verified 5 days ago

|

history blame contribute delete

2.97 kB

	---
	license: apache-2.0
	---
	# GigaBrain Challenge 2026 – Task 2 VLA Policy Model

	This repository provides the Vision-Language-Action (VLA) policy model for Task 2 of the GigaBrain Challenge 2026.

	Official Challenge Website:

	https://gigaai-research.github.io/GigaBrain-Challenge-2026/

	The GigaBrain Challenge aims to advance research in embodied intelligence, including:

	- Vision-Language-Action (VLA) models
	- world models
	- robot learning systems
	- closed-loop embodied agents

	---

	# Task Overview

	Task 2 is one of the official tasks in the GigaBrain Challenge 2026 benchmark suite.

	Each task evaluates an embodied agent’s ability to generate actions based on multimodal observations and task instructions.

	For full task descriptions please refer to:

	https://gigaai-research.github.io/GigaBrain-Challenge-2026/index.html#tracks

	---

	# Model Description

	This repository contains the VLA policy model used for Task 2.

	The model is designed to:

	- take visual observations and language instructions as input
	- generate robot actions
	- interact with a world model in a closed-loop evaluation environment

	The VLA model serves as the action generation component in the challenge evaluation pipeline.

	During evaluation:
	Observation → VLA Model → Action → World Model → Next Observation

	This loop continues until the task is completed or the episode terminates.

	Participants may use this model as:

	- a reference implementation
	- a starting point for model development
	- a component in their own embodied AI systems

	---

	# Dataset

	The dataset for all tasks is provided in the official challenge dataset repository:

	https://huggingface.co/datasets/open-gigaai/CVPR-2026-WorldModel-Track-Dataset

	Access requires:

	- Team registration
	- Agreement to the challenge license

	Please request access on the dataset page.

	---

	# License and Usage Terms

	This model is released as part of the GigaBrain Challenge 2026.

	Use of this model and the associated datasets is governed by the GigaBrain Challenge 2026 Data & Model License Agreement.

	Key terms include:

	Intended for research and competition participation

	Dataset redistribution is not permitted

	Commercial use requires permission from the organizers

	For full license terms please refer to the dataset repository:

	https://huggingface.co/datasets/open-gigaai/CVPR-2026-WorldModel-Track-Dataset

	# Citation

	If you use this model or dataset in academic research, please cite the challenge:
	```bibtex
	@misc{gigabrain_challenge_2026,
	title={GigaBrain Challenge 2026},
	year={2026},
	howpublished={https://gigaai-research.github.io/GigaBrain-Challenge-2026/}
	}
	```

	# Disclaimer

	This model is provided as-is without warranty of any kind.

	The organizers are not responsible for any issues arising from its use.

	# Contact

	For questions about the challenge, please visit the official website:

	https://gigaai-research.github.io/GigaBrain-Challenge-2026/