Add pipeline tag and library metadata
#1
by nielsr HF Staff - opened
README.md
CHANGED
|
@@ -1,7 +1,9 @@
|
|
| 1 |
---
|
| 2 |
-
license: apache-2.0
|
| 3 |
base_model:
|
| 4 |
- Qwen/Qwen2.5-VL-3B-Instruct
|
|
|
|
|
|
|
|
|
|
| 5 |
---
|
| 6 |
|
| 7 |
# SpatialEvo: Self-Evolving Spatial Intelligence via Deterministic Geometric Environments
|
|
@@ -21,12 +23,11 @@ base_model:
|
|
| 21 |
|
| 22 |
## SpatialEvo-3B
|
| 23 |
|
| 24 |
-
|
| 25 |
This repository contains **SpatialEvo-3B**, introduced in [SpatialEvo: Self-Evolving Spatial Intelligence via Deterministic Geometric Environments](https://arxiv.org/abs/2604.14144).
|
| 26 |
|
| 27 |
## Model Description
|
| 28 |
|
| 29 |
-
SpatialEvo-3B is fine-tuned from **Qwen2.5-VL-3B-Instruct** using the SpatialEvo self-evolving framework. Instead of relying on manually annotated datasets or model
|
| 30 |
|
| 31 |
A single shared-parameter policy co-evolves as both a **Questioner** and a **Solver** under GRPO optimization, while a lightweight **Task Scheduler** drives adaptive curriculum learning based on historical accuracy — without any manual stage design or human annotation.
|
| 32 |
|
|
|
|
| 1 |
---
|
|
|
|
| 2 |
base_model:
|
| 3 |
- Qwen/Qwen2.5-VL-3B-Instruct
|
| 4 |
+
license: apache-2.0
|
| 5 |
+
library_name: transformers
|
| 6 |
+
pipeline_tag: image-text-to-text
|
| 7 |
---
|
| 8 |
|
| 9 |
# SpatialEvo: Self-Evolving Spatial Intelligence via Deterministic Geometric Environments
|
|
|
|
| 23 |
|
| 24 |
## SpatialEvo-3B
|
| 25 |
|
|
|
|
| 26 |
This repository contains **SpatialEvo-3B**, introduced in [SpatialEvo: Self-Evolving Spatial Intelligence via Deterministic Geometric Environments](https://arxiv.org/abs/2604.14144).
|
| 27 |
|
| 28 |
## Model Description
|
| 29 |
|
| 30 |
+
SpatialEvo-3B is fine-tuned from **Qwen2.5-VL-3B-Instruct** using the SpatialEvo self-evolving framework. Instead of relying on manually annotated datasets or model consensus to construct pseudo-labels, SpatialEvo leverages a **Deterministic Geometric Environment (DGE)** that programmatically computes exact ground truth from 3D point clouds and camera poses, enabling zero-noise online reinforcement learning across 16 spatial reasoning task categories.
|
| 31 |
|
| 32 |
A single shared-parameter policy co-evolves as both a **Questioner** and a **Solver** under GRPO optimization, while a lightweight **Task Scheduler** drives adaptive curriculum learning based on historical accuracy — without any manual stage design or human annotation.
|
| 33 |
|