Improve model card: Update pipeline tag, add library name, and abstract
#1
by
nielsr
HF Staff
- opened
README.md
CHANGED
|
@@ -1,16 +1,16 @@
|
|
| 1 |
---
|
| 2 |
-
name: Traffic-R1-3B (Public 0.1)
|
| 3 |
-
license: apache-2.0
|
| 4 |
-
language:
|
| 5 |
-
- en
|
| 6 |
base_model:
|
| 7 |
- Qwen/Qwen2.5-3B
|
| 8 |
-
|
|
|
|
|
|
|
|
|
|
| 9 |
tags:
|
| 10 |
- traffic
|
| 11 |
- reinforce LLM
|
| 12 |
- LLM agent
|
| 13 |
-
|
|
|
|
| 14 |
---
|
| 15 |
|
| 16 |
### π**Traffic-R1-3B (Public 0.1)** π¦
|
|
@@ -22,12 +22,14 @@ This model is a checkpoint based on the research in our paper:
|
|
| 22 |
**Traffic-R1: Reinforced LLMs Bring Human-Like Reasoning to Traffic Signal Control Systems** π
|
| 23 |
[https://arxiv.org/abs/2508.02344](https://arxiv.org/abs/2508.02344)
|
| 24 |
|
|
|
|
|
|
|
|
|
|
| 25 |
## **Compatibility & Reproducibility** π οΈ
|
| 26 |
|
| 27 |
-
This model supports a wide range of deployment methods compatible with the Qwen architecture. You can easily use it in a chat mode to interactively discuss traffic-related scenarios.
|
| 28 |
For more detailed information on deployment, please refer to the official [Qwen documentation](https://qwen.readthedocs.io/en/latest/).
|
| 29 |
|
| 30 |
-
|
| 31 |
The model is compatible with the signal control evaluation code provided by **LLMLight** [https://github.com/usail-hkust/LLMTSCS]. You can quickly reproduce our results with minor changes to the prompt format.
|
| 32 |
|
| 33 |
A big thanks to these excellent projects! π
|
|
@@ -40,7 +42,6 @@ We plan to release our evaluation code (most necessary) and training code soon.
|
|
| 40 |
|
| 41 |
We are working on upgrading base mode Qwen 2.5->Qwen 3 for latest features.
|
| 42 |
|
| 43 |
-
|
| 44 |
---
|
| 45 |
|
| 46 |
### **Important Notice** β οΈ
|
|
|
|
| 1 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
| 2 |
base_model:
|
| 3 |
- Qwen/Qwen2.5-3B
|
| 4 |
+
language:
|
| 5 |
+
- en
|
| 6 |
+
license: apache-2.0
|
| 7 |
+
pipeline_tag: robotics
|
| 8 |
tags:
|
| 9 |
- traffic
|
| 10 |
- reinforce LLM
|
| 11 |
- LLM agent
|
| 12 |
+
name: Traffic-R1-3B (Public 0.1)
|
| 13 |
+
library_name: transformers
|
| 14 |
---
|
| 15 |
|
| 16 |
### π**Traffic-R1-3B (Public 0.1)** π¦
|
|
|
|
| 22 |
**Traffic-R1: Reinforced LLMs Bring Human-Like Reasoning to Traffic Signal Control Systems** π
|
| 23 |
[https://arxiv.org/abs/2508.02344](https://arxiv.org/abs/2508.02344)
|
| 24 |
|
| 25 |
+
## Abstract
|
| 26 |
+
Traffic signal control (TSC) is vital for mitigating congestion and sustaining urban mobility. In this paper, we introduce Traffic-R1, a foundation model with human-like reasoning for TSC systems. Our model is developed through self-exploration and iteration of reinforced large language models (LLMs) with expert guidance in a simulated traffic environment. Compared to traditional reinforcement learning (RL) and recent LLM-based methods, Traffic-R1 offers three significant advantages. First, Traffic-R1 delivers zero-shot generalisation, transferring unchanged to new road networks and out-of-distribution incidents by utilizing its internal traffic control policies and human-like reasoning. Second, its 3B-parameter architecture is lightweight enough for real-time inference on mobile-class chips, enabling large-scale edge deployment. Third, Traffic-R1 provides an explainable TSC process and facilitates multi-intersection communication through its self-iteration and a new synchronous communication network. Extensive benchmarks demonstrate that Traffic-R1 sets a new state of the art, outperforming strong baselines and training-intensive RL controllers. In practice, the model now manages signals for more than 55,000 drivers daily, shortening average queues by over 5% and halving operator workload.
|
| 27 |
+
|
| 28 |
## **Compatibility & Reproducibility** π οΈ
|
| 29 |
|
| 30 |
+
This model supports a wide range of deployment methods compatible with the Qwen architecture, including those provided by the `transformers` library. You can easily use it in a chat mode to interactively discuss traffic-related scenarios.
|
| 31 |
For more detailed information on deployment, please refer to the official [Qwen documentation](https://qwen.readthedocs.io/en/latest/).
|
| 32 |
|
|
|
|
| 33 |
The model is compatible with the signal control evaluation code provided by **LLMLight** [https://github.com/usail-hkust/LLMTSCS]. You can quickly reproduce our results with minor changes to the prompt format.
|
| 34 |
|
| 35 |
A big thanks to these excellent projects! π
|
|
|
|
| 42 |
|
| 43 |
We are working on upgrading base mode Qwen 2.5->Qwen 3 for latest features.
|
| 44 |
|
|
|
|
| 45 |
---
|
| 46 |
|
| 47 |
### **Important Notice** β οΈ
|