Improve model card: Update pipeline tag, add library name, and abstract

#1
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +10 -9
README.md CHANGED
@@ -1,16 +1,16 @@
1
  ---
2
- name: Traffic-R1-3B (Public 0.1)
3
- license: apache-2.0
4
- language:
5
- - en
6
  base_model:
7
  - Qwen/Qwen2.5-3B
8
- pipeline_tag: reinforcement-learning
 
 
 
9
  tags:
10
  - traffic
11
  - reinforce LLM
12
  - LLM agent
13
-
 
14
  ---
15
 
16
  ### πŸš€**Traffic-R1-3B (Public 0.1)** 🚦
@@ -22,12 +22,14 @@ This model is a checkpoint based on the research in our paper:
22
  **Traffic-R1: Reinforced LLMs Bring Human-Like Reasoning to Traffic Signal Control Systems** πŸ“„
23
  [https://arxiv.org/abs/2508.02344](https://arxiv.org/abs/2508.02344)
24
 
 
 
 
25
  ## **Compatibility & Reproducibility** πŸ› οΈ
26
 
27
- This model supports a wide range of deployment methods compatible with the Qwen architecture. You can easily use it in a chat mode to interactively discuss traffic-related scenarios.
28
  For more detailed information on deployment, please refer to the official [Qwen documentation](https://qwen.readthedocs.io/en/latest/).
29
 
30
-
31
  The model is compatible with the signal control evaluation code provided by **LLMLight** [https://github.com/usail-hkust/LLMTSCS]. You can quickly reproduce our results with minor changes to the prompt format.
32
 
33
  A big thanks to these excellent projects! πŸ‘
@@ -40,7 +42,6 @@ We plan to release our evaluation code (most necessary) and training code soon.
40
 
41
  We are working on upgrading base mode Qwen 2.5->Qwen 3 for latest features.
42
 
43
-
44
  ---
45
 
46
  ### **Important Notice** ⚠️
 
1
  ---
 
 
 
 
2
  base_model:
3
  - Qwen/Qwen2.5-3B
4
+ language:
5
+ - en
6
+ license: apache-2.0
7
+ pipeline_tag: robotics
8
  tags:
9
  - traffic
10
  - reinforce LLM
11
  - LLM agent
12
+ name: Traffic-R1-3B (Public 0.1)
13
+ library_name: transformers
14
  ---
15
 
16
  ### πŸš€**Traffic-R1-3B (Public 0.1)** 🚦
 
22
  **Traffic-R1: Reinforced LLMs Bring Human-Like Reasoning to Traffic Signal Control Systems** πŸ“„
23
  [https://arxiv.org/abs/2508.02344](https://arxiv.org/abs/2508.02344)
24
 
25
+ ## Abstract
26
+ Traffic signal control (TSC) is vital for mitigating congestion and sustaining urban mobility. In this paper, we introduce Traffic-R1, a foundation model with human-like reasoning for TSC systems. Our model is developed through self-exploration and iteration of reinforced large language models (LLMs) with expert guidance in a simulated traffic environment. Compared to traditional reinforcement learning (RL) and recent LLM-based methods, Traffic-R1 offers three significant advantages. First, Traffic-R1 delivers zero-shot generalisation, transferring unchanged to new road networks and out-of-distribution incidents by utilizing its internal traffic control policies and human-like reasoning. Second, its 3B-parameter architecture is lightweight enough for real-time inference on mobile-class chips, enabling large-scale edge deployment. Third, Traffic-R1 provides an explainable TSC process and facilitates multi-intersection communication through its self-iteration and a new synchronous communication network. Extensive benchmarks demonstrate that Traffic-R1 sets a new state of the art, outperforming strong baselines and training-intensive RL controllers. In practice, the model now manages signals for more than 55,000 drivers daily, shortening average queues by over 5% and halving operator workload.
27
+
28
  ## **Compatibility & Reproducibility** πŸ› οΈ
29
 
30
+ This model supports a wide range of deployment methods compatible with the Qwen architecture, including those provided by the `transformers` library. You can easily use it in a chat mode to interactively discuss traffic-related scenarios.
31
  For more detailed information on deployment, please refer to the official [Qwen documentation](https://qwen.readthedocs.io/en/latest/).
32
 
 
33
  The model is compatible with the signal control evaluation code provided by **LLMLight** [https://github.com/usail-hkust/LLMTSCS]. You can quickly reproduce our results with minor changes to the prompt format.
34
 
35
  A big thanks to these excellent projects! πŸ‘
 
42
 
43
  We are working on upgrading base mode Qwen 2.5->Qwen 3 for latest features.
44
 
 
45
  ---
46
 
47
  ### **Important Notice** ⚠️