Update README.md
README.md
CHANGED
@@ -1,5 +1,5 @@
 ---
-name: Traffic-R1 (Public)
+name: Traffic-R1-3B (Public 0.1)
 license: apache-2.0
 language:
 - en
@@ -13,4 +13,17 @@ tags:
 
 ---
 
-
+### 🚀**Traffic-R1-3B (Public 0.1)** 🚦
+
+**Traffic-R1** is a foundational LLM built specifically for **traffic signal control**. This publicly available version, **Traffic-R1-3B (Public 0.1)**, delivers superior zero-shot performance and stable generalization, allowing it to reason like a human traffic expert. 🧠
+
+This model is a checkpoint based on the research in our paper:
+
+**Traffic-R1: Reinforced LLMs Bring Human-Like Reasoning to Traffic Signal Control Systems** 📄
+[https://arxiv.org/abs/2508.02344](https://arxiv.org/abs/2508.02344)
+
+---
+
+### **Important Notice** ⚠️
+
+This is an earlier checkpoint and doesn't include all the data samples from our offline pretraining stage. We've done this to address commercial and privacy concerns. We will release updates as the model continues to be upgraded internally. 😊