File size: 3,920 Bytes
ee22b18
 
a5b4ff6
 
 
 
 
60ffb59
ee22b18
 
8485284
a5b4ff6
d27ed76
521a67a
 
 
 
 
 
d27ed76
 
 
 
 
 
 
 
 
 
 
521a67a
 
 
 
 
 
 
8485284
ee22b18
35521e3
9417189
dbfc31e
4d19a27
 
 
dbfc31e
9417189
b2e8ebc
ee22b18
e32a581
 
 
a5b4ff6
b7c050e
ee22b18
1adb8a4
 
ee22b18
b7c050e
ee22b18
1adb8a4
1b9c02b
 
b9a0bb7
 
 
ee22b18
b7c050e
5d8f5d2
 
b7c050e
 
ee22b18
b7c050e
 
58cf78f
 
 
 
 
 
 
43e881b
 
 
 
 
58cf78f
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
---
license: apache-2.0
tags:
- CARLA
- NAVSIM
- Imitation-Learning
- Closed-Loop-Driving
pipeline_tag: robotics
---

# LEAD: Minimizing Learner–Expert Asymmetry in End-to-End Driving


> ## ⚠️ Coordinate System Warning
>
> **This model was trained in the left-handed coordinate system of CARLA** (x-forward, **y-right**, z-up), **not** the ISO 8855 convention used by NAVSIM / nuPlan / most AD stacks (x-forward, **y-left**, z-up).
>
> If you use `ltfv6.py` directly, the predicted `waypoints` and `headings` are in CARLA's left-handed frame. **You must convert the planning output back to ISO 8855 before feeding it to any downstream planner, simulator, or evaluation tool that expects the right-handed convention.**
>
> ### ✅ Recommended: use the prepared NAVSIM workspaces
>
> For correct, reproducible evaluation, Use one of the prepared workspaces below — they already wire up the model with the correct coordinate conversion, input preprocessing, and metric computation:
>
> - **NAVSIM v1.1**: [`3rd_party/navsim_workspace/navsimv1.1`](https://github.com/kesai-labs/lead/tree/main/3rd_party/navsim_workspace/navsimv1.1)
> - **NAVSIM v2.2**: [`3rd_party/navsim_workspace/navsimv2.2`](https://github.com/kesai-labs/lead/tree/main/3rd_party/navsim_workspace/navsimv2.2)
>
> These are the only configurations we have validated end-to-end against the reported numbers. If you evaluate outside of them, results may silently disagree with the paper.
>
> ### Manual conversion (only if you must integrate the model yourself)
>
> ```python
> waypoints_iso[..., 0] =  waypoints_carla[..., 0]   # x unchanged
> waypoints_iso[..., 1] = -waypoints_carla[..., 1]   # flip y
> headings_iso          = -headings_carla            # flip yaw sign
> ```


[**Project Page**](https://ln2697.github.io/lead) | [**Paper**](https://huggingface.co/papers/2512.20563) | [**Code**](https://github.com/autonomousvision/lead)

Official model weights for **Latent TransFuser v6 (LTFv6)**, a NAVSIM checkpoint accompanies our paper LEAD: Minimizing Learner–Expert Asymmetry in End-to-End Driving. 

> We release the complete pipeline required to achieve state-of-the-art closed-loop performance on the Bench2Drive benchmark. Built around the CARLA simulator, the stack features a data-centric design with:
> 
> - Extensive visualization suite and runtime type validation for easier debugging.
> - Optimized storage format, packs 72 hours of driving in ~200GB.
> - Native support for NAVSIM and Waymo Vision-based E2E and extending those benchmarks through closed-loop simulation and synthetic data for additional supervision during training.

Find more information on [https://github.com/autonomousvision/lead](https://github.com/autonomousvision/lead).

<p align="center">
  <img src="https://ln2697.github.io/lead/static/images/tfv6.png" alt="TFv6 Architecture" width="80%" >
</p>

## Usage

Install dependencies

```bash
pip install torch timm numpy opencv-python jaxtyping beartype omegaconf huggingface_hub
```

See [example.ipynb](https://huggingface.co/ln2697/tfv6_navsim/blob/main/example.ipynb) to inspect data format and example inference.

## Data Format

We also provide example NAVSIM cache [here](https://huggingface.co/ln2697/tfv6_navsim/tree/main/data).

**Input:**
- RGB: (256, 1920, 3), range [0, 255]
- Command: [left, straight, right, unknown], e.g. [0, 1, 0, 0] for straight
- Speed: m/s
- Acceleration: m/s²

**Output:**
- `waypoints`: (N, 2) predicted positions
- `headings`: (N,) predicted angles

## Citation

If you find this work useful, please cite:

```bibtex
@inproceedings{Nguyen2026CVPR,
	author = {Long Nguyen and Micha Fauth and Bernhard Jaeger and Daniel Dauner and Maximilian Igl and Andreas Geiger and Kashyap Chitta},
	title = {LEAD: Minimizing Learner-Expert Asymmetry in End-to-End Driving},
	booktitle = {Conference on Computer Vision and Pattern Recognition (CVPR)},
	year = {2026},
}
```