Update README.md

d27ed76 verified about 1 month ago

3.92 kB

	---
	license: apache-2.0
	tags:
	- CARLA
	- NAVSIM
	- Imitation-Learning
	- Closed-Loop-Driving
	pipeline_tag: robotics
	---

	# LEAD: Minimizing Learner–Expert Asymmetry in End-to-End Driving


	> ## ⚠️ Coordinate System Warning
	>
	> This model was trained in the left-handed coordinate system of CARLA (x-forward, y-right, z-up), not the ISO 8855 convention used by NAVSIM / nuPlan / most AD stacks (x-forward, y-left, z-up).
	>
	> If you use `ltfv6.py` directly, the predicted `waypoints` and `headings` are in CARLA's left-handed frame. You must convert the planning output back to ISO 8855 before feeding it to any downstream planner, simulator, or evaluation tool that expects the right-handed convention.
	>
	> ### ✅ Recommended: use the prepared NAVSIM workspaces
	>
	> For correct, reproducible evaluation, Use one of the prepared workspaces below — they already wire up the model with the correct coordinate conversion, input preprocessing, and metric computation:
	>
	> - NAVSIM v1.1: [`3rd_party/navsim_workspace/navsimv1.1`](https://github.com/kesai-labs/lead/tree/main/3rd_party/navsim_workspace/navsimv1.1)
	> - NAVSIM v2.2: [`3rd_party/navsim_workspace/navsimv2.2`](https://github.com/kesai-labs/lead/tree/main/3rd_party/navsim_workspace/navsimv2.2)
	>
	> These are the only configurations we have validated end-to-end against the reported numbers. If you evaluate outside of them, results may silently disagree with the paper.
	>
	> ### Manual conversion (only if you must integrate the model yourself)
	>
	> ```python
	> waypoints_iso[..., 0] = waypoints_carla[..., 0] # x unchanged
	> waypoints_iso[..., 1] = -waypoints_carla[..., 1] # flip y
	> headings_iso = -headings_carla # flip yaw sign
	> ```


	[Project Page](https://ln2697.github.io/lead) \| [Paper](https://huggingface.co/papers/2512.20563) \| [Code](https://github.com/autonomousvision/lead)

	Official model weights for Latent TransFuser v6 (LTFv6), a NAVSIM checkpoint accompanies our paper LEAD: Minimizing Learner–Expert Asymmetry in End-to-End Driving.

	> We release the complete pipeline required to achieve state-of-the-art closed-loop performance on the Bench2Drive benchmark. Built around the CARLA simulator, the stack features a data-centric design with:
	>
	> - Extensive visualization suite and runtime type validation for easier debugging.
	> - Optimized storage format, packs 72 hours of driving in ~200GB.
	> - Native support for NAVSIM and Waymo Vision-based E2E and extending those benchmarks through closed-loop simulation and synthetic data for additional supervision during training.

	Find more information on [https://github.com/autonomousvision/lead](https://github.com/autonomousvision/lead).

	<p align="center">
	<img src="https://ln2697.github.io/lead/static/images/tfv6.png" alt="TFv6 Architecture" width="80%" >
	</p>

	## Usage

	Install dependencies

	```bash
	pip install torch timm numpy opencv-python jaxtyping beartype omegaconf huggingface_hub
	```

	See [example.ipynb](https://huggingface.co/ln2697/tfv6_navsim/blob/main/example.ipynb) to inspect data format and example inference.

	## Data Format

	We also provide example NAVSIM cache [here](https://huggingface.co/ln2697/tfv6_navsim/tree/main/data).

	Input:
	- RGB: (256, 1920, 3), range [0, 255]
	- Command: [left, straight, right, unknown], e.g. [0, 1, 0, 0] for straight
	- Speed: m/s
	- Acceleration: m/s²

	Output:
	- `waypoints`: (N, 2) predicted positions
	- `headings`: (N,) predicted angles

	## Citation

	If you find this work useful, please cite:

	```bibtex
	@inproceedings{Nguyen2026CVPR,
	author = {Long Nguyen and Micha Fauth and Bernhard Jaeger and Daniel Dauner and Maximilian Igl and Andreas Geiger and Kashyap Chitta},
	title = {LEAD: Minimizing Learner-Expert Asymmetry in End-to-End Driving},
	booktitle = {Conference on Computer Vision and Pattern Recognition (CVPR)},
	year = {2026},
	}
	```