| TAB_ABOUT = """## πΊοΈ LingBot-Map: Geometric Context Transformer for Streaming 3D Reconstruction |
| |
| LingBot-Map is a **feed-forward 3D foundation model** that reconstructs 3D scenes from video streams in real-time at ~20 FPS. |
| |
| Given a continuous video stream, it recovers: |
| - π· **Camera poses** for each frame |
| - π **Depth maps** per frame |
| - βοΈ **3D point clouds** of the scene |
| |
| Unlike traditional SLAM systems that rely on iterative optimization, LingBot-Map does this in a **single forward pass** through a transformer. |
| |
| ### Paper |
| **"Geometric Context Transformer for Streaming 3D Reconstruction"** |
| Lin-Zhuo Chen, Jian Gao, Yihang Chen, Ka Leong Cheng, Yipengjing Sun, Liangxiao Hu, Nan Xue, Xing Zhu, Yujun Shen, Yao Yao, Yinghao Xu |
| |
| π [arXiv:2604.14141](https://arxiv.org/abs/2604.14141) | π [Project Page](https://technology.robbyant.com/lingbot-map) | π» [GitHub](https://github.com/robbyant/lingbot-map) |
| """ |
|
|