File size: 1,352 Bytes
8bce96b
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
TAB_USAGE = """## 🚀 Quick Start

### Installation

```bash
conda create -n lingbot-map python=3.10 -y
conda activate lingbot-map
pip install torch torchvision --index-url https://download.pytorch.org/whl/cu128
pip install -e .
pip install flashinfer-python -i https://flashinfer.ai/whl/cu128/torch2.9/
```

### Inference from Images
```bash
python demo.py --model_path lingbot-map.pt \\
    --image_folder /path/to/images/
```

### Inference from Video
```bash
python demo.py --model_path lingbot-map.pt \\
    --video_path video.mp4 --fps 10
```

### Long Sequences (10,000+ frames)
```bash
python demo.py --model_path lingbot-map.pt \\
    --image_folder /path/to/images/ \\
    --keyframe_interval 6
```

### Windowed Mode (>3000 frames)
```bash
python demo.py --model_path lingbot-map.pt \\
    --video_path video.mp4 --fps 10 \\
    --mode windowed --window_size 64
```

### Without FlashInfer (CPU fallback)
```bash
python demo.py --model_path lingbot-map.pt \\
    --image_folder /path/to/images/ --use_sdpa
```

### Model Checkpoints

| Name | Size | Description |
|---|---|---|
| `lingbot-map.pt` | 4.63 GB | Base model |
| `lingbot-map-long.pt` | 4.63 GB | Long-sequence variant |
| `lingbot-map-stage1.pt` | 4.76 GB | Stage 1 training checkpoint |

All available at [robbyant/lingbot-map](https://huggingface.co/robbyant/lingbot-map)
"""