File size: 186,363 Bytes
19d4cfa
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
>>>>> grad accum = 32
/usr/local/lib/python3.12/dist-packages/torch/cuda/__init__.py:63: FutureWarning: The pynvml package is deprecated. Please install nvidia-ml-py instead. If you did not install pynvml directly, please report this to the maintainers of the package that installed pynvml for you.
  import pynvml  # type: ignore[import]
W0127 17:10:05.230000 185005 torch/distributed/run.py:803] 
W0127 17:10:05.230000 185005 torch/distributed/run.py:803] *****************************************
W0127 17:10:05.230000 185005 torch/distributed/run.py:803] Setting OMP_NUM_THREADS environment variable for each process to be 1 in default, to avoid your system being overloaded, please further tune the variable for optimal performance in your application as needed. 
W0127 17:10:05.230000 185005 torch/distributed/run.py:803] *****************************************
/usr/local/lib/python3.12/dist-packages/torch/cuda/__init__.py:63: FutureWarning: The pynvml package is deprecated. Please install nvidia-ml-py instead. If you did not install pynvml directly, please report this to the maintainers of the package that installed pynvml for you.
  import pynvml  # type: ignore[import]
/usr/local/lib/python3.12/dist-packages/torch/cuda/__init__.py:63: FutureWarning: The pynvml package is deprecated. Please install nvidia-ml-py instead. If you did not install pynvml directly, please report this to the maintainers of the package that installed pynvml for you.
  import pynvml  # type: ignore[import]
Trainer._get_train_sampler replaced with custom implementation.
Trainer._get_train_sampler replaced with custom implementation.
[2026-01-27 17:10:12,014] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
[2026-01-27 17:10:12,029] [INFO] [real_accelerator.py:222:get_accelerator] Setting ds_accelerator to cuda (auto detect)
[2026-01-27 17:10:13,285] [INFO] [comm.py:658:init_distributed] cdb=None
[2026-01-27 17:10:13,285] [INFO] [comm.py:658:init_distributed] cdb=None
[2026-01-27 17:10:13,285] [INFO] [comm.py:689:init_distributed] Initializing TorchBackend in DeepSpeed with backend nccl
Warning: FlashAttention 3 is not available, falling back to PyTorch's scaled_dot_product_attention
Warning: FlashAttention 3 is not available, falling back to PyTorch's scaled_dot_product_attention
You are attempting to use Flash Attention 2.0 with a model not initialized on GPU. Make sure to move the model to GPU after initializing it on CPU with `model.to('cuda')`.
You are attempting to use Flash Attention 2.0 with a model not initialized on GPU. Make sure to move the model to GPU after initializing it on CPU with `model.to('cuda')`.

Loading checkpoint shards:   0%|          | 0/2 [00:00<?, ?it/s]
Loading checkpoint shards:   0%|          | 0/2 [00:00<?, ?it/s]
Loading checkpoint shards: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2/2 [00:02<00:00,  1.10s/it]
Loading checkpoint shards: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2/2 [00:02<00:00,  1.10s/it]
Some weights of Qwen2_5_VLForConditionalGenerationWithVGGT were not initialized from the model checkpoint at Qwen/Qwen2.5-VL-3B-Instruct and are newly initialized: ['geometry_encoder.vggt.aggregator.camera_token', 'geometry_encoder.vggt.aggregator.frame_blocks.0.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.0.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.0.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.0.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.0.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.0.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.0.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.0.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.0.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.0.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.0.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.0.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.0.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.0.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.0.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.0.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.0.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.0.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.1.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.1.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.1.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.1.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.1.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.1.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.1.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.1.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.1.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.1.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.1.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.1.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.1.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.1.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.1.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.1.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.1.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.1.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.10.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.10.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.10.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.10.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.10.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.10.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.10.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.10.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.10.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.10.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.10.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.10.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.10.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.10.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.10.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.10.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.10.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.10.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.11.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.11.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.11.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.11.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.11.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.11.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.11.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.11.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.11.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.11.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.11.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.11.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.11.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.11.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.11.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.11.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.11.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.11.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.12.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.12.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.12.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.12.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.12.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.12.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.12.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.12.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.12.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.12.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.12.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.12.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.12.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.12.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.12.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.12.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.12.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.12.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.13.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.13.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.13.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.13.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.13.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.13.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.13.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.13.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.13.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.13.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.13.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.13.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.13.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.13.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.13.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.13.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.13.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.13.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.14.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.14.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.14.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.14.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.14.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.14.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.14.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.14.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.14.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.14.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.14.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.14.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.14.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.14.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.14.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.14.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.14.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.14.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.15.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.15.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.15.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.15.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.15.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.15.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.15.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.15.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.15.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.15.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.15.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.15.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.15.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.15.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.15.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.15.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.15.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.15.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.16.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.16.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.16.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.16.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.16.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.16.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.16.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.16.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.16.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.16.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.16.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.16.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.16.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.16.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.16.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.16.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.16.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.16.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.17.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.17.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.17.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.17.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.17.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.17.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.17.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.17.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.17.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.17.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.17.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.17.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.17.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.17.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.17.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.17.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.17.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.17.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.18.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.18.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.18.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.18.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.18.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.18.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.18.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.18.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.18.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.18.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.18.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.18.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.18.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.18.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.18.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.18.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.18.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.18.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.19.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.19.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.19.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.19.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.19.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.19.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.19.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.19.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.19.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.19.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.19.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.19.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.19.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.19.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.19.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.19.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.19.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.19.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.2.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.2.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.2.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.2.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.2.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.2.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.2.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.2.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.2.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.2.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.2.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.2.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.2.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.2.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.2.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.2.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.2.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.2.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.20.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.20.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.20.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.20.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.20.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.20.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.20.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.20.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.20.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.20.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.20.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.20.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.20.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.20.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.20.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.20.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.20.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.20.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.21.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.21.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.21.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.21.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.21.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.21.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.21.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.21.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.21.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.21.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.21.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.21.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.21.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.21.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.21.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.21.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.21.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.21.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.22.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.22.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.22.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.22.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.22.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.22.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.22.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.22.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.22.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.22.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.22.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.22.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.22.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.22.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.22.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.22.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.22.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.22.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.23.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.23.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.23.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.23.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.23.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.23.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.23.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.23.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.23.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.23.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.23.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.23.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.23.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.23.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.23.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.23.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.23.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.23.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.3.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.3.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.3.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.3.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.3.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.3.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.3.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.3.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.3.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.3.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.3.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.3.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.3.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.3.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.3.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.3.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.3.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.3.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.4.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.4.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.4.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.4.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.4.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.4.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.4.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.4.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.4.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.4.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.4.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.4.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.4.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.4.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.4.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.4.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.4.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.4.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.5.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.5.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.5.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.5.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.5.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.5.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.5.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.5.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.5.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.5.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.5.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.5.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.5.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.5.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.5.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.5.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.5.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.5.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.6.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.6.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.6.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.6.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.6.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.6.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.6.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.6.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.6.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.6.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.6.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.6.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.6.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.6.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.6.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.6.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.6.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.6.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.7.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.7.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.7.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.7.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.7.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.7.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.7.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.7.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.7.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.7.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.7.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.7.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.7.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.7.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.7.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.7.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.7.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.7.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.8.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.8.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.8.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.8.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.8.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.8.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.8.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.8.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.8.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.8.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.8.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.8.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.8.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.8.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.8.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.8.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.8.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.8.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.9.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.9.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.9.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.9.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.9.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.9.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.9.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.9.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.9.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.9.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.9.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.9.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.9.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.9.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.9.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.9.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.9.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.9.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.0.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.0.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.0.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.0.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.0.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.0.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.0.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.0.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.0.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.0.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.0.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.0.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.0.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.0.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.0.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.0.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.0.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.0.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.1.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.1.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.1.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.1.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.1.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.1.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.1.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.1.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.1.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.1.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.1.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.1.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.1.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.1.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.1.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.1.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.1.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.1.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.10.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.10.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.10.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.10.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.10.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.10.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.10.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.10.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.10.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.10.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.10.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.10.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.10.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.10.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.10.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.10.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.10.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.10.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.11.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.11.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.11.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.11.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.11.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.11.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.11.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.11.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.11.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.11.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.11.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.11.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.11.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.11.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.11.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.11.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.11.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.11.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.12.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.12.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.12.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.12.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.12.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.12.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.12.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.12.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.12.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.12.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.12.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.12.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.12.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.12.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.12.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.12.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.12.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.12.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.13.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.13.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.13.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.13.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.13.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.13.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.13.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.13.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.13.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.13.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.13.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.13.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.13.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.13.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.13.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.13.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.13.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.13.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.14.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.14.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.14.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.14.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.14.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.14.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.14.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.14.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.14.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.14.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.14.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.14.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.14.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.14.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.14.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.14.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.14.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.14.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.15.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.15.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.15.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.15.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.15.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.15.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.15.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.15.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.15.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.15.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.15.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.15.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.15.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.15.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.15.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.15.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.15.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.15.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.16.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.16.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.16.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.16.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.16.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.16.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.16.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.16.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.16.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.16.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.16.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.16.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.16.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.16.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.16.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.16.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.16.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.16.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.17.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.17.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.17.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.17.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.17.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.17.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.17.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.17.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.17.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.17.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.17.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.17.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.17.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.17.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.17.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.17.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.17.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.17.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.18.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.18.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.18.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.18.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.18.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.18.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.18.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.18.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.18.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.18.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.18.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.18.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.18.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.18.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.18.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.18.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.18.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.18.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.19.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.19.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.19.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.19.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.19.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.19.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.19.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.19.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.19.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.19.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.19.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.19.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.19.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.19.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.19.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.19.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.19.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.19.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.2.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.2.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.2.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.2.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.2.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.2.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.2.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.2.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.2.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.2.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.2.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.2.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.2.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.2.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.2.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.2.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.2.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.2.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.20.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.20.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.20.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.20.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.20.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.20.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.20.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.20.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.20.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.20.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.20.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.20.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.20.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.20.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.20.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.20.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.20.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.20.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.21.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.21.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.21.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.21.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.21.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.21.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.21.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.21.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.21.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.21.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.21.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.21.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.21.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.21.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.21.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.21.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.21.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.21.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.22.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.22.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.22.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.22.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.22.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.22.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.22.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.22.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.22.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.22.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.22.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.22.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.22.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.22.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.22.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.22.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.22.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.22.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.23.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.23.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.23.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.23.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.23.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.23.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.23.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.23.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.23.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.23.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.23.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.23.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.23.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.23.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.23.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.23.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.23.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.23.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.3.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.3.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.3.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.3.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.3.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.3.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.3.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.3.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.3.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.3.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.3.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.3.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.3.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.3.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.3.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.3.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.3.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.3.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.4.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.4.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.4.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.4.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.4.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.4.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.4.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.4.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.4.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.4.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.4.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.4.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.4.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.4.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.4.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.4.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.4.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.4.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.5.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.5.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.5.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.5.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.5.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.5.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.5.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.5.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.5.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.5.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.5.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.5.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.5.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.5.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.5.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.5.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.5.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.5.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.6.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.6.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.6.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.6.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.6.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.6.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.6.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.6.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.6.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.6.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.6.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.6.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.6.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.6.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.6.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.6.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.6.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.6.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.7.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.7.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.7.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.7.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.7.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.7.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.7.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.7.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.7.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.7.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.7.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.7.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.7.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.7.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.7.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.7.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.7.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.7.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.8.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.8.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.8.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.8.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.8.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.8.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.8.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.8.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.8.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.8.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.8.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.8.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.8.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.8.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.8.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.8.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.8.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.8.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.9.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.9.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.9.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.9.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.9.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.9.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.9.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.9.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.9.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.9.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.9.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.9.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.9.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.9.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.9.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.9.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.9.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.9.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.0.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.0.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.0.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.0.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.0.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.0.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.0.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.0.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.0.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.0.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.0.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.0.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.0.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.0.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.1.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.1.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.1.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.1.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.1.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.1.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.1.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.1.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.1.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.1.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.1.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.1.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.1.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.1.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.10.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.10.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.10.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.10.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.10.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.10.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.10.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.10.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.10.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.10.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.10.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.10.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.10.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.10.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.11.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.11.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.11.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.11.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.11.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.11.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.11.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.11.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.11.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.11.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.11.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.11.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.11.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.11.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.12.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.12.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.12.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.12.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.12.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.12.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.12.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.12.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.12.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.12.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.12.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.12.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.12.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.12.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.13.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.13.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.13.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.13.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.13.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.13.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.13.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.13.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.13.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.13.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.13.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.13.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.13.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.13.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.14.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.14.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.14.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.14.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.14.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.14.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.14.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.14.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.14.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.14.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.14.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.14.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.14.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.14.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.15.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.15.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.15.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.15.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.15.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.15.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.15.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.15.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.15.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.15.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.15.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.15.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.15.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.15.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.16.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.16.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.16.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.16.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.16.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.16.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.16.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.16.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.16.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.16.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.16.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.16.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.16.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.16.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.17.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.17.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.17.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.17.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.17.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.17.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.17.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.17.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.17.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.17.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.17.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.17.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.17.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.17.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.18.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.18.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.18.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.18.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.18.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.18.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.18.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.18.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.18.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.18.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.18.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.18.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.18.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.18.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.19.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.19.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.19.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.19.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.19.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.19.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.19.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.19.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.19.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.19.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.19.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.19.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.19.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.19.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.2.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.2.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.2.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.2.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.2.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.2.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.2.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.2.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.2.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.2.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.2.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.2.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.2.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.2.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.20.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.20.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.20.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.20.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.20.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.20.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.20.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.20.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.20.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.20.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.20.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.20.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.20.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.20.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.21.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.21.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.21.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.21.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.21.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.21.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.21.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.21.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.21.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.21.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.21.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.21.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.21.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.21.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.22.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.22.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.22.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.22.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.22.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.22.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.22.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.22.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.22.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.22.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.22.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.22.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.22.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.22.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.23.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.23.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.23.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.23.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.23.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.23.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.23.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.23.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.23.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.23.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.23.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.23.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.23.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.23.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.3.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.3.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.3.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.3.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.3.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.3.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.3.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.3.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.3.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.3.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.3.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.3.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.3.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.3.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.4.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.4.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.4.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.4.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.4.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.4.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.4.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.4.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.4.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.4.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.4.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.4.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.4.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.4.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.5.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.5.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.5.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.5.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.5.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.5.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.5.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.5.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.5.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.5.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.5.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.5.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.5.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.5.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.6.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.6.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.6.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.6.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.6.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.6.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.6.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.6.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.6.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.6.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.6.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.6.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.6.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.6.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.7.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.7.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.7.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.7.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.7.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.7.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.7.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.7.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.7.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.7.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.7.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.7.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.7.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.7.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.8.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.8.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.8.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.8.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.8.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.8.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.8.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.8.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.8.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.8.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.8.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.8.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.8.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.8.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.9.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.9.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.9.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.9.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.9.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.9.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.9.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.9.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.9.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.9.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.9.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.9.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.9.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.9.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.cls_token', 'geometry_encoder.vggt.aggregator.patch_embed.mask_token', 'geometry_encoder.vggt.aggregator.patch_embed.norm.bias', 'geometry_encoder.vggt.aggregator.patch_embed.norm.weight', 'geometry_encoder.vggt.aggregator.patch_embed.patch_embed.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.patch_embed.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.pos_embed', 'geometry_encoder.vggt.aggregator.patch_embed.register_tokens', 'geometry_encoder.vggt.aggregator.register_token', 'language_feature_fusion.fusion_layers.0.0.geo_ln.weight', 'language_feature_fusion.fusion_layers.0.0.geo_mlp.0.bias', 'language_feature_fusion.fusion_layers.0.0.geo_mlp.0.weight', 'language_feature_fusion.fusion_layers.0.0.geo_mlp.2.bias', 'language_feature_fusion.fusion_layers.0.0.geo_mlp.2.weight', 'language_feature_fusion.fusion_layers.1.0.geo_ln.weight', 'language_feature_fusion.fusion_layers.1.0.geo_mlp.0.bias', 'language_feature_fusion.fusion_layers.1.0.geo_mlp.0.weight', 'language_feature_fusion.fusion_layers.1.0.geo_mlp.2.bias', 'language_feature_fusion.fusion_layers.1.0.geo_mlp.2.weight', 'language_feature_fusion.fusion_layers.2.0.geo_ln.weight', 'language_feature_fusion.fusion_layers.2.0.geo_mlp.0.bias', 'language_feature_fusion.fusion_layers.2.0.geo_mlp.0.weight', 'language_feature_fusion.fusion_layers.2.0.geo_mlp.2.bias', 'language_feature_fusion.fusion_layers.2.0.geo_mlp.2.weight', 'multi_layer_feature_fusion.fusion_layers.0.0.geo_ln.weight', 'multi_layer_feature_fusion.fusion_layers.0.0.geo_mlp.0.bias', 'multi_layer_feature_fusion.fusion_layers.0.0.geo_mlp.0.weight', 'multi_layer_feature_fusion.fusion_layers.0.0.geo_mlp.2.bias', 'multi_layer_feature_fusion.fusion_layers.0.0.geo_mlp.2.weight', 'multi_layer_feature_fusion.fusion_layers.1.0.geo_ln.weight', 'multi_layer_feature_fusion.fusion_layers.1.0.geo_mlp.0.bias', 'multi_layer_feature_fusion.fusion_layers.1.0.geo_mlp.0.weight', 'multi_layer_feature_fusion.fusion_layers.1.0.geo_mlp.2.bias', 'multi_layer_feature_fusion.fusion_layers.1.0.geo_mlp.2.weight', 'multi_layer_feature_fusion.fusion_layers.2.0.geo_ln.weight', 'multi_layer_feature_fusion.fusion_layers.2.0.geo_mlp.0.bias', 'multi_layer_feature_fusion.fusion_layers.2.0.geo_mlp.0.weight', 'multi_layer_feature_fusion.fusion_layers.2.0.geo_mlp.2.bias', 'multi_layer_feature_fusion.fusion_layers.2.0.geo_mlp.2.weight']
You should probably TRAIN this model on a down-stream task to be able to use it for predictions and inference.

Loading checkpoint shards: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2/2 [00:02<00:00,  1.12s/it]
Loading checkpoint shards: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2/2 [00:02<00:00,  1.12s/it]
Some weights of Qwen2_5_VLForConditionalGenerationWithVGGT were not initialized from the model checkpoint at Qwen/Qwen2.5-VL-3B-Instruct and are newly initialized: ['geometry_encoder.vggt.aggregator.camera_token', 'geometry_encoder.vggt.aggregator.frame_blocks.0.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.0.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.0.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.0.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.0.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.0.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.0.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.0.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.0.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.0.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.0.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.0.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.0.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.0.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.0.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.0.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.0.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.0.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.1.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.1.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.1.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.1.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.1.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.1.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.1.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.1.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.1.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.1.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.1.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.1.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.1.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.1.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.1.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.1.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.1.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.1.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.10.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.10.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.10.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.10.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.10.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.10.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.10.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.10.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.10.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.10.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.10.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.10.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.10.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.10.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.10.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.10.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.10.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.10.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.11.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.11.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.11.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.11.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.11.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.11.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.11.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.11.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.11.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.11.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.11.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.11.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.11.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.11.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.11.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.11.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.11.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.11.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.12.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.12.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.12.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.12.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.12.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.12.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.12.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.12.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.12.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.12.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.12.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.12.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.12.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.12.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.12.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.12.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.12.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.12.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.13.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.13.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.13.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.13.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.13.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.13.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.13.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.13.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.13.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.13.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.13.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.13.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.13.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.13.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.13.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.13.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.13.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.13.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.14.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.14.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.14.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.14.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.14.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.14.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.14.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.14.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.14.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.14.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.14.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.14.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.14.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.14.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.14.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.14.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.14.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.14.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.15.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.15.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.15.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.15.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.15.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.15.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.15.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.15.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.15.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.15.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.15.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.15.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.15.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.15.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.15.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.15.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.15.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.15.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.16.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.16.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.16.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.16.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.16.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.16.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.16.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.16.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.16.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.16.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.16.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.16.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.16.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.16.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.16.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.16.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.16.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.16.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.17.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.17.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.17.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.17.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.17.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.17.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.17.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.17.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.17.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.17.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.17.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.17.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.17.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.17.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.17.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.17.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.17.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.17.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.18.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.18.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.18.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.18.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.18.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.18.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.18.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.18.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.18.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.18.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.18.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.18.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.18.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.18.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.18.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.18.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.18.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.18.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.19.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.19.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.19.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.19.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.19.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.19.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.19.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.19.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.19.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.19.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.19.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.19.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.19.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.19.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.19.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.19.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.19.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.19.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.2.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.2.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.2.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.2.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.2.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.2.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.2.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.2.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.2.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.2.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.2.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.2.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.2.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.2.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.2.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.2.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.2.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.2.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.20.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.20.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.20.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.20.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.20.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.20.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.20.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.20.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.20.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.20.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.20.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.20.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.20.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.20.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.20.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.20.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.20.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.20.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.21.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.21.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.21.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.21.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.21.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.21.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.21.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.21.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.21.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.21.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.21.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.21.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.21.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.21.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.21.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.21.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.21.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.21.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.22.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.22.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.22.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.22.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.22.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.22.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.22.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.22.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.22.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.22.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.22.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.22.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.22.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.22.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.22.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.22.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.22.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.22.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.23.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.23.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.23.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.23.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.23.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.23.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.23.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.23.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.23.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.23.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.23.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.23.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.23.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.23.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.23.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.23.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.23.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.23.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.3.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.3.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.3.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.3.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.3.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.3.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.3.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.3.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.3.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.3.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.3.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.3.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.3.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.3.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.3.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.3.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.3.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.3.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.4.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.4.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.4.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.4.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.4.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.4.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.4.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.4.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.4.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.4.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.4.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.4.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.4.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.4.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.4.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.4.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.4.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.4.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.5.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.5.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.5.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.5.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.5.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.5.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.5.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.5.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.5.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.5.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.5.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.5.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.5.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.5.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.5.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.5.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.5.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.5.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.6.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.6.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.6.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.6.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.6.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.6.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.6.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.6.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.6.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.6.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.6.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.6.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.6.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.6.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.6.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.6.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.6.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.6.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.7.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.7.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.7.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.7.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.7.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.7.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.7.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.7.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.7.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.7.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.7.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.7.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.7.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.7.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.7.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.7.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.7.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.7.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.8.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.8.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.8.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.8.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.8.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.8.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.8.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.8.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.8.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.8.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.8.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.8.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.8.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.8.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.8.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.8.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.8.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.8.norm2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.9.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.9.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.9.attn.proj.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.9.attn.proj.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.9.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.9.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.9.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.9.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.9.ls1.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.9.ls2.gamma', 'geometry_encoder.vggt.aggregator.frame_blocks.9.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.9.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.9.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.9.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.9.norm1.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.9.norm1.weight', 'geometry_encoder.vggt.aggregator.frame_blocks.9.norm2.bias', 'geometry_encoder.vggt.aggregator.frame_blocks.9.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.0.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.0.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.0.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.0.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.0.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.0.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.0.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.0.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.0.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.0.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.0.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.0.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.0.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.0.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.0.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.0.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.0.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.0.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.1.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.1.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.1.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.1.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.1.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.1.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.1.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.1.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.1.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.1.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.1.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.1.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.1.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.1.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.1.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.1.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.1.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.1.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.10.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.10.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.10.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.10.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.10.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.10.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.10.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.10.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.10.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.10.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.10.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.10.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.10.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.10.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.10.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.10.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.10.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.10.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.11.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.11.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.11.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.11.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.11.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.11.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.11.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.11.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.11.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.11.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.11.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.11.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.11.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.11.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.11.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.11.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.11.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.11.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.12.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.12.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.12.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.12.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.12.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.12.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.12.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.12.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.12.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.12.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.12.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.12.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.12.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.12.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.12.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.12.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.12.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.12.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.13.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.13.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.13.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.13.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.13.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.13.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.13.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.13.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.13.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.13.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.13.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.13.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.13.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.13.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.13.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.13.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.13.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.13.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.14.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.14.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.14.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.14.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.14.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.14.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.14.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.14.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.14.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.14.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.14.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.14.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.14.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.14.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.14.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.14.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.14.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.14.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.15.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.15.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.15.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.15.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.15.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.15.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.15.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.15.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.15.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.15.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.15.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.15.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.15.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.15.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.15.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.15.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.15.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.15.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.16.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.16.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.16.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.16.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.16.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.16.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.16.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.16.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.16.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.16.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.16.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.16.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.16.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.16.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.16.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.16.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.16.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.16.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.17.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.17.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.17.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.17.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.17.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.17.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.17.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.17.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.17.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.17.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.17.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.17.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.17.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.17.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.17.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.17.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.17.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.17.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.18.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.18.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.18.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.18.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.18.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.18.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.18.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.18.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.18.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.18.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.18.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.18.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.18.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.18.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.18.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.18.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.18.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.18.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.19.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.19.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.19.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.19.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.19.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.19.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.19.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.19.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.19.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.19.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.19.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.19.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.19.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.19.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.19.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.19.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.19.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.19.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.2.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.2.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.2.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.2.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.2.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.2.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.2.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.2.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.2.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.2.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.2.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.2.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.2.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.2.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.2.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.2.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.2.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.2.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.20.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.20.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.20.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.20.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.20.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.20.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.20.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.20.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.20.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.20.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.20.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.20.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.20.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.20.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.20.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.20.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.20.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.20.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.21.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.21.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.21.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.21.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.21.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.21.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.21.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.21.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.21.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.21.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.21.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.21.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.21.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.21.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.21.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.21.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.21.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.21.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.22.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.22.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.22.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.22.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.22.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.22.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.22.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.22.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.22.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.22.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.22.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.22.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.22.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.22.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.22.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.22.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.22.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.22.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.23.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.23.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.23.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.23.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.23.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.23.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.23.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.23.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.23.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.23.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.23.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.23.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.23.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.23.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.23.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.23.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.23.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.23.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.3.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.3.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.3.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.3.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.3.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.3.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.3.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.3.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.3.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.3.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.3.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.3.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.3.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.3.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.3.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.3.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.3.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.3.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.4.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.4.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.4.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.4.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.4.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.4.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.4.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.4.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.4.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.4.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.4.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.4.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.4.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.4.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.4.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.4.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.4.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.4.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.5.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.5.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.5.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.5.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.5.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.5.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.5.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.5.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.5.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.5.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.5.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.5.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.5.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.5.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.5.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.5.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.5.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.5.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.6.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.6.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.6.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.6.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.6.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.6.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.6.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.6.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.6.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.6.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.6.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.6.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.6.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.6.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.6.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.6.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.6.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.6.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.7.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.7.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.7.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.7.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.7.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.7.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.7.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.7.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.7.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.7.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.7.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.7.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.7.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.7.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.7.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.7.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.7.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.7.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.8.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.8.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.8.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.8.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.8.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.8.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.8.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.8.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.8.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.8.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.8.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.8.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.8.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.8.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.8.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.8.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.8.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.8.norm2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.9.attn.k_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.9.attn.k_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.9.attn.proj.bias', 'geometry_encoder.vggt.aggregator.global_blocks.9.attn.proj.weight', 'geometry_encoder.vggt.aggregator.global_blocks.9.attn.q_norm.bias', 'geometry_encoder.vggt.aggregator.global_blocks.9.attn.q_norm.weight', 'geometry_encoder.vggt.aggregator.global_blocks.9.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.global_blocks.9.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.global_blocks.9.ls1.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.9.ls2.gamma', 'geometry_encoder.vggt.aggregator.global_blocks.9.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.9.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.9.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.9.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.global_blocks.9.norm1.bias', 'geometry_encoder.vggt.aggregator.global_blocks.9.norm1.weight', 'geometry_encoder.vggt.aggregator.global_blocks.9.norm2.bias', 'geometry_encoder.vggt.aggregator.global_blocks.9.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.0.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.0.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.0.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.0.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.0.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.0.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.0.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.0.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.0.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.0.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.0.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.0.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.0.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.0.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.1.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.1.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.1.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.1.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.1.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.1.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.1.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.1.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.1.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.1.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.1.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.1.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.1.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.1.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.10.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.10.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.10.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.10.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.10.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.10.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.10.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.10.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.10.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.10.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.10.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.10.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.10.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.10.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.11.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.11.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.11.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.11.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.11.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.11.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.11.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.11.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.11.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.11.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.11.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.11.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.11.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.11.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.12.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.12.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.12.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.12.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.12.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.12.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.12.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.12.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.12.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.12.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.12.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.12.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.12.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.12.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.13.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.13.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.13.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.13.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.13.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.13.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.13.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.13.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.13.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.13.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.13.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.13.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.13.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.13.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.14.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.14.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.14.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.14.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.14.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.14.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.14.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.14.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.14.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.14.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.14.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.14.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.14.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.14.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.15.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.15.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.15.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.15.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.15.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.15.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.15.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.15.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.15.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.15.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.15.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.15.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.15.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.15.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.16.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.16.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.16.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.16.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.16.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.16.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.16.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.16.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.16.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.16.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.16.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.16.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.16.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.16.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.17.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.17.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.17.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.17.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.17.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.17.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.17.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.17.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.17.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.17.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.17.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.17.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.17.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.17.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.18.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.18.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.18.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.18.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.18.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.18.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.18.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.18.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.18.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.18.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.18.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.18.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.18.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.18.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.19.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.19.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.19.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.19.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.19.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.19.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.19.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.19.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.19.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.19.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.19.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.19.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.19.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.19.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.2.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.2.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.2.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.2.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.2.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.2.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.2.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.2.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.2.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.2.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.2.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.2.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.2.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.2.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.20.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.20.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.20.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.20.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.20.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.20.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.20.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.20.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.20.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.20.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.20.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.20.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.20.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.20.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.21.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.21.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.21.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.21.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.21.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.21.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.21.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.21.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.21.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.21.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.21.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.21.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.21.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.21.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.22.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.22.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.22.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.22.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.22.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.22.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.22.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.22.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.22.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.22.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.22.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.22.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.22.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.22.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.23.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.23.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.23.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.23.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.23.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.23.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.23.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.23.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.23.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.23.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.23.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.23.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.23.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.23.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.3.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.3.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.3.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.3.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.3.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.3.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.3.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.3.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.3.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.3.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.3.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.3.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.3.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.3.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.4.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.4.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.4.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.4.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.4.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.4.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.4.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.4.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.4.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.4.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.4.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.4.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.4.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.4.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.5.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.5.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.5.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.5.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.5.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.5.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.5.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.5.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.5.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.5.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.5.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.5.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.5.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.5.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.6.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.6.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.6.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.6.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.6.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.6.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.6.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.6.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.6.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.6.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.6.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.6.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.6.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.6.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.7.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.7.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.7.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.7.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.7.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.7.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.7.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.7.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.7.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.7.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.7.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.7.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.7.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.7.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.8.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.8.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.8.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.8.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.8.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.8.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.8.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.8.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.8.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.8.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.8.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.8.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.8.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.8.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.9.attn.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.9.attn.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.9.attn.qkv.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.9.attn.qkv.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.9.ls1.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.9.ls2.gamma', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.9.mlp.fc1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.9.mlp.fc1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.9.mlp.fc2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.9.mlp.fc2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.9.norm1.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.9.norm1.weight', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.9.norm2.bias', 'geometry_encoder.vggt.aggregator.patch_embed.blocks.9.norm2.weight', 'geometry_encoder.vggt.aggregator.patch_embed.cls_token', 'geometry_encoder.vggt.aggregator.patch_embed.mask_token', 'geometry_encoder.vggt.aggregator.patch_embed.norm.bias', 'geometry_encoder.vggt.aggregator.patch_embed.norm.weight', 'geometry_encoder.vggt.aggregator.patch_embed.patch_embed.proj.bias', 'geometry_encoder.vggt.aggregator.patch_embed.patch_embed.proj.weight', 'geometry_encoder.vggt.aggregator.patch_embed.pos_embed', 'geometry_encoder.vggt.aggregator.patch_embed.register_tokens', 'geometry_encoder.vggt.aggregator.register_token', 'language_feature_fusion.fusion_layers.0.0.geo_ln.weight', 'language_feature_fusion.fusion_layers.0.0.geo_mlp.0.bias', 'language_feature_fusion.fusion_layers.0.0.geo_mlp.0.weight', 'language_feature_fusion.fusion_layers.0.0.geo_mlp.2.bias', 'language_feature_fusion.fusion_layers.0.0.geo_mlp.2.weight', 'language_feature_fusion.fusion_layers.1.0.geo_ln.weight', 'language_feature_fusion.fusion_layers.1.0.geo_mlp.0.bias', 'language_feature_fusion.fusion_layers.1.0.geo_mlp.0.weight', 'language_feature_fusion.fusion_layers.1.0.geo_mlp.2.bias', 'language_feature_fusion.fusion_layers.1.0.geo_mlp.2.weight', 'language_feature_fusion.fusion_layers.2.0.geo_ln.weight', 'language_feature_fusion.fusion_layers.2.0.geo_mlp.0.bias', 'language_feature_fusion.fusion_layers.2.0.geo_mlp.0.weight', 'language_feature_fusion.fusion_layers.2.0.geo_mlp.2.bias', 'language_feature_fusion.fusion_layers.2.0.geo_mlp.2.weight', 'multi_layer_feature_fusion.fusion_layers.0.0.geo_ln.weight', 'multi_layer_feature_fusion.fusion_layers.0.0.geo_mlp.0.bias', 'multi_layer_feature_fusion.fusion_layers.0.0.geo_mlp.0.weight', 'multi_layer_feature_fusion.fusion_layers.0.0.geo_mlp.2.bias', 'multi_layer_feature_fusion.fusion_layers.0.0.geo_mlp.2.weight', 'multi_layer_feature_fusion.fusion_layers.1.0.geo_ln.weight', 'multi_layer_feature_fusion.fusion_layers.1.0.geo_mlp.0.bias', 'multi_layer_feature_fusion.fusion_layers.1.0.geo_mlp.0.weight', 'multi_layer_feature_fusion.fusion_layers.1.0.geo_mlp.2.bias', 'multi_layer_feature_fusion.fusion_layers.1.0.geo_mlp.2.weight', 'multi_layer_feature_fusion.fusion_layers.2.0.geo_ln.weight', 'multi_layer_feature_fusion.fusion_layers.2.0.geo_mlp.0.bias', 'multi_layer_feature_fusion.fusion_layers.2.0.geo_mlp.0.weight', 'multi_layer_feature_fusion.fusion_layers.2.0.geo_mlp.2.bias', 'multi_layer_feature_fusion.fusion_layers.2.0.geo_mlp.2.weight']
You should probably TRAIN this model on a down-stream task to be able to use it for predictions and inference.
Using a slow image processor as `use_fast` is unset and a slow processor was saved with this model. `use_fast=True` will be the default behavior in v4.50, even if the model was saved with a slow processor. This will result in minor differences in outputs. You'll still be able to use a slow processor with `use_fast=False`.
Using a slow image processor as `use_fast` is unset and a slow processor was saved with this model. `use_fast=True` will be the default behavior in v4.50, even if the model was saved with a slow processor. This will result in minor differences in outputs. You'll still be able to use a slow processor with `use_fast=False`.
Vision Module - Attention Blocks:
Trainable Block Indices: None
Non-Trainable Block Indices: [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31]
Merger Module Trainable: False
LLM Module - Embed Tokens Trainable: True
LLM Module - Trainable Layer Indices: [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35]
LLM Module - Non-Trainable Layer Indices: None
Qwen2_5_VLConfig {
  "_attn_implementation_autoset": true,
  "architectures": [
    "Qwen2_5_VLForConditionalGeneration"
  ],
  "attention_dropout": 0.0,
  "bos_token_id": 151643,
  "eos_token_id": 151645,
  "feature_fusion_method": "deepstack_language_add",
  "fusion_num_layers": 1,
  "geometry_encoder_layers": [
    11,
    17,
    23
  ],
  "geometry_encoder_type": "vggt",
  "geometry_fusion_layers": [
    0,
    1,
    2
  ],
  "geometry_merger_type": "mlp",
  "hidden_act": "silu",
  "hidden_size": 2048,
  "image_token_id": 151655,
  "include_camera_token": false,
  "initializer_range": 0.02,
  "intermediate_size": 11008,
  "max_position_embeddings": 128000,
  "max_window_layers": 70,
  "model_type": "qwen2_5_vl",
  "num_attention_heads": 16,
  "num_hidden_layers": 36,
  "num_key_value_heads": 2,
  "pos_encoding_type": "none",
  "reference_frame": "first",
  "rms_norm_eps": 1e-06,
  "rope_scaling": {
    "mrope_section": [
      16,
      24,
      24
    ],
    "rope_type": "default",
    "type": "default"
  },
  "rope_theta": 1000000.0,
  "sliding_window": 32768,
  "tie_word_embeddings": true,
  "torch_dtype": "bfloat16",
  "transformers_version": "4.50.0",
  "use_cache": false,
  "use_geometry_encoder": true,
  "use_sliding_window": false,
  "video_token_id": 151656,
  "vision_config": {
    "depth": 32,
    "fullatt_block_indexes": [
      7,
      15,
      23,
      31
    ],
    "hidden_act": "silu",
    "hidden_size": 1280,
    "in_channels": 3,
    "in_chans": 3,
    "intermediate_size": 3420,
    "model_type": "qwen2_5_vl",
    "num_heads": 16,
    "out_hidden_size": 2048,
    "patch_size": 14,
    "spatial_merge_size": 2,
    "spatial_patch_size": 14,
    "temporal_patch_size": 2,
    "tokens_per_second": 2,
    "torch_dtype": "bfloat16",
    "window_size": 112
  },
  "vision_end_token_id": 151653,
  "vision_language_fusion_layers": null,
  "vision_start_token_id": 151652,
  "vision_token_id": 151654,
  "vocab_size": 151936
}

Loading datasets: [{'annotation_path': 'data/train/spar_234k.json', 'data_path': 'data/media', 'tag': '3d', 'sampling_rate': 0.6, 'dataset_name': 'spar_234k'}, {'annotation_path': 'data/train/llava_hound_64k.json', 'data_path': 'data/media', 'tag': '2d', 'sampling_rate': 0.6, 'dataset_name': 'llava_hound_64k'}, {'annotation_path': 'data/vlm3r/annotations/vsibench_train/merged_qa_scannet_train.json', 'data_path': 'data/vlm3r/media', 'tag': '3d', 'sampling_rate': 0.6, 'dataset_name': 'vlm3r_scannet'}, {'annotation_path': 'data/vsi_590k/annotations/vsi_appearance_order_vsibench_scannet.json', 'data_path': 'data/vsi_590k/media', 'tag': '3d', 'sampling_rate': 0.5, 'dataset_name': 'vsi_appr_order'}]
Qwen2_5_VLConfig {
  "_attn_implementation_autoset": true,
  "architectures": [
    "Qwen2_5_VLForConditionalGeneration"
  ],
  "attention_dropout": 0.0,
  "bos_token_id": 151643,
  "eos_token_id": 151645,
  "feature_fusion_method": "deepstack_language_add",
  "fusion_num_layers": 1,
  "geometry_encoder_layers": [
    11,
    17,
    23
  ],
  "geometry_encoder_type": "vggt",
  "geometry_fusion_layers": [
    0,
    1,
    2
  ],
  "geometry_merger_type": "mlp",
  "hidden_act": "silu",
  "hidden_size": 2048,
  "image_token_id": 151655,
  "include_camera_token": false,
  "initializer_range": 0.02,
  "intermediate_size": 11008,
  "max_position_embeddings": 128000,
  "max_window_layers": 70,
  "model_type": "qwen2_5_vl",
  "num_attention_heads": 16,
  "num_hidden_layers": 36,
  "num_key_value_heads": 2,
  "pos_encoding_type": "none",
  "reference_frame": "first",
  "rms_norm_eps": 1e-06,
  "rope_scaling": {
    "mrope_section": [
      16,
      24,
      24
    ],
    "rope_type": "default",
    "type": "default"
  },
  "rope_theta": 1000000.0,
  "sliding_window": 32768,
  "tie_word_embeddings": true,
  "torch_dtype": "bfloat16",
  "transformers_version": "4.50.0",
  "use_cache": false,
  "use_geometry_encoder": true,
  "use_sliding_window": false,
  "video_token_id": 151656,
  "vision_config": {
    "depth": 32,
    "fullatt_block_indexes": [
      7,
      15,
      23,
      31
    ],
    "hidden_act": "silu",
    "hidden_size": 1280,
    "in_channels": 3,
    "in_chans": 3,
    "intermediate_size": 3420,
    "model_type": "qwen2_5_vl",
    "num_heads": 16,
    "out_hidden_size": 2048,
    "patch_size": 14,
    "spatial_merge_size": 2,
    "spatial_patch_size": 14,
    "temporal_patch_size": 2,
    "tokens_per_second": 2,
    "torch_dtype": "bfloat16",
    "window_size": 112
  },
  "vision_end_token_id": 151653,
  "vision_language_fusion_layers": null,
  "vision_start_token_id": 151652,
  "vision_token_id": 151654,
  "vocab_size": 151936
}

Loading datasets: [{'annotation_path': 'data/train/spar_234k.json', 'data_path': 'data/media', 'tag': '3d', 'sampling_rate': 0.6, 'dataset_name': 'spar_234k'}, {'annotation_path': 'data/train/llava_hound_64k.json', 'data_path': 'data/media', 'tag': '2d', 'sampling_rate': 0.6, 'dataset_name': 'llava_hound_64k'}, {'annotation_path': 'data/vlm3r/annotations/vsibench_train/merged_qa_scannet_train.json', 'data_path': 'data/vlm3r/media', 'tag': '3d', 'sampling_rate': 0.6, 'dataset_name': 'vlm3r_scannet'}, {'annotation_path': 'data/vsi_590k/annotations/vsi_appearance_order_vsibench_scannet.json', 'data_path': 'data/vsi_590k/media', 'tag': '3d', 'sampling_rate': 0.5, 'dataset_name': 'vsi_appr_order'}]
sampling 140566 examples from dataset {'annotation_path': 'data/train/spar_234k.json', 'data_path': 'data/media', 'tag': '3d', 'sampling_rate': 0.6, 'dataset_name': 'spar_234k'}
sampling 140566 examples from dataset {'annotation_path': 'data/train/spar_234k.json', 'data_path': 'data/media', 'tag': '3d', 'sampling_rate': 0.6, 'dataset_name': 'spar_234k'}
sampling 38250 examples from dataset {'annotation_path': 'data/train/llava_hound_64k.json', 'data_path': 'data/media', 'tag': '2d', 'sampling_rate': 0.6, 'dataset_name': 'llava_hound_64k'}
sampling 38250 examples from dataset {'annotation_path': 'data/train/llava_hound_64k.json', 'data_path': 'data/media', 'tag': '2d', 'sampling_rate': 0.6, 'dataset_name': 'llava_hound_64k'}
sampling 31067 examples from dataset {'annotation_path': 'data/vlm3r/annotations/vsibench_train/merged_qa_scannet_train.json', 'data_path': 'data/vlm3r/media', 'tag': '3d', 'sampling_rate': 0.6, 'dataset_name': 'vlm3r_scannet'}
sampling 1909 examples from dataset {'annotation_path': 'data/vsi_590k/annotations/vsi_appearance_order_vsibench_scannet.json', 'data_path': 'data/vsi_590k/media', 'tag': '3d', 'sampling_rate': 0.5, 'dataset_name': 'vsi_appr_order'}
Total training samples: 211792
sampling 31067 examples from dataset {'annotation_path': 'data/vlm3r/annotations/vsibench_train/merged_qa_scannet_train.json', 'data_path': 'data/vlm3r/media', 'tag': '3d', 'sampling_rate': 0.6, 'dataset_name': 'vlm3r_scannet'}
sampling 1909 examples from dataset {'annotation_path': 'data/vsi_590k/annotations/vsi_appearance_order_vsibench_scannet.json', 'data_path': 'data/vsi_590k/media', 'tag': '3d', 'sampling_rate': 0.5, 'dataset_name': 'vsi_appr_order'}
Total training samples: 211792
Formatting inputs...Skip in lazy mode
Formatting inputs...Skip in lazy mode

  0%|          | 0/100 [00:00<?, ?it/s][Try #0] Failed to fetch sample 137297. Exception: [Errno 2] No such file or directory: 'data/media/spar/scannetpp/images/eb4bc76767/video_color/frame0_0.jpg'
[Try #0] Failed to fetch sample 123783. Exception: image file is truncated (5 bytes not processed)
[Try #1] Failed to fetch sample 137297. Exception: [Errno 2] No such file or directory: 'data/media/spar/scannetpp/images/eb4bc76767/video_color/frame0_0.jpg'
[Try #1] Failed to fetch sample 123783. Exception: image file is truncated (5 bytes not processed)
[Try #2] Failed to fetch sample 137297. Exception: [Errno 2] No such file or directory: 'data/media/spar/scannetpp/images/eb4bc76767/video_color/frame0_0.jpg'
[Try #2] Failed to fetch sample 123783. Exception: image file is truncated (5 bytes not processed)
[Try #0] Failed to fetch sample 2443. Exception: [Errno 2] No such file or directory: 'data/media/spar/scannetpp/images/c0b3c65080/video_color/frame0_0.jpg'
[Try #0] Failed to fetch sample 185500. Exception: [Errno 2] No such file or directory: 'data/media/spar/scannetpp/images/88f265fe25/image_color/4700.jpg'
[Try #1] Failed to fetch sample 2443. Exception: [Errno 2] No such file or directory: 'data/media/spar/scannetpp/images/c0b3c65080/video_color/frame0_0.jpg'
[Try #1] Failed to fetch sample 185500. Exception: [Errno 2] No such file or directory: 'data/media/spar/scannetpp/images/88f265fe25/image_color/4700.jpg'
[Try #2] Failed to fetch sample 2443. Exception: [Errno 2] No such file or directory: 'data/media/spar/scannetpp/images/c0b3c65080/video_color/frame0_0.jpg'
[Try #2] Failed to fetch sample 185500. Exception: [Errno 2] No such file or directory: 'data/media/spar/scannetpp/images/88f265fe25/image_color/4700.jpg'
/usr/local/lib/python3.12/dist-packages/torch/utils/checkpoint.py:85: UserWarning: None of the inputs have requires_grad=True. Gradients will be None
  warnings.warn(
/usr/local/lib/python3.12/dist-packages/torch/utils/checkpoint.py:85: UserWarning: None of the inputs have requires_grad=True. Gradients will be None
  warnings.warn(
/workspace/src/qwen_vl/model/geometry_encoders/vggt_encoder.py:68: FutureWarning: `torch.cuda.amp.autocast(args...)` is deprecated. Please use `torch.amp.autocast('cuda', args...)` instead.
  with torch.cuda.amp.autocast(dtype=dtype):
/usr/local/lib/python3.12/dist-packages/torch/utils/checkpoint.py:85: UserWarning: None of the inputs have requires_grad=True. Gradients will be None
  warnings.warn(
/usr/local/lib/python3.12/dist-packages/torch/utils/checkpoint.py:85: UserWarning: None of the inputs have requires_grad=True. Gradients will be None
  warnings.warn(
/workspace/src/qwen_vl/model/geometry_encoders/vggt_encoder.py:68: FutureWarning: `torch.cuda.amp.autocast(args...)` is deprecated. Please use `torch.amp.autocast('cuda', args...)` instead.
  with torch.cuda.amp.autocast(dtype=dtype):

  1%|          | 1/100 [00:41<1:08:39, 41.61s/it][Try #0] Failed to fetch sample 136435. Exception: [Errno 2] No such file or directory: 'data/media/spar/scannetpp/images/a08dda47a8/image_color/1470.jpg'
[Try #1] Failed to fetch sample 136435. Exception: [Errno 2] No such file or directory: 'data/media/spar/scannetpp/images/a08dda47a8/image_color/1470.jpg'
[Try #0] Failed to fetch sample 43044. Exception: [Errno 2] No such file or directory: 'data/media/spar/scannetpp/images/16c9bd2e1e/image_color/1740.jpg'
[Try #2] Failed to fetch sample 136435. Exception: [Errno 2] No such file or directory: 'data/media/spar/scannetpp/images/a08dda47a8/image_color/1470.jpg'
[Try #1] Failed to fetch sample 43044. Exception: [Errno 2] No such file or directory: 'data/media/spar/scannetpp/images/16c9bd2e1e/image_color/1740.jpg'
[Try #2] Failed to fetch sample 43044. Exception: [Errno 2] No such file or directory: 'data/media/spar/scannetpp/images/16c9bd2e1e/image_color/1740.jpg'

  2%|▏         | 2/100 [01:23<1:08:20, 41.84s/it]/usr/local/lib/python3.12/dist-packages/torch/utils/checkpoint.py:85: UserWarning: None of the inputs have requires_grad=True. Gradients will be None
  warnings.warn(
/workspace/src/qwen_vl/model/geometry_encoders/vggt_encoder.py:68: FutureWarning: `torch.cuda.amp.autocast(args...)` is deprecated. Please use `torch.amp.autocast('cuda', args...)` instead.
  with torch.cuda.amp.autocast(dtype=dtype):

  3%|β–Ž         | 3/100 [01:58<1:02:52, 38.89s/it][Try #0] Failed to fetch sample 6901. Exception: [Errno 2] No such file or directory: 'data/media/spar/scannetpp/images/eab5494dca/video_color/frame0_0.jpg'
[Try #1] Failed to fetch sample 6901. Exception: [Errno 2] No such file or directory: 'data/media/spar/scannetpp/images/eab5494dca/video_color/frame0_0.jpg'
[Try #2] Failed to fetch sample 6901. Exception: [Errno 2] No such file or directory: 'data/media/spar/scannetpp/images/eab5494dca/video_color/frame0_0.jpg'
W0127 17:13:38.319000 185005 torch/distributed/elastic/agent/server/api.py:732] Received 15 death signal, shutting down workers
W0127 17:13:38.321000 185005 torch/distributed/elastic/multiprocessing/api.py:906] Sending process 185073 closing signal SIGTERM
W0127 17:13:38.322000 185005 torch/distributed/elastic/multiprocessing/api.py:906] Sending process 185074 closing signal SIGTERM
Traceback (most recent call last):
  File "/usr/local/bin/torchrun", line 7, in <module>
    sys.exit(main())
             ^^^^^^
  File "/usr/local/lib/python3.12/dist-packages/torch/distributed/elastic/multiprocessing/errors/__init__.py", line 357, in wrapper
    return f(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.12/dist-packages/torch/distributed/run.py", line 936, in main
    run(args)
  File "/usr/local/lib/python3.12/dist-packages/torch/distributed/run.py", line 927, in run
    elastic_launch(
  File "/usr/local/lib/python3.12/dist-packages/torch/distributed/launcher/api.py", line 151, in __call__
    return launch_agent(self._config, self._entrypoint, list(args))
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.12/dist-packages/torch/distributed/launcher/api.py", line 279, in launch_agent
    result = agent.run()
             ^^^^^^^^^^^
  File "/usr/local/lib/python3.12/dist-packages/torch/distributed/elastic/metrics/api.py", line 138, in wrapper
    result = f(*args, **kwargs)
             ^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.12/dist-packages/torch/distributed/elastic/agent/server/api.py", line 724, in run
    result = self._invoke_run(role)
             ^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.12/dist-packages/torch/distributed/elastic/agent/server/api.py", line 888, in _invoke_run
    time.sleep(monitor_interval)
  File "/usr/local/lib/python3.12/dist-packages/torch/distributed/elastic/multiprocessing/api.py", line 85, in _terminate_process_handler
    raise SignalException(f"Process {os.getpid()} got signal: {sigval}", sigval=sigval)
torch.distributed.elastic.multiprocessing.api.SignalException: Process 185005 got signal: 15