Spaces:
Running
| Initializing dependencies... | |
| Loading weights: 0%| | 0/398 [00:00<?, ?it/s] | |
| Loading weights: 0%| | 1/398 [00:00<00:00, 14926.35it/s, Materializing param=logit_scale] | |
| Loading weights: 0%| | 1/398 [00:00<00:00, 4917.12it/s, Materializing param=logit_scale] | |
| Loading weights: 1%| | 2/398 [00:00<00:00, 4940.29it/s, Materializing param=text_model.embeddings.position_embedding.weight] | |
| Loading weights: 1%| | 2/398 [00:00<00:00, 4286.46it/s, Materializing param=text_model.embeddings.position_embedding.weight] | |
| Loading weights: 1%| | 3/398 [00:00<00:00, 5073.75it/s, Materializing param=text_model.embeddings.token_embedding.weight] | |
| Loading weights: 1%| | 3/398 [00:00<00:00, 4689.87it/s, Materializing param=text_model.embeddings.token_embedding.weight] | |
| Loading weights: 1%|1 | 4/398 [00:00<00:00, 3939.24it/s, Materializing param=text_model.encoder.layers.0.layer_norm1.bias] | |
| Loading weights: 1%|1 | 4/398 [00:00<00:00, 3643.26it/s, Materializing param=text_model.encoder.layers.0.layer_norm1.bias] | |
| Loading weights: 1%|1 | 5/398 [00:00<00:00, 4073.72it/s, Materializing param=text_model.encoder.layers.0.layer_norm1.weight] | |
| Loading weights: 1%|1 | 5/398 [00:00<00:00, 3906.04it/s, Materializing param=text_model.encoder.layers.0.layer_norm1.weight] | |
| Loading weights: 2%|1 | 6/398 [00:00<00:00, 4310.69it/s, Materializing param=text_model.encoder.layers.0.layer_norm2.bias] | |
| Loading weights: 2%|1 | 6/398 [00:00<00:00, 4158.27it/s, Materializing param=text_model.encoder.layers.0.layer_norm2.bias] | |
| Loading weights: 2%|1 | 7/398 [00:00<00:00, 4243.41it/s, Materializing param=text_model.encoder.layers.0.layer_norm2.weight] | |
| Loading weights: 2%|1 | 7/398 [00:00<00:00, 3953.16it/s, Materializing param=text_model.encoder.layers.0.layer_norm2.weight] | |
| Loading weights: 2%|2 | 8/398 [00:00<00:00, 4132.32it/s, Materializing param=text_model.encoder.layers.0.mlp.fc1.bias] | |
| Loading weights: 2%|2 | 8/398 [00:00<00:00, 4003.15it/s, Materializing param=text_model.encoder.layers.0.mlp.fc1.bias] | |
| Loading weights: 2%|2 | 9/398 [00:00<00:00, 4245.72it/s, Materializing param=text_model.encoder.layers.0.mlp.fc1.weight] | |
| Loading weights: 2%|2 | 9/398 [00:00<00:00, 4149.58it/s, Materializing param=text_model.encoder.layers.0.mlp.fc1.weight] | |
| Loading weights: 3%|2 | 10/398 [00:00<00:00, 4385.51it/s, Materializing param=text_model.encoder.layers.0.mlp.fc2.bias] | |
| Loading weights: 3%|2 | 10/398 [00:00<00:00, 4260.77it/s, Materializing param=text_model.encoder.layers.0.mlp.fc2.bias] | |
| Loading weights: 3%|2 | 11/398 [00:00<00:00, 4458.15it/s, Materializing param=text_model.encoder.layers.0.mlp.fc2.weight] | |
| Loading weights: 3%|2 | 11/398 [00:00<00:00, 4348.48it/s, Materializing param=text_model.encoder.layers.0.mlp.fc2.weight] | |
| Loading weights: 3%|3 | 12/398 [00:00<00:00, 4521.35it/s, Materializing param=text_model.encoder.layers.0.self_attn.k_proj.bias] | |
| Loading weights: 3%|3 | 12/398 [00:00<00:00, 4420.10it/s, Materializing param=text_model.encoder.layers.0.self_attn.k_proj.bias] | |
| Loading weights: 3%|3 | 13/398 [00:00<00:00, 4586.25it/s, Materializing param=text_model.encoder.layers.0.self_attn.k_proj.weight] | |
| Loading weights: 3%|3 | 13/398 [00:00<00:00, 4487.73it/s, Materializing param=text_model.encoder.layers.0.self_attn.k_proj.weight] | |
| Loading weights: 4%|3 | 14/398 [00:00<00:00, 4662.56it/s, Materializing param=text_model.encoder.layers.0.self_attn.out_proj.bias] | |
| Loading weights: 4%|3 | 14/398 [00:00<00:00, 4590.03it/s, Materializing param=text_model.encoder.layers.0.self_attn.out_proj.bias] | |
| Loading weights: 4%|3 | 15/398 [00:00<00:00, 4779.65it/s, Materializing param=text_model.encoder.layers.0.self_attn.out_proj.weight] | |
| Loading weights: 4%|3 | 15/398 [00:00<00:00, 4702.84it/s, Materializing param=text_model.encoder.layers.0.self_attn.out_proj.weight] | |
| Loading weights: 4%|4 | 16/398 [00:00<00:00, 4889.53it/s, Materializing param=text_model.encoder.layers.0.self_attn.q_proj.bias] | |
| Loading weights: 4%|4 | 16/398 [00:00<00:00, 4824.85it/s, Materializing param=text_model.encoder.layers.0.self_attn.q_proj.bias] | |
| Loading weights: 4%|4 | 17/398 [00:00<00:00, 5000.92it/s, Materializing param=text_model.encoder.layers.0.self_attn.q_proj.weight] | |
| Loading weights: 4%|4 | 17/398 [00:00<00:00, 4940.29it/s, Materializing param=text_model.encoder.layers.0.self_attn.q_proj.weight] | |
| Loading weights: 5%|4 | 18/398 [00:00<00:00, 5110.85it/s, Materializing param=text_model.encoder.layers.0.self_attn.v_proj.bias] | |
| Loading weights: 5%|4 | 18/398 [00:00<00:00, 5048.98it/s, Materializing param=text_model.encoder.layers.0.self_attn.v_proj.bias] | |
| Loading weights: 5%|4 | 19/398 [00:00<00:00, 5213.38it/s, Materializing param=text_model.encoder.layers.0.self_attn.v_proj.weight] | |
| Loading weights: 5%|4 | 19/398 [00:00<00:00, 5151.71it/s, Materializing param=text_model.encoder.layers.0.self_attn.v_proj.weight] | |
| Loading weights: 5%|5 | 20/398 [00:00<00:00, 5309.92it/s, Materializing param=text_model.encoder.layers.1.layer_norm1.bias] | |
| Loading weights: 5%|5 | 20/398 [00:00<00:00, 5247.47it/s, Materializing param=text_model.encoder.layers.1.layer_norm1.bias] | |
| Loading weights: 5%|5 | 21/398 [00:00<00:00, 5388.83it/s, Materializing param=text_model.encoder.layers.1.layer_norm1.weight] | |
| Loading weights: 5%|5 | 21/398 [00:00<00:00, 5328.52it/s, Materializing param=text_model.encoder.layers.1.layer_norm1.weight] | |
| Loading weights: 6%|5 | 22/398 [00:00<00:00, 5470.40it/s, Materializing param=text_model.encoder.layers.1.layer_norm2.bias] | |
| Loading weights: 6%|5 | 22/398 [00:00<00:00, 5408.83it/s, Materializing param=text_model.encoder.layers.1.layer_norm2.bias] | |
| Loading weights: 6%|5 | 23/398 [00:00<00:00, 5541.97it/s, Materializing param=text_model.encoder.layers.1.layer_norm2.weight] | |
| Loading weights: 6%|5 | 23/398 [00:00<00:00, 5486.18it/s, Materializing param=text_model.encoder.layers.1.layer_norm2.weight] | |
| Loading weights: 6%|6 | 24/398 [00:00<00:00, 5618.94it/s, Materializing param=text_model.encoder.layers.1.mlp.fc1.bias] | |
| Loading weights: 6%|6 | 24/398 [00:00<00:00, 5564.89it/s, Materializing param=text_model.encoder.layers.1.mlp.fc1.bias] | |
| Loading weights: 6%|6 | 25/398 [00:00<00:00, 5690.43it/s, Materializing param=text_model.encoder.layers.1.mlp.fc1.weight] | |
| Loading weights: 6%|6 | 25/398 [00:00<00:00, 5637.20it/s, Materializing param=text_model.encoder.layers.1.mlp.fc1.weight] | |
| Loading weights: 7%|6 | 26/398 [00:00<00:00, 5761.71it/s, Materializing param=text_model.encoder.layers.1.mlp.fc2.bias] | |
| Loading weights: 7%|6 | 26/398 [00:00<00:00, 5706.54it/s, Materializing param=text_model.encoder.layers.1.mlp.fc2.bias] | |
| Loading weights: 7%|6 | 27/398 [00:00<00:00, 5823.33it/s, Materializing param=text_model.encoder.layers.1.mlp.fc2.weight] | |
| Loading weights: 7%|6 | 27/398 [00:00<00:00, 5766.39it/s, Materializing param=text_model.encoder.layers.1.mlp.fc2.weight] | |
| Loading weights: 7%|7 | 28/398 [00:00<00:00, 5875.84it/s, Materializing param=text_model.encoder.layers.1.self_attn.k_proj.bias] | |
| Loading weights: 7%|7 | 28/398 [00:00<00:00, 5822.53it/s, Materializing param=text_model.encoder.layers.1.self_attn.k_proj.bias] | |
| Loading weights: 7%|7 | 29/398 [00:00<00:00, 5922.14it/s, Materializing param=text_model.encoder.layers.1.self_attn.k_proj.weight] | |
| Loading weights: 7%|7 | 29/398 [00:00<00:00, 5862.77it/s, Materializing param=text_model.encoder.layers.1.self_attn.k_proj.weight] | |
| Loading weights: 8%|7 | 30/398 [00:00<00:00, 5962.05it/s, Materializing param=text_model.encoder.layers.1.self_attn.out_proj.bias] | |
| Loading weights: 8%|7 | 30/398 [00:00<00:00, 5910.52it/s, Materializing param=text_model.encoder.layers.1.self_attn.out_proj.bias] | |
| Loading weights: 8%|7 | 31/398 [00:00<00:00, 6007.09it/s, Materializing param=text_model.encoder.layers.1.self_attn.out_proj.weight] | |
| Loading weights: 8%|7 | 31/398 [00:00<00:00, 5947.73it/s, Materializing param=text_model.encoder.layers.1.self_attn.out_proj.weight] | |
| Loading weights: 8%|8 | 32/398 [00:00<00:00, 6048.84it/s, Materializing param=text_model.encoder.layers.1.self_attn.q_proj.bias] | |
| Loading weights: 8%|8 | 32/398 [00:00<00:00, 5998.29it/s, Materializing param=text_model.encoder.layers.1.self_attn.q_proj.bias] | |
| Loading weights: 8%|8 | 33/398 [00:00<00:00, 6095.84it/s, Materializing param=text_model.encoder.layers.1.self_attn.q_proj.weight] | |
| Loading weights: 8%|8 | 33/398 [00:00<00:00, 6044.99it/s, Materializing param=text_model.encoder.layers.1.self_attn.q_proj.weight] | |
| Loading weights: 9%|8 | 34/398 [00:00<00:00, 6137.04it/s, Materializing param=text_model.encoder.layers.1.self_attn.v_proj.bias] | |
| Loading weights: 9%|8 | 34/398 [00:00<00:00, 6088.56it/s, Materializing param=text_model.encoder.layers.1.self_attn.v_proj.bias] | |
| Loading weights: 9%|8 | 35/398 [00:00<00:00, 5925.83it/s, Materializing param=text_model.encoder.layers.1.self_attn.v_proj.weight] | |
| Loading weights: 9%|8 | 35/398 [00:00<00:00, 5828.20it/s, Materializing param=text_model.encoder.layers.1.self_attn.v_proj.weight] | |
| Loading weights: 9%|9 | 36/398 [00:00<00:00, 5861.38it/s, Materializing param=text_model.encoder.layers.2.layer_norm1.bias] | |
| Loading weights: 9%|9 | 36/398 [00:00<00:00, 5813.31it/s, Materializing param=text_model.encoder.layers.2.layer_norm1.bias] | |
| Loading weights: 9%|9 | 37/398 [00:00<00:00, 5886.18it/s, Materializing param=text_model.encoder.layers.2.layer_norm1.weight] | |
| Loading weights: 9%|9 | 37/398 [00:00<00:00, 5841.21it/s, Materializing param=text_model.encoder.layers.2.layer_norm1.weight] | |
| Loading weights: 10%|9 | 38/398 [00:00<00:00, 5917.56it/s, Materializing param=text_model.encoder.layers.2.layer_norm2.bias] | |
| Loading weights: 10%|9 | 38/398 [00:00<00:00, 5806.96it/s, Materializing param=text_model.encoder.layers.2.layer_norm2.bias] | |
| Loading weights: 10%|9 | 39/398 [00:00<00:00, 5875.85it/s, Materializing param=text_model.encoder.layers.2.layer_norm2.weight] | |
| Loading weights: 10%|9 | 39/398 [00:00<00:00, 5833.73it/s, Materializing param=text_model.encoder.layers.2.layer_norm2.weight] | |
| Loading weights: 10%|# | 40/398 [00:00<00:00, 5804.46it/s, Materializing param=text_model.encoder.layers.2.mlp.fc1.bias] | |
| Loading weights: 10%|# | 40/398 [00:00<00:00, 5720.16it/s, Materializing param=text_model.encoder.layers.2.mlp.fc1.bias] | |
| Loading weights: 10%|# | 41/398 [00:00<00:00, 5736.80it/s, Materializing param=text_model.encoder.layers.2.mlp.fc1.weight] | |
| Loading weights: 10%|# | 41/398 [00:00<00:00, 5695.39it/s, Materializing param=text_model.encoder.layers.2.mlp.fc1.weight] | |
| Loading weights: 11%|# | 42/398 [00:00<00:00, 5760.46it/s, Materializing param=text_model.encoder.layers.2.mlp.fc2.bias] | |
| Loading weights: 11%|# | 42/398 [00:00<00:00, 5722.48it/s, Materializing param=text_model.encoder.layers.2.mlp.fc2.bias] | |
| Loading weights: 11%|# | 43/398 [00:00<00:00, 5746.90it/s, Materializing param=text_model.encoder.layers.2.mlp.fc2.weight] | |
| Loading weights: 11%|# | 43/398 [00:00<00:00, 5574.60it/s, Materializing param=text_model.encoder.layers.2.mlp.fc2.weight] | |
| Loading weights: 11%|#1 | 44/398 [00:00<00:00, 5395.71it/s, Materializing param=text_model.encoder.layers.2.self_attn.k_proj.bias] | |
| Loading weights: 11%|#1 | 44/398 [00:00<00:00, 5333.03it/s, Materializing param=text_model.encoder.layers.2.self_attn.k_proj.bias] | |
| Loading weights: 11%|#1 | 45/398 [00:00<00:00, 5348.82it/s, Materializing param=text_model.encoder.layers.2.self_attn.k_proj.weight] | |
| Loading weights: 11%|#1 | 45/398 [00:00<00:00, 5301.49it/s, Materializing param=text_model.encoder.layers.2.self_attn.k_proj.weight] | |
| Loading weights: 12%|#1 | 46/398 [00:00<00:00, 5296.57it/s, Materializing param=text_model.encoder.layers.2.self_attn.out_proj.bias] | |
| Loading weights: 12%|#1 | 46/398 [00:00<00:00, 5249.16it/s, Materializing param=text_model.encoder.layers.2.self_attn.out_proj.bias] | |
| Loading weights: 12%|#1 | 47/398 [00:00<00:00, 5284.34it/s, Materializing param=text_model.encoder.layers.2.self_attn.out_proj.weight] | |
| Loading weights: 12%|#1 | 47/398 [00:00<00:00, 5246.65it/s, Materializing param=text_model.encoder.layers.2.self_attn.out_proj.weight] | |
| Loading weights: 12%|#2 | 48/398 [00:00<00:00, 5303.51it/s, Materializing param=text_model.encoder.layers.2.self_attn.q_proj.bias] | |
| Loading weights: 12%|#2 | 48/398 [00:00<00:00, 5274.06it/s, Materializing param=text_model.encoder.layers.2.self_attn.q_proj.bias] | |
| Loading weights: 12%|#2 | 49/398 [00:00<00:00, 5333.49it/s, Materializing param=text_model.encoder.layers.2.self_attn.q_proj.weight] | |
| Loading weights: 12%|#2 | 49/398 [00:00<00:00, 5308.29it/s, Materializing param=text_model.encoder.layers.2.self_attn.q_proj.weight] | |
| Loading weights: 13%|#2 | 50/398 [00:00<00:00, 5368.64it/s, Materializing param=text_model.encoder.layers.2.self_attn.v_proj.bias] | |
| Loading weights: 13%|#2 | 50/398 [00:00<00:00, 5343.20it/s, Materializing param=text_model.encoder.layers.2.self_attn.v_proj.bias] | |
| Loading weights: 13%|#2 | 51/398 [00:00<00:00, 5401.89it/s, Materializing param=text_model.encoder.layers.2.self_attn.v_proj.weight] | |
| Loading weights: 13%|#2 | 51/398 [00:00<00:00, 5378.26it/s, Materializing param=text_model.encoder.layers.2.self_attn.v_proj.weight] | |
| Loading weights: 13%|#3 | 52/398 [00:00<00:00, 5426.95it/s, Materializing param=text_model.encoder.layers.3.layer_norm1.bias] | |
| Loading weights: 13%|#3 | 52/398 [00:00<00:00, 5393.80it/s, Materializing param=text_model.encoder.layers.3.layer_norm1.bias] | |
| Loading weights: 13%|#3 | 53/398 [00:00<00:00, 5435.43it/s, Materializing param=text_model.encoder.layers.3.layer_norm1.weight] | |
| Loading weights: 13%|#3 | 53/398 [00:00<00:00, 5410.56it/s, Materializing param=text_model.encoder.layers.3.layer_norm1.weight] | |
| Loading weights: 14%|#3 | 54/398 [00:00<00:00, 5458.31it/s, Materializing param=text_model.encoder.layers.3.layer_norm2.bias] | |
| Loading weights: 14%|#3 | 54/398 [00:00<00:00, 5431.60it/s, Materializing param=text_model.encoder.layers.3.layer_norm2.bias] | |
| Loading weights: 14%|#3 | 55/398 [00:00<00:00, 5487.31it/s, Materializing param=text_model.encoder.layers.3.layer_norm2.weight] | |
| Loading weights: 14%|#3 | 55/398 [00:00<00:00, 5462.76it/s, Materializing param=text_model.encoder.layers.3.layer_norm2.weight] | |
| Loading weights: 14%|#4 | 56/398 [00:00<00:00, 5518.17it/s, Materializing param=text_model.encoder.layers.3.mlp.fc1.bias] | |
| Loading weights: 14%|#4 | 56/398 [00:00<00:00, 5493.78it/s, Materializing param=text_model.encoder.layers.3.mlp.fc1.bias] | |
| Loading weights: 14%|#4 | 57/398 [00:00<00:00, 5546.86it/s, Materializing param=text_model.encoder.layers.3.mlp.fc1.weight] | |
| Loading weights: 14%|#4 | 57/398 [00:00<00:00, 5523.28it/s, Materializing param=text_model.encoder.layers.3.mlp.fc1.weight] | |
| Loading weights: 15%|#4 | 58/398 [00:00<00:00, 5575.10it/s, Materializing param=text_model.encoder.layers.3.mlp.fc2.bias] | |
| Loading weights: 15%|#4 | 58/398 [00:00<00:00, 5552.45it/s, Materializing param=text_model.encoder.layers.3.mlp.fc2.bias] | |
| Loading weights: 15%|#4 | 59/398 [00:00<00:00, 5604.69it/s, Materializing param=text_model.encoder.layers.3.mlp.fc2.weight] | |
| Loading weights: 15%|#4 | 59/398 [00:00<00:00, 5581.94it/s, Materializing param=text_model.encoder.layers.3.mlp.fc2.weight] | |
| Loading weights: 15%|#5 | 60/398 [00:00<00:00, 5636.62it/s, Materializing param=text_model.encoder.layers.3.self_attn.k_proj.bias] | |
| Loading weights: 15%|#5 | 60/398 [00:00<00:00, 5614.61it/s, Materializing param=text_model.encoder.layers.3.self_attn.k_proj.bias] | |
| Loading weights: 15%|#5 | 61/398 [00:00<00:00, 5664.84it/s, Materializing param=text_model.encoder.layers.3.self_attn.k_proj.weight] | |
| Loading weights: 15%|#5 | 61/398 [00:00<00:00, 5642.85it/s, Materializing param=text_model.encoder.layers.3.self_attn.k_proj.weight] | |
| Loading weights: 16%|#5 | 62/398 [00:00<00:00, 5684.58it/s, Materializing param=text_model.encoder.layers.3.self_attn.out_proj.bias] | |
| Loading weights: 16%|#5 | 62/398 [00:00<00:00, 5655.16it/s, Materializing param=text_model.encoder.layers.3.self_attn.out_proj.bias] | |
| Loading weights: 16%|#5 | 63/398 [00:00<00:00, 5701.49it/s, Materializing param=text_model.encoder.layers.3.self_attn.out_proj.weight] | |
| Loading weights: 16%|#5 | 63/398 [00:00<00:00, 5673.21it/s, Materializing param=text_model.encoder.layers.3.self_attn.out_proj.weight] | |
| Loading weights: 16%|#6 | 64/398 [00:00<00:00, 5714.19it/s, Materializing param=text_model.encoder.layers.3.self_attn.q_proj.bias] | |
| Loading weights: 16%|#6 | 64/398 [00:00<00:00, 5692.02it/s, Materializing param=text_model.encoder.layers.3.self_attn.q_proj.bias] | |
| Loading weights: 16%|#6 | 65/398 [00:00<00:00, 5734.26it/s, Materializing param=text_model.encoder.layers.3.self_attn.q_proj.weight] | |
| Loading weights: 16%|#6 | 65/398 [00:00<00:00, 5711.08it/s, Materializing param=text_model.encoder.layers.3.self_attn.q_proj.weight] | |
| Loading weights: 17%|#6 | 66/398 [00:00<00:00, 5747.53it/s, Materializing param=text_model.encoder.layers.3.self_attn.v_proj.bias] | |
| Loading weights: 17%|#6 | 66/398 [00:00<00:00, 5725.66it/s, Materializing param=text_model.encoder.layers.3.self_attn.v_proj.bias] | |
| Loading weights: 17%|#6 | 67/398 [00:00<00:00, 5764.95it/s, Materializing param=text_model.encoder.layers.3.self_attn.v_proj.weight] | |
| Loading weights: 17%|#6 | 67/398 [00:00<00:00, 5743.04it/s, Materializing param=text_model.encoder.layers.3.self_attn.v_proj.weight] | |
| Loading weights: 17%|#7 | 68/398 [00:00<00:00, 5782.90it/s, Materializing param=text_model.encoder.layers.4.layer_norm1.bias] | |
| Loading weights: 17%|#7 | 68/398 [00:00<00:00, 5757.34it/s, Materializing param=text_model.encoder.layers.4.layer_norm1.bias] | |
| Loading weights: 17%|#7 | 69/398 [00:00<00:00, 5796.37it/s, Materializing param=text_model.encoder.layers.4.layer_norm1.weight] | |
| Loading weights: 17%|#7 | 69/398 [00:00<00:00, 5774.74it/s, Materializing param=text_model.encoder.layers.4.layer_norm1.weight] | |
| Loading weights: 18%|#7 | 70/398 [00:00<00:00, 5812.85it/s, Materializing param=text_model.encoder.layers.4.layer_norm2.bias] | |
| Loading weights: 18%|#7 | 70/398 [00:00<00:00, 5791.18it/s, Materializing param=text_model.encoder.layers.4.layer_norm2.bias] | |
| Loading weights: 18%|#7 | 71/398 [00:00<00:00, 5831.13it/s, Materializing param=text_model.encoder.layers.4.layer_norm2.weight] | |
| Loading weights: 18%|#7 | 71/398 [00:00<00:00, 4570.92it/s, Materializing param=text_model.encoder.layers.4.layer_norm2.weight] | |
| Loading weights: 18%|#8 | 72/398 [00:00<00:00, 4543.80it/s, Materializing param=text_model.encoder.layers.4.mlp.fc1.bias] | |
| Loading weights: 18%|#8 | 72/398 [00:00<00:00, 4517.29it/s, Materializing param=text_model.encoder.layers.4.mlp.fc1.bias] | |
| Loading weights: 18%|#8 | 73/398 [00:00<00:00, 4537.54it/s, Materializing param=text_model.encoder.layers.4.mlp.fc1.weight] | |
| Loading weights: 18%|#8 | 73/398 [00:00<00:00, 4519.72it/s, Materializing param=text_model.encoder.layers.4.mlp.fc1.weight] | |
| Loading weights: 19%|#8 | 74/398 [00:00<00:00, 4547.27it/s, Materializing param=text_model.encoder.layers.4.mlp.fc2.bias] | |
| Loading weights: 19%|#8 | 74/398 [00:00<00:00, 4532.53it/s, Materializing param=text_model.encoder.layers.4.mlp.fc2.bias] | |
| Loading weights: 19%|#8 | 75/398 [00:00<00:00, 4564.12it/s, Materializing param=text_model.encoder.layers.4.mlp.fc2.weight] | |
| Loading weights: 19%|#8 | 75/398 [00:00<00:00, 4548.54it/s, Materializing param=text_model.encoder.layers.4.mlp.fc2.weight] | |
| Loading weights: 19%|#9 | 76/398 [00:00<00:00, 4580.91it/s, Materializing param=text_model.encoder.layers.4.self_attn.k_proj.bias] | |
| Loading weights: 19%|#9 | 76/398 [00:00<00:00, 4568.63it/s, Materializing param=text_model.encoder.layers.4.self_attn.k_proj.bias] | |
| Loading weights: 19%|#9 | 77/398 [00:00<00:00, 4603.67it/s, Materializing param=text_model.encoder.layers.4.self_attn.k_proj.weight] | |
| Loading weights: 19%|#9 | 77/398 [00:00<00:00, 4591.17it/s, Materializing param=text_model.encoder.layers.4.self_attn.k_proj.weight] | |
| Loading weights: 20%|#9 | 78/398 [00:00<00:00, 4625.87it/s, Materializing param=text_model.encoder.layers.4.self_attn.out_proj.bias] | |
| Loading weights: 20%|#9 | 78/398 [00:00<00:00, 4615.56it/s, Materializing param=text_model.encoder.layers.4.self_attn.out_proj.bias] | |
| Loading weights: 20%|#9 | 79/398 [00:00<00:00, 4643.75it/s, Materializing param=text_model.encoder.layers.4.self_attn.out_proj.weight] | |
| Loading weights: 20%|#9 | 79/398 [00:00<00:00, 4631.16it/s, Materializing param=text_model.encoder.layers.4.self_attn.out_proj.weight] | |
| Loading weights: 20%|## | 80/398 [00:00<00:00, 4667.66it/s, Materializing param=text_model.encoder.layers.4.self_attn.q_proj.bias] | |
| Loading weights: 20%|## | 80/398 [00:00<00:00, 4656.33it/s, Materializing param=text_model.encoder.layers.4.self_attn.q_proj.bias] | |
| Loading weights: 20%|## | 81/398 [00:00<00:00, 4693.24it/s, Materializing param=text_model.encoder.layers.4.self_attn.q_proj.weight] | |
| Loading weights: 20%|## | 81/398 [00:00<00:00, 4681.66it/s, Materializing param=text_model.encoder.layers.4.self_attn.q_proj.weight] | |
| Loading weights: 21%|## | 82/398 [00:00<00:00, 4718.20it/s, Materializing param=text_model.encoder.layers.4.self_attn.v_proj.bias] | |
| Loading weights: 21%|## | 82/398 [00:00<00:00, 4707.09it/s, Materializing param=text_model.encoder.layers.4.self_attn.v_proj.bias] | |
| Loading weights: 21%|## | 83/398 [00:00<00:00, 4743.39it/s, Materializing param=text_model.encoder.layers.4.self_attn.v_proj.weight] | |
| Loading weights: 21%|## | 83/398 [00:00<00:00, 4731.85it/s, Materializing param=text_model.encoder.layers.4.self_attn.v_proj.weight] | |
| Loading weights: 21%|##1 | 84/398 [00:00<00:00, 4767.35it/s, Materializing param=text_model.encoder.layers.5.layer_norm1.bias] | |
| Loading weights: 21%|##1 | 84/398 [00:00<00:00, 4756.28it/s, Materializing param=text_model.encoder.layers.5.layer_norm1.bias] | |
| Loading weights: 21%|##1 | 85/398 [00:00<00:00, 4792.65it/s, Materializing param=text_model.encoder.layers.5.layer_norm1.weight] | |
| Loading weights: 21%|##1 | 85/398 [00:00<00:00, 4781.34it/s, Materializing param=text_model.encoder.layers.5.layer_norm1.weight] | |
| Loading weights: 22%|##1 | 86/398 [00:00<00:00, 4817.69it/s, Materializing param=text_model.encoder.layers.5.layer_norm2.bias] | |
| Loading weights: 22%|##1 | 86/398 [00:00<00:00, 4806.78it/s, Materializing param=text_model.encoder.layers.5.layer_norm2.bias] | |
| Loading weights: 22%|##1 | 87/398 [00:00<00:00, 4842.73it/s, Materializing param=text_model.encoder.layers.5.layer_norm2.weight] | |
| Loading weights: 22%|##1 | 87/398 [00:00<00:00, 4831.38it/s, Materializing param=text_model.encoder.layers.5.layer_norm2.weight] | |
| Loading weights: 22%|##2 | 88/398 [00:00<00:00, 4866.74it/s, Materializing param=text_model.encoder.layers.5.mlp.fc1.bias] | |
| Loading weights: 22%|##2 | 88/398 [00:00<00:00, 4856.05it/s, Materializing param=text_model.encoder.layers.5.mlp.fc1.bias] | |
| Loading weights: 22%|##2 | 89/398 [00:00<00:00, 4891.03it/s, Materializing param=text_model.encoder.layers.5.mlp.fc1.weight] | |
| Loading weights: 22%|##2 | 89/398 [00:00<00:00, 4880.03it/s, Materializing param=text_model.encoder.layers.5.mlp.fc1.weight] | |
| Loading weights: 23%|##2 | 90/398 [00:00<00:00, 4914.05it/s, Materializing param=text_model.encoder.layers.5.mlp.fc2.bias] | |
| Loading weights: 23%|##2 | 90/398 [00:00<00:00, 4902.56it/s, Materializing param=text_model.encoder.layers.5.mlp.fc2.bias] | |
| Loading weights: 23%|##2 | 91/398 [00:00<00:00, 4936.13it/s, Materializing param=text_model.encoder.layers.5.mlp.fc2.weight] | |
| Loading weights: 23%|##2 | 91/398 [00:00<00:00, 4925.24it/s, Materializing param=text_model.encoder.layers.5.mlp.fc2.weight] | |
| Loading weights: 23%|##3 | 92/398 [00:00<00:00, 4956.47it/s, Materializing param=text_model.encoder.layers.5.self_attn.k_proj.bias] | |
| Loading weights: 23%|##3 | 92/398 [00:00<00:00, 4945.29it/s, Materializing param=text_model.encoder.layers.5.self_attn.k_proj.bias] | |
| Loading weights: 23%|##3 | 93/398 [00:00<00:00, 4977.61it/s, Materializing param=text_model.encoder.layers.5.self_attn.k_proj.weight] | |
| Loading weights: 23%|##3 | 93/398 [00:00<00:00, 4966.39it/s, Materializing param=text_model.encoder.layers.5.self_attn.k_proj.weight] | |
| Loading weights: 24%|##3 | 94/398 [00:00<00:00, 4999.04it/s, Materializing param=text_model.encoder.layers.5.self_attn.out_proj.bias] | |
| Loading weights: 24%|##3 | 94/398 [00:00<00:00, 4987.72it/s, Materializing param=text_model.encoder.layers.5.self_attn.out_proj.bias] | |
| Loading weights: 24%|##3 | 95/398 [00:00<00:00, 5020.33it/s, Materializing param=text_model.encoder.layers.5.self_attn.out_proj.weight] | |
| Loading weights: 24%|##3 | 95/398 [00:00<00:00, 5009.41it/s, Materializing param=text_model.encoder.layers.5.self_attn.out_proj.weight] | |
| Loading weights: 24%|##4 | 96/398 [00:00<00:00, 5042.18it/s, Materializing param=text_model.encoder.layers.5.self_attn.q_proj.bias] | |
| Loading weights: 24%|##4 | 96/398 [00:00<00:00, 5031.53it/s, Materializing param=text_model.encoder.layers.5.self_attn.q_proj.bias] | |
| Loading weights: 24%|##4 | 97/398 [00:00<00:00, 5063.82it/s, Materializing param=text_model.encoder.layers.5.self_attn.q_proj.weight] | |
| Loading weights: 24%|##4 | 97/398 [00:00<00:00, 5053.19it/s, Materializing param=text_model.encoder.layers.5.self_attn.q_proj.weight] | |
| Loading weights: 25%|##4 | 98/398 [00:00<00:00, 5085.26it/s, Materializing param=text_model.encoder.layers.5.self_attn.v_proj.bias] | |
| Loading weights: 25%|##4 | 98/398 [00:00<00:00, 5074.46it/s, Materializing param=text_model.encoder.layers.5.self_attn.v_proj.bias] | |
| Loading weights: 25%|##4 | 99/398 [00:00<00:00, 5106.58it/s, Materializing param=text_model.encoder.layers.5.self_attn.v_proj.weight] | |
| Loading weights: 25%|##4 | 99/398 [00:00<00:00, 5095.48it/s, Materializing param=text_model.encoder.layers.5.self_attn.v_proj.weight] | |
| Loading weights: 25%|##5 | 100/398 [00:00<00:00, 5127.20it/s, Materializing param=text_model.encoder.layers.6.layer_norm1.bias] | |
| Loading weights: 25%|##5 | 100/398 [00:00<00:00, 5116.07it/s, Materializing param=text_model.encoder.layers.6.layer_norm1.bias] | |
| Loading weights: 25%|##5 | 101/398 [00:00<00:00, 5146.95it/s, Materializing param=text_model.encoder.layers.6.layer_norm1.weight] | |
| Loading weights: 25%|##5 | 101/398 [00:00<00:00, 5135.97it/s, Materializing param=text_model.encoder.layers.6.layer_norm1.weight] | |
| Loading weights: 26%|##5 | 102/398 [00:00<00:00, 5166.83it/s, Materializing param=text_model.encoder.layers.6.layer_norm2.bias] | |
| Loading weights: 26%|##5 | 102/398 [00:00<00:00, 5156.06it/s, Materializing param=text_model.encoder.layers.6.layer_norm2.bias] | |
| Loading weights: 26%|##5 | 103/398 [00:00<00:00, 5187.29it/s, Materializing param=text_model.encoder.layers.6.layer_norm2.weight] | |
| Loading weights: 26%|##5 | 103/398 [00:00<00:00, 5176.48it/s, Materializing param=text_model.encoder.layers.6.layer_norm2.weight] | |
| Loading weights: 26%|##6 | 104/398 [00:00<00:00, 5207.45it/s, Materializing param=text_model.encoder.layers.6.mlp.fc1.bias] | |
| Loading weights: 26%|##6 | 104/398 [00:00<00:00, 5196.72it/s, Materializing param=text_model.encoder.layers.6.mlp.fc1.bias] | |
| Loading weights: 26%|##6 | 105/398 [00:00<00:00, 5226.64it/s, Materializing param=text_model.encoder.layers.6.mlp.fc1.weight] | |
| Loading weights: 26%|##6 | 105/398 [00:00<00:00, 5215.99it/s, Materializing param=text_model.encoder.layers.6.mlp.fc1.weight] | |
| Loading weights: 27%|##6 | 106/398 [00:00<00:00, 5245.29it/s, Materializing param=text_model.encoder.layers.6.mlp.fc2.bias] | |
| Loading weights: 27%|##6 | 106/398 [00:00<00:00, 5234.61it/s, Materializing param=text_model.encoder.layers.6.mlp.fc2.bias] | |
| Loading weights: 27%|##6 | 107/398 [00:00<00:00, 5263.48it/s, Materializing param=text_model.encoder.layers.6.mlp.fc2.weight] | |
| Loading weights: 27%|##6 | 107/398 [00:00<00:00, 5252.64it/s, Materializing param=text_model.encoder.layers.6.mlp.fc2.weight] | |
| Loading weights: 27%|##7 | 108/398 [00:00<00:00, 5282.19it/s, Materializing param=text_model.encoder.layers.6.self_attn.k_proj.bias] | |
| Loading weights: 27%|##7 | 108/398 [00:00<00:00, 5271.37it/s, Materializing param=text_model.encoder.layers.6.self_attn.k_proj.bias] | |
| Loading weights: 27%|##7 | 109/398 [00:00<00:00, 5298.78it/s, Materializing param=text_model.encoder.layers.6.self_attn.k_proj.weight] | |
| Loading weights: 27%|##7 | 109/398 [00:00<00:00, 5288.30it/s, Materializing param=text_model.encoder.layers.6.self_attn.k_proj.weight] | |
| Loading weights: 28%|##7 | 110/398 [00:00<00:00, 5316.95it/s, Materializing param=text_model.encoder.layers.6.self_attn.out_proj.bias] | |
| Loading weights: 28%|##7 | 110/398 [00:00<00:00, 5306.25it/s, Materializing param=text_model.encoder.layers.6.self_attn.out_proj.bias] | |
| Loading weights: 28%|##7 | 111/398 [00:00<00:00, 5334.61it/s, Materializing param=text_model.encoder.layers.6.self_attn.out_proj.weight] | |
| Loading weights: 28%|##7 | 111/398 [00:00<00:00, 5323.51it/s, Materializing param=text_model.encoder.layers.6.self_attn.out_proj.weight] | |
| Loading weights: 28%|##8 | 112/398 [00:00<00:00, 5351.46it/s, Materializing param=text_model.encoder.layers.6.self_attn.q_proj.bias] | |
| Loading weights: 28%|##8 | 112/398 [00:00<00:00, 5340.45it/s, Materializing param=text_model.encoder.layers.6.self_attn.q_proj.bias] | |
| Loading weights: 28%|##8 | 113/398 [00:00<00:00, 5368.18it/s, Materializing param=text_model.encoder.layers.6.self_attn.q_proj.weight] | |
| Loading weights: 28%|##8 | 113/398 [00:00<00:00, 5357.38it/s, Materializing param=text_model.encoder.layers.6.self_attn.q_proj.weight] | |
| Loading weights: 29%|##8 | 114/398 [00:00<00:00, 5385.06it/s, Materializing param=text_model.encoder.layers.6.self_attn.v_proj.bias] | |
| Loading weights: 29%|##8 | 114/398 [00:00<00:00, 5374.65it/s, Materializing param=text_model.encoder.layers.6.self_attn.v_proj.bias] | |
| Loading weights: 29%|##8 | 115/398 [00:00<00:00, 5402.73it/s, Materializing param=text_model.encoder.layers.6.self_attn.v_proj.weight] | |
| Loading weights: 29%|##8 | 115/398 [00:00<00:00, 5392.22it/s, Materializing param=text_model.encoder.layers.6.self_attn.v_proj.weight] | |
| Loading weights: 29%|##9 | 116/398 [00:00<00:00, 5419.72it/s, Materializing param=text_model.encoder.layers.7.layer_norm1.bias] | |
| Loading weights: 29%|##9 | 116/398 [00:00<00:00, 5409.00it/s, Materializing param=text_model.encoder.layers.7.layer_norm1.bias] | |
| Loading weights: 29%|##9 | 117/398 [00:00<00:00, 5436.29it/s, Materializing param=text_model.encoder.layers.7.layer_norm1.weight] | |
| Loading weights: 29%|##9 | 117/398 [00:00<00:00, 5426.19it/s, Materializing param=text_model.encoder.layers.7.layer_norm1.weight] | |
| Loading weights: 30%|##9 | 118/398 [00:00<00:00, 5453.93it/s, Materializing param=text_model.encoder.layers.7.layer_norm2.bias] | |
| Loading weights: 30%|##9 | 118/398 [00:00<00:00, 5443.67it/s, Materializing param=text_model.encoder.layers.7.layer_norm2.bias] | |
| Loading weights: 30%|##9 | 119/398 [00:00<00:00, 5471.09it/s, Materializing param=text_model.encoder.layers.7.layer_norm2.weight] | |
| Loading weights: 30%|##9 | 119/398 [00:00<00:00, 5460.80it/s, Materializing param=text_model.encoder.layers.7.layer_norm2.weight] | |
| Loading weights: 30%|### | 120/398 [00:00<00:00, 5488.07it/s, Materializing param=text_model.encoder.layers.7.mlp.fc1.bias] | |
| Loading weights: 30%|### | 120/398 [00:00<00:00, 5478.10it/s, Materializing param=text_model.encoder.layers.7.mlp.fc1.bias] | |
| Loading weights: 30%|### | 121/398 [00:00<00:00, 5504.63it/s, Materializing param=text_model.encoder.layers.7.mlp.fc1.weight] | |
| Loading weights: 30%|### | 121/398 [00:00<00:00, 5494.68it/s, Materializing param=text_model.encoder.layers.7.mlp.fc1.weight] | |
| Loading weights: 31%|### | 122/398 [00:00<00:00, 5521.50it/s, Materializing param=text_model.encoder.layers.7.mlp.fc2.bias] | |
| Loading weights: 31%|### | 122/398 [00:00<00:00, 5511.63it/s, Materializing param=text_model.encoder.layers.7.mlp.fc2.bias] | |
| Loading weights: 31%|### | 123/398 [00:00<00:00, 5538.02it/s, Materializing param=text_model.encoder.layers.7.mlp.fc2.weight] | |
| Loading weights: 31%|### | 123/398 [00:00<00:00, 5527.57it/s, Materializing param=text_model.encoder.layers.7.mlp.fc2.weight] | |
| Loading weights: 31%|###1 | 124/398 [00:00<00:00, 5477.50it/s, Materializing param=text_model.encoder.layers.7.self_attn.k_proj.bias] | |
| Loading weights: 31%|###1 | 124/398 [00:00<00:00, 5453.60it/s, Materializing param=text_model.encoder.layers.7.self_attn.k_proj.bias] | |
| Loading weights: 31%|###1 | 125/398 [00:00<00:00, 5466.06it/s, Materializing param=text_model.encoder.layers.7.self_attn.k_proj.weight] | |
| Loading weights: 31%|###1 | 125/398 [00:00<00:00, 5453.49it/s, Materializing param=text_model.encoder.layers.7.self_attn.k_proj.weight] | |
| Loading weights: 32%|###1 | 126/398 [00:00<00:00, 5474.00it/s, Materializing param=text_model.encoder.layers.7.self_attn.out_proj.bias] | |
| Loading weights: 32%|###1 | 126/398 [00:00<00:00, 5463.31it/s, Materializing param=text_model.encoder.layers.7.self_attn.out_proj.bias] | |
| Loading weights: 32%|###1 | 127/398 [00:00<00:00, 5474.07it/s, Materializing param=text_model.encoder.layers.7.self_attn.out_proj.weight] | |
| Loading weights: 32%|###1 | 127/398 [00:00<00:00, 5439.08it/s, Materializing param=text_model.encoder.layers.7.self_attn.out_proj.weight] | |
| Loading weights: 32%|###2 | 128/398 [00:00<00:00, 5431.94it/s, Materializing param=text_model.encoder.layers.7.self_attn.q_proj.bias] | |
| Loading weights: 32%|###2 | 128/398 [00:00<00:00, 5416.21it/s, Materializing param=text_model.encoder.layers.7.self_attn.q_proj.bias] | |
| Loading weights: 32%|###2 | 129/398 [00:00<00:00, 5432.55it/s, Materializing param=text_model.encoder.layers.7.self_attn.q_proj.weight] | |
| Loading weights: 32%|###2 | 129/398 [00:00<00:00, 5420.52it/s, Materializing param=text_model.encoder.layers.7.self_attn.q_proj.weight] | |
| Loading weights: 33%|###2 | 130/398 [00:00<00:00, 5439.49it/s, Materializing param=text_model.encoder.layers.7.self_attn.v_proj.bias] | |
| Loading weights: 33%|###2 | 130/398 [00:00<00:00, 5428.38it/s, Materializing param=text_model.encoder.layers.7.self_attn.v_proj.bias] | |
| Loading weights: 33%|###2 | 131/398 [00:00<00:00, 5449.63it/s, Materializing param=text_model.encoder.layers.7.self_attn.v_proj.weight] | |
| Loading weights: 33%|###2 | 131/398 [00:00<00:00, 5435.24it/s, Materializing param=text_model.encoder.layers.7.self_attn.v_proj.weight] | |
| Loading weights: 33%|###3 | 132/398 [00:00<00:00, 5451.87it/s, Materializing param=text_model.encoder.layers.8.layer_norm1.bias] | |
| Loading weights: 33%|###3 | 132/398 [00:00<00:00, 5438.85it/s, Materializing param=text_model.encoder.layers.8.layer_norm1.bias] | |
| Loading weights: 33%|###3 | 133/398 [00:00<00:00, 5457.91it/s, Materializing param=text_model.encoder.layers.8.layer_norm1.weight] | |
| Loading weights: 33%|###3 | 133/398 [00:00<00:00, 5446.62it/s, Materializing param=text_model.encoder.layers.8.layer_norm1.weight] | |
| Loading weights: 34%|###3 | 134/398 [00:00<00:00, 5462.45it/s, Materializing param=text_model.encoder.layers.8.layer_norm2.bias] | |
| Loading weights: 34%|###3 | 134/398 [00:00<00:00, 5449.37it/s, Materializing param=text_model.encoder.layers.8.layer_norm2.bias] | |
| Loading weights: 34%|###3 | 135/398 [00:00<00:00, 5470.36it/s, Materializing param=text_model.encoder.layers.8.layer_norm2.weight] | |
| Loading weights: 34%|###3 | 135/398 [00:00<00:00, 5460.23it/s, Materializing param=text_model.encoder.layers.8.layer_norm2.weight] | |
| Loading weights: 34%|###4 | 136/398 [00:00<00:00, 5481.91it/s, Materializing param=text_model.encoder.layers.8.mlp.fc1.bias] | |
| Loading weights: 34%|###4 | 136/398 [00:00<00:00, 5472.49it/s, Materializing param=text_model.encoder.layers.8.mlp.fc1.bias] | |
| Loading weights: 34%|###4 | 137/398 [00:00<00:00, 5493.60it/s, Materializing param=text_model.encoder.layers.8.mlp.fc1.weight] | |
| Loading weights: 34%|###4 | 137/398 [00:00<00:00, 5483.74it/s, Materializing param=text_model.encoder.layers.8.mlp.fc1.weight] | |
| Loading weights: 35%|###4 | 138/398 [00:00<00:00, 5505.75it/s, Materializing param=text_model.encoder.layers.8.mlp.fc2.bias] | |
| Loading weights: 35%|###4 | 138/398 [00:00<00:00, 5496.44it/s, Materializing param=text_model.encoder.layers.8.mlp.fc2.bias] | |
| Loading weights: 35%|###4 | 139/398 [00:00<00:00, 5518.61it/s, Materializing param=text_model.encoder.layers.8.mlp.fc2.weight] | |
| Loading weights: 35%|###4 | 139/398 [00:00<00:00, 5509.28it/s, Materializing param=text_model.encoder.layers.8.mlp.fc2.weight] | |
| Loading weights: 35%|###5 | 140/398 [00:00<00:00, 5531.51it/s, Materializing param=text_model.encoder.layers.8.self_attn.k_proj.bias] | |
| Loading weights: 35%|###5 | 140/398 [00:00<00:00, 5521.68it/s, Materializing param=text_model.encoder.layers.8.self_attn.k_proj.bias] | |
| Loading weights: 35%|###5 | 141/398 [00:00<00:00, 5543.24it/s, Materializing param=text_model.encoder.layers.8.self_attn.k_proj.weight] | |
| Loading weights: 35%|###5 | 141/398 [00:00<00:00, 5533.75it/s, Materializing param=text_model.encoder.layers.8.self_attn.k_proj.weight] | |
| Loading weights: 36%|###5 | 142/398 [00:00<00:00, 5555.16it/s, Materializing param=text_model.encoder.layers.8.self_attn.out_proj.bias] | |
| Loading weights: 36%|###5 | 142/398 [00:00<00:00, 5545.64it/s, Materializing param=text_model.encoder.layers.8.self_attn.out_proj.bias] | |
| Loading weights: 36%|###5 | 143/398 [00:00<00:00, 5566.92it/s, Materializing param=text_model.encoder.layers.8.self_attn.out_proj.weight] | |
| Loading weights: 36%|###5 | 143/398 [00:00<00:00, 5557.63it/s, Materializing param=text_model.encoder.layers.8.self_attn.out_proj.weight] | |
| Loading weights: 36%|###6 | 144/398 [00:00<00:00, 5579.13it/s, Materializing param=text_model.encoder.layers.8.self_attn.q_proj.bias] | |
| Loading weights: 36%|###6 | 144/398 [00:00<00:00, 5570.02it/s, Materializing param=text_model.encoder.layers.8.self_attn.q_proj.bias] | |
| Loading weights: 36%|###6 | 145/398 [00:00<00:00, 5591.38it/s, Materializing param=text_model.encoder.layers.8.self_attn.q_proj.weight] | |
| Loading weights: 36%|###6 | 145/398 [00:00<00:00, 5582.09it/s, Materializing param=text_model.encoder.layers.8.self_attn.q_proj.weight] | |
| Loading weights: 37%|###6 | 146/398 [00:00<00:00, 5601.87it/s, Materializing param=text_model.encoder.layers.8.self_attn.v_proj.bias] | |
| Loading weights: 37%|###6 | 146/398 [00:00<00:00, 5591.79it/s, Materializing param=text_model.encoder.layers.8.self_attn.v_proj.bias] | |
| Loading weights: 37%|###6 | 147/398 [00:00<00:00, 5612.77it/s, Materializing param=text_model.encoder.layers.8.self_attn.v_proj.weight] | |
| Loading weights: 37%|###6 | 147/398 [00:00<00:00, 5603.43it/s, Materializing param=text_model.encoder.layers.8.self_attn.v_proj.weight] | |
| Loading weights: 37%|###7 | 148/398 [00:00<00:00, 5624.53it/s, Materializing param=text_model.encoder.layers.9.layer_norm1.bias] | |
| Loading weights: 37%|###7 | 148/398 [00:00<00:00, 5614.81it/s, Materializing param=text_model.encoder.layers.9.layer_norm1.bias] | |
| Loading weights: 37%|###7 | 149/398 [00:00<00:00, 5635.27it/s, Materializing param=text_model.encoder.layers.9.layer_norm1.weight] | |
| Loading weights: 37%|###7 | 149/398 [00:00<00:00, 5625.73it/s, Materializing param=text_model.encoder.layers.9.layer_norm1.weight] | |
| Loading weights: 38%|###7 | 150/398 [00:00<00:00, 5644.38it/s, Materializing param=text_model.encoder.layers.9.layer_norm2.bias] | |
| Loading weights: 38%|###7 | 150/398 [00:00<00:00, 5634.93it/s, Materializing param=text_model.encoder.layers.9.layer_norm2.bias] | |
| Loading weights: 38%|###7 | 151/398 [00:00<00:00, 5655.02it/s, Materializing param=text_model.encoder.layers.9.layer_norm2.weight] | |
| Loading weights: 38%|###7 | 151/398 [00:00<00:00, 5645.29it/s, Materializing param=text_model.encoder.layers.9.layer_norm2.weight] | |
| Loading weights: 38%|###8 | 152/398 [00:00<00:00, 5664.25it/s, Materializing param=text_model.encoder.layers.9.mlp.fc1.bias] | |
| Loading weights: 38%|###8 | 152/398 [00:00<00:00, 5654.51it/s, Materializing param=text_model.encoder.layers.9.mlp.fc1.bias] | |
| Loading weights: 38%|###8 | 153/398 [00:00<00:00, 5673.19it/s, Materializing param=text_model.encoder.layers.9.mlp.fc1.weight] | |
| Loading weights: 38%|###8 | 153/398 [00:00<00:00, 5663.58it/s, Materializing param=text_model.encoder.layers.9.mlp.fc1.weight] | |
| Loading weights: 39%|###8 | 154/398 [00:00<00:00, 5683.14it/s, Materializing param=text_model.encoder.layers.9.mlp.fc2.bias] | |
| Loading weights: 39%|###8 | 154/398 [00:00<00:00, 5674.25it/s, Materializing param=text_model.encoder.layers.9.mlp.fc2.bias] | |
| Loading weights: 39%|###8 | 155/398 [00:00<00:00, 5693.89it/s, Materializing param=text_model.encoder.layers.9.mlp.fc2.weight] | |
| Loading weights: 39%|###8 | 155/398 [00:00<00:00, 5685.23it/s, Materializing param=text_model.encoder.layers.9.mlp.fc2.weight] | |
| Loading weights: 39%|###9 | 156/398 [00:00<00:00, 5706.04it/s, Materializing param=text_model.encoder.layers.9.self_attn.k_proj.bias] | |
| Loading weights: 39%|###9 | 156/398 [00:00<00:00, 5696.90it/s, Materializing param=text_model.encoder.layers.9.self_attn.k_proj.bias] | |
| Loading weights: 39%|###9 | 157/398 [00:00<00:00, 5715.55it/s, Materializing param=text_model.encoder.layers.9.self_attn.k_proj.weight] | |
| Loading weights: 39%|###9 | 157/398 [00:00<00:00, 5705.79it/s, Materializing param=text_model.encoder.layers.9.self_attn.k_proj.weight] | |
| Loading weights: 40%|###9 | 158/398 [00:00<00:00, 5724.83it/s, Materializing param=text_model.encoder.layers.9.self_attn.out_proj.bias] | |
| Loading weights: 40%|###9 | 158/398 [00:00<00:00, 5715.44it/s, Materializing param=text_model.encoder.layers.9.self_attn.out_proj.bias] | |
| Loading weights: 40%|###9 | 159/398 [00:00<00:00, 5733.82it/s, Materializing param=text_model.encoder.layers.9.self_attn.out_proj.weight] | |
| Loading weights: 40%|###9 | 159/398 [00:00<00:00, 5723.48it/s, Materializing param=text_model.encoder.layers.9.self_attn.out_proj.weight] | |
| Loading weights: 40%|#### | 160/398 [00:00<00:00, 5742.18it/s, Materializing param=text_model.encoder.layers.9.self_attn.q_proj.bias] | |
| Loading weights: 40%|#### | 160/398 [00:00<00:00, 5733.01it/s, Materializing param=text_model.encoder.layers.9.self_attn.q_proj.bias] | |
| Loading weights: 40%|#### | 161/398 [00:00<00:00, 5751.94it/s, Materializing param=text_model.encoder.layers.9.self_attn.q_proj.weight] | |
| Loading weights: 40%|#### | 161/398 [00:00<00:00, 5742.74it/s, Materializing param=text_model.encoder.layers.9.self_attn.q_proj.weight] | |
| Loading weights: 41%|#### | 162/398 [00:00<00:00, 5759.45it/s, Materializing param=text_model.encoder.layers.9.self_attn.v_proj.bias] | |
| Loading weights: 41%|#### | 162/398 [00:00<00:00, 5751.51it/s, Materializing param=text_model.encoder.layers.9.self_attn.v_proj.bias] | |
| Loading weights: 41%|#### | 163/398 [00:00<00:00, 5769.53it/s, Materializing param=text_model.encoder.layers.9.self_attn.v_proj.weight] | |
| Loading weights: 41%|#### | 163/398 [00:00<00:00, 5761.60it/s, Materializing param=text_model.encoder.layers.9.self_attn.v_proj.weight] | |
| Loading weights: 41%|####1 | 164/398 [00:00<00:00, 5779.80it/s, Materializing param=text_model.encoder.layers.10.layer_norm1.bias] | |
| Loading weights: 41%|####1 | 164/398 [00:00<00:00, 5772.14it/s, Materializing param=text_model.encoder.layers.10.layer_norm1.bias] | |
| Loading weights: 41%|####1 | 165/398 [00:00<00:00, 5791.64it/s, Materializing param=text_model.encoder.layers.10.layer_norm1.weight] | |
| Loading weights: 41%|####1 | 165/398 [00:00<00:00, 5780.99it/s, Materializing param=text_model.encoder.layers.10.layer_norm1.weight] | |
| Loading weights: 42%|####1 | 166/398 [00:00<00:00, 5796.52it/s, Materializing param=text_model.encoder.layers.10.layer_norm2.bias] | |
| Loading weights: 42%|####1 | 166/398 [00:00<00:00, 5789.00it/s, Materializing param=text_model.encoder.layers.10.layer_norm2.bias] | |
| Loading weights: 42%|####1 | 167/398 [00:00<00:00, 5803.17it/s, Materializing param=text_model.encoder.layers.10.layer_norm2.weight] | |
| Loading weights: 42%|####1 | 167/398 [00:00<00:00, 5790.94it/s, Materializing param=text_model.encoder.layers.10.layer_norm2.weight] | |
| Loading weights: 42%|####2 | 168/398 [00:00<00:00, 5803.78it/s, Materializing param=text_model.encoder.layers.10.mlp.fc1.bias] | |
| Loading weights: 42%|####2 | 168/398 [00:00<00:00, 5791.52it/s, Materializing param=text_model.encoder.layers.10.mlp.fc1.bias] | |
| Loading weights: 42%|####2 | 169/398 [00:00<00:00, 5803.48it/s, Materializing param=text_model.encoder.layers.10.mlp.fc1.weight] | |
| Loading weights: 42%|####2 | 169/398 [00:00<00:00, 5791.72it/s, Materializing param=text_model.encoder.layers.10.mlp.fc1.weight] | |
| Loading weights: 43%|####2 | 170/398 [00:00<00:00, 5804.18it/s, Materializing param=text_model.encoder.layers.10.mlp.fc2.bias] | |
| Loading weights: 43%|####2 | 170/398 [00:00<00:00, 5792.20it/s, Materializing param=text_model.encoder.layers.10.mlp.fc2.bias] | |
| Loading weights: 43%|####2 | 171/398 [00:00<00:00, 5806.23it/s, Materializing param=text_model.encoder.layers.10.mlp.fc2.weight] | |
| Loading weights: 43%|####2 | 171/398 [00:00<00:00, 5796.19it/s, Materializing param=text_model.encoder.layers.10.mlp.fc2.weight] | |
| Loading weights: 43%|####3 | 172/398 [00:00<00:00, 5810.88it/s, Materializing param=text_model.encoder.layers.10.self_attn.k_proj.bias] | |
| Loading weights: 43%|####3 | 172/398 [00:00<00:00, 5800.46it/s, Materializing param=text_model.encoder.layers.10.self_attn.k_proj.bias] | |
| Loading weights: 43%|####3 | 173/398 [00:00<00:00, 5817.62it/s, Materializing param=text_model.encoder.layers.10.self_attn.k_proj.weight] | |
| Loading weights: 43%|####3 | 173/398 [00:00<00:00, 5809.05it/s, Materializing param=text_model.encoder.layers.10.self_attn.k_proj.weight] | |
| Loading weights: 44%|####3 | 174/398 [00:00<00:00, 5827.47it/s, Materializing param=text_model.encoder.layers.10.self_attn.out_proj.bias] | |
| Loading weights: 44%|####3 | 174/398 [00:00<00:00, 5819.75it/s, Materializing param=text_model.encoder.layers.10.self_attn.out_proj.bias] | |
| Loading weights: 44%|####3 | 175/398 [00:00<00:00, 5838.72it/s, Materializing param=text_model.encoder.layers.10.self_attn.out_proj.weight] | |
| Loading weights: 44%|####3 | 175/398 [00:00<00:00, 5830.10it/s, Materializing param=text_model.encoder.layers.10.self_attn.out_proj.weight] | |
| Loading weights: 44%|####4 | 176/398 [00:00<00:00, 5847.16it/s, Materializing param=text_model.encoder.layers.10.self_attn.q_proj.bias] | |
| Loading weights: 44%|####4 | 176/398 [00:00<00:00, 5839.20it/s, Materializing param=text_model.encoder.layers.10.self_attn.q_proj.bias] | |
| Loading weights: 44%|####4 | 177/398 [00:00<00:00, 5856.76it/s, Materializing param=text_model.encoder.layers.10.self_attn.q_proj.weight] | |
| Loading weights: 44%|####4 | 177/398 [00:00<00:00, 5846.71it/s, Materializing param=text_model.encoder.layers.10.self_attn.q_proj.weight] | |
| Loading weights: 45%|####4 | 178/398 [00:00<00:00, 5861.00it/s, Materializing param=text_model.encoder.layers.10.self_attn.v_proj.bias] | |
| Loading weights: 45%|####4 | 178/398 [00:00<00:00, 5850.90it/s, Materializing param=text_model.encoder.layers.10.self_attn.v_proj.bias] | |
| Loading weights: 45%|####4 | 179/398 [00:00<00:00, 5866.34it/s, Materializing param=text_model.encoder.layers.10.self_attn.v_proj.weight] | |
| Loading weights: 45%|####4 | 179/398 [00:00<00:00, 5854.95it/s, Materializing param=text_model.encoder.layers.10.self_attn.v_proj.weight] | |
| Loading weights: 45%|####5 | 180/398 [00:00<00:00, 5868.62it/s, Materializing param=text_model.encoder.layers.11.layer_norm1.bias] | |
| Loading weights: 45%|####5 | 180/398 [00:00<00:00, 5858.69it/s, Materializing param=text_model.encoder.layers.11.layer_norm1.bias] | |
| Loading weights: 45%|####5 | 181/398 [00:00<00:00, 5873.78it/s, Materializing param=text_model.encoder.layers.11.layer_norm1.weight] | |
| Loading weights: 45%|####5 | 181/398 [00:00<00:00, 5864.39it/s, Materializing param=text_model.encoder.layers.11.layer_norm1.weight] | |
| Loading weights: 46%|####5 | 182/398 [00:00<00:00, 5881.48it/s, Materializing param=text_model.encoder.layers.11.layer_norm2.bias] | |
| Loading weights: 46%|####5 | 182/398 [00:00<00:00, 5875.73it/s, Materializing param=text_model.encoder.layers.11.layer_norm2.bias] | |
| Loading weights: 46%|####5 | 183/398 [00:00<00:00, 5894.72it/s, Materializing param=text_model.encoder.layers.11.layer_norm2.weight] | |
| Loading weights: 46%|####5 | 183/398 [00:00<00:00, 5887.08it/s, Materializing param=text_model.encoder.layers.11.layer_norm2.weight] | |
| Loading weights: 46%|####6 | 184/398 [00:00<00:00, 5907.97it/s, Materializing param=text_model.encoder.layers.11.mlp.fc1.bias] | |
| Loading weights: 46%|####6 | 184/398 [00:00<00:00, 5902.18it/s, Materializing param=text_model.encoder.layers.11.mlp.fc1.bias] | |
| Loading weights: 46%|####6 | 185/398 [00:00<00:00, 5922.30it/s, Materializing param=text_model.encoder.layers.11.mlp.fc1.weight] | |
| Loading weights: 46%|####6 | 185/398 [00:00<00:00, 5622.07it/s, Materializing param=text_model.encoder.layers.11.mlp.fc1.weight] | |
| Loading weights: 47%|####6 | 186/398 [00:00<00:00, 5626.81it/s, Materializing param=text_model.encoder.layers.11.mlp.fc2.bias] | |
| Loading weights: 47%|####6 | 186/398 [00:00<00:00, 5618.75it/s, Materializing param=text_model.encoder.layers.11.mlp.fc2.bias] | |
| Loading weights: 47%|####6 | 187/398 [00:00<00:00, 5634.87it/s, Materializing param=text_model.encoder.layers.11.mlp.fc2.weight] | |
| Loading weights: 47%|####6 | 187/398 [00:00<00:00, 5627.88it/s, Materializing param=text_model.encoder.layers.11.mlp.fc2.weight] | |
| Loading weights: 47%|####7 | 188/398 [00:00<00:00, 5644.45it/s, Materializing param=text_model.encoder.layers.11.self_attn.k_proj.bias] | |
| Loading weights: 47%|####7 | 188/398 [00:00<00:00, 5636.94it/s, Materializing param=text_model.encoder.layers.11.self_attn.k_proj.bias] | |
| Loading weights: 47%|####7 | 189/398 [00:00<00:00, 5651.33it/s, Materializing param=text_model.encoder.layers.11.self_attn.k_proj.weight] | |
| Loading weights: 47%|####7 | 189/398 [00:00<00:00, 5645.01it/s, Materializing param=text_model.encoder.layers.11.self_attn.k_proj.weight] | |
| Loading weights: 48%|####7 | 190/398 [00:00<00:00, 5659.61it/s, Materializing param=text_model.encoder.layers.11.self_attn.out_proj.bias] | |
| Loading weights: 48%|####7 | 190/398 [00:00<00:00, 5653.50it/s, Materializing param=text_model.encoder.layers.11.self_attn.out_proj.bias] | |
| Loading weights: 48%|####7 | 191/398 [00:00<00:00, 5667.86it/s, Materializing param=text_model.encoder.layers.11.self_attn.out_proj.weight] | |
| Loading weights: 48%|####7 | 191/398 [00:00<00:00, 5660.09it/s, Materializing param=text_model.encoder.layers.11.self_attn.out_proj.weight] | |
| Loading weights: 48%|####8 | 192/398 [00:00<00:00, 5676.09it/s, Materializing param=text_model.encoder.layers.11.self_attn.q_proj.bias] | |
| Loading weights: 48%|####8 | 192/398 [00:00<00:00, 5667.86it/s, Materializing param=text_model.encoder.layers.11.self_attn.q_proj.bias] | |
| Loading weights: 48%|####8 | 193/398 [00:00<00:00, 5684.26it/s, Materializing param=text_model.encoder.layers.11.self_attn.q_proj.weight] | |
| Loading weights: 48%|####8 | 193/398 [00:00<00:00, 5677.28it/s, Materializing param=text_model.encoder.layers.11.self_attn.q_proj.weight] | |
| Loading weights: 49%|####8 | 194/398 [00:00<00:00, 5693.36it/s, Materializing param=text_model.encoder.layers.11.self_attn.v_proj.bias] | |
| Loading weights: 49%|####8 | 194/398 [00:00<00:00, 5686.32it/s, Materializing param=text_model.encoder.layers.11.self_attn.v_proj.bias] | |
| Loading weights: 49%|####8 | 195/398 [00:00<00:00, 5702.56it/s, Materializing param=text_model.encoder.layers.11.self_attn.v_proj.weight] | |
| Loading weights: 49%|####8 | 195/398 [00:00<00:00, 5695.65it/s, Materializing param=text_model.encoder.layers.11.self_attn.v_proj.weight] | |
| Loading weights: 49%|####9 | 196/398 [00:00<00:00, 5712.01it/s, Materializing param=text_model.final_layer_norm.bias] | |
| Loading weights: 49%|####9 | 196/398 [00:00<00:00, 5705.03it/s, Materializing param=text_model.final_layer_norm.bias] | |
| Loading weights: 49%|####9 | 197/398 [00:00<00:00, 5721.99it/s, Materializing param=text_model.final_layer_norm.weight] | |
| Loading weights: 49%|####9 | 197/398 [00:00<00:00, 5715.42it/s, Materializing param=text_model.final_layer_norm.weight] | |
| Loading weights: 50%|####9 | 198/398 [00:00<00:00, 5732.81it/s, Materializing param=text_projection.weight] | |
| Loading weights: 50%|####9 | 198/398 [00:00<00:00, 5726.25it/s, Materializing param=text_projection.weight] | |
| Loading weights: 50%|##### | 199/398 [00:00<00:00, 5743.88it/s, Materializing param=vision_model.embeddings.class_embedding] | |
| Loading weights: 50%|##### | 199/398 [00:00<00:00, 5737.09it/s, Materializing param=vision_model.embeddings.class_embedding] | |
| Loading weights: 50%|##### | 200/398 [00:00<00:00, 5753.07it/s, Materializing param=vision_model.embeddings.patch_embedding.weight] | |
| Loading weights: 50%|##### | 200/398 [00:00<00:00, 5746.21it/s, Materializing param=vision_model.embeddings.patch_embedding.weight] | |
| Loading weights: 51%|##### | 201/398 [00:00<00:00, 5762.12it/s, Materializing param=vision_model.embeddings.position_embedding.weight] | |
| Loading weights: 51%|##### | 201/398 [00:00<00:00, 5751.78it/s, Materializing param=vision_model.embeddings.position_embedding.weight] | |
| Loading weights: 51%|##### | 202/398 [00:00<00:00, 5767.92it/s, Materializing param=vision_model.encoder.layers.0.layer_norm1.bias] | |
| Loading weights: 51%|##### | 202/398 [00:00<00:00, 5761.29it/s, Materializing param=vision_model.encoder.layers.0.layer_norm1.bias] | |
| Loading weights: 51%|#####1 | 203/398 [00:00<00:00, 5776.89it/s, Materializing param=vision_model.encoder.layers.0.layer_norm1.weight] | |
| Loading weights: 51%|#####1 | 203/398 [00:00<00:00, 5770.00it/s, Materializing param=vision_model.encoder.layers.0.layer_norm1.weight] | |
| Loading weights: 51%|#####1 | 204/398 [00:00<00:00, 5785.87it/s, Materializing param=vision_model.encoder.layers.0.layer_norm2.bias] | |
| Loading weights: 51%|#####1 | 204/398 [00:00<00:00, 5778.88it/s, Materializing param=vision_model.encoder.layers.0.layer_norm2.bias] | |
| Loading weights: 52%|#####1 | 205/398 [00:00<00:00, 5794.96it/s, Materializing param=vision_model.encoder.layers.0.layer_norm2.weight] | |
| Loading weights: 52%|#####1 | 205/398 [00:00<00:00, 5787.08it/s, Materializing param=vision_model.encoder.layers.0.layer_norm2.weight] | |
| Loading weights: 52%|#####1 | 206/398 [00:00<00:00, 5802.93it/s, Materializing param=vision_model.encoder.layers.0.mlp.fc1.bias] | |
| Loading weights: 52%|#####1 | 206/398 [00:00<00:00, 5796.07it/s, Materializing param=vision_model.encoder.layers.0.mlp.fc1.bias] | |
| Loading weights: 52%|#####2 | 207/398 [00:00<00:00, 5811.42it/s, Materializing param=vision_model.encoder.layers.0.mlp.fc1.weight] | |
| Loading weights: 52%|#####2 | 207/398 [00:00<00:00, 5804.86it/s, Materializing param=vision_model.encoder.layers.0.mlp.fc1.weight] | |
| Loading weights: 52%|#####2 | 208/398 [00:00<00:00, 5820.41it/s, Materializing param=vision_model.encoder.layers.0.mlp.fc2.bias] | |
| Loading weights: 52%|#####2 | 208/398 [00:00<00:00, 5813.62it/s, Materializing param=vision_model.encoder.layers.0.mlp.fc2.bias] | |
| Loading weights: 53%|#####2 | 209/398 [00:00<00:00, 5828.71it/s, Materializing param=vision_model.encoder.layers.0.mlp.fc2.weight] | |
| Loading weights: 53%|#####2 | 209/398 [00:00<00:00, 5821.71it/s, Materializing param=vision_model.encoder.layers.0.mlp.fc2.weight] | |
| Loading weights: 53%|#####2 | 210/398 [00:00<00:00, 5836.54it/s, Materializing param=vision_model.encoder.layers.0.self_attn.k_proj.bias] | |
| Loading weights: 53%|#####2 | 210/398 [00:00<00:00, 5829.62it/s, Materializing param=vision_model.encoder.layers.0.self_attn.k_proj.bias] | |
| Loading weights: 53%|#####3 | 211/398 [00:00<00:00, 5844.93it/s, Materializing param=vision_model.encoder.layers.0.self_attn.k_proj.weight] | |
| Loading weights: 53%|#####3 | 211/398 [00:00<00:00, 5838.26it/s, Materializing param=vision_model.encoder.layers.0.self_attn.k_proj.weight] | |
| Loading weights: 53%|#####3 | 212/398 [00:00<00:00, 5853.57it/s, Materializing param=vision_model.encoder.layers.0.self_attn.out_proj.bias] | |
| Loading weights: 53%|#####3 | 212/398 [00:00<00:00, 5846.76it/s, Materializing param=vision_model.encoder.layers.0.self_attn.out_proj.bias] | |
| Loading weights: 54%|#####3 | 213/398 [00:00<00:00, 5861.89it/s, Materializing param=vision_model.encoder.layers.0.self_attn.out_proj.weight] | |
| Loading weights: 54%|#####3 | 213/398 [00:00<00:00, 5855.28it/s, Materializing param=vision_model.encoder.layers.0.self_attn.out_proj.weight] | |
| Loading weights: 54%|#####3 | 214/398 [00:00<00:00, 5870.34it/s, Materializing param=vision_model.encoder.layers.0.self_attn.q_proj.bias] | |
| Loading weights: 54%|#####3 | 214/398 [00:00<00:00, 5863.63it/s, Materializing param=vision_model.encoder.layers.0.self_attn.q_proj.bias] | |
| Loading weights: 54%|#####4 | 215/398 [00:00<00:00, 5878.78it/s, Materializing param=vision_model.encoder.layers.0.self_attn.q_proj.weight] | |
| Loading weights: 54%|#####4 | 215/398 [00:00<00:00, 5872.39it/s, Materializing param=vision_model.encoder.layers.0.self_attn.q_proj.weight] | |
| Loading weights: 54%|#####4 | 216/398 [00:00<00:00, 5887.51it/s, Materializing param=vision_model.encoder.layers.0.self_attn.v_proj.bias] | |
| Loading weights: 54%|#####4 | 216/398 [00:00<00:00, 5880.93it/s, Materializing param=vision_model.encoder.layers.0.self_attn.v_proj.bias] | |
| Loading weights: 55%|#####4 | 217/398 [00:00<00:00, 5896.07it/s, Materializing param=vision_model.encoder.layers.0.self_attn.v_proj.weight] | |
| Loading weights: 55%|#####4 | 217/398 [00:00<00:00, 5889.16it/s, Materializing param=vision_model.encoder.layers.0.self_attn.v_proj.weight] | |
| Loading weights: 55%|#####4 | 218/398 [00:00<00:00, 5903.85it/s, Materializing param=vision_model.encoder.layers.1.layer_norm1.bias] | |
| Loading weights: 55%|#####4 | 218/398 [00:00<00:00, 5897.60it/s, Materializing param=vision_model.encoder.layers.1.layer_norm1.bias] | |
| Loading weights: 55%|#####5 | 219/398 [00:00<00:00, 5912.07it/s, Materializing param=vision_model.encoder.layers.1.layer_norm1.weight] | |
| Loading weights: 55%|#####5 | 219/398 [00:00<00:00, 5905.42it/s, Materializing param=vision_model.encoder.layers.1.layer_norm1.weight] | |
| Loading weights: 55%|#####5 | 220/398 [00:00<00:00, 5920.55it/s, Materializing param=vision_model.encoder.layers.1.layer_norm2.bias] | |
| Loading weights: 55%|#####5 | 220/398 [00:00<00:00, 5913.49it/s, Materializing param=vision_model.encoder.layers.1.layer_norm2.bias] | |
| Loading weights: 56%|#####5 | 221/398 [00:00<00:00, 5928.29it/s, Materializing param=vision_model.encoder.layers.1.layer_norm2.weight] | |
| Loading weights: 56%|#####5 | 221/398 [00:00<00:00, 5921.51it/s, Materializing param=vision_model.encoder.layers.1.layer_norm2.weight] | |
| Loading weights: 56%|#####5 | 222/398 [00:00<00:00, 5936.40it/s, Materializing param=vision_model.encoder.layers.1.mlp.fc1.bias] | |
| Loading weights: 56%|#####5 | 222/398 [00:00<00:00, 5928.76it/s, Materializing param=vision_model.encoder.layers.1.mlp.fc1.bias] | |
| Loading weights: 56%|#####6 | 223/398 [00:00<00:00, 5942.71it/s, Materializing param=vision_model.encoder.layers.1.mlp.fc1.weight] | |
| Loading weights: 56%|#####6 | 223/398 [00:00<00:00, 5936.38it/s, Materializing param=vision_model.encoder.layers.1.mlp.fc1.weight] | |
| Loading weights: 56%|#####6 | 224/398 [00:00<00:00, 5951.36it/s, Materializing param=vision_model.encoder.layers.1.mlp.fc2.bias] | |
| Loading weights: 56%|#####6 | 224/398 [00:00<00:00, 5944.81it/s, Materializing param=vision_model.encoder.layers.1.mlp.fc2.bias] | |
| Loading weights: 57%|#####6 | 225/398 [00:00<00:00, 5959.32it/s, Materializing param=vision_model.encoder.layers.1.mlp.fc2.weight] | |
| Loading weights: 57%|#####6 | 225/398 [00:00<00:00, 5953.08it/s, Materializing param=vision_model.encoder.layers.1.mlp.fc2.weight] | |
| Loading weights: 57%|#####6 | 226/398 [00:00<00:00, 5967.87it/s, Materializing param=vision_model.encoder.layers.1.self_attn.k_proj.bias] | |
| Loading weights: 57%|#####6 | 226/398 [00:00<00:00, 5961.15it/s, Materializing param=vision_model.encoder.layers.1.self_attn.k_proj.bias] | |
| Loading weights: 57%|#####7 | 227/398 [00:00<00:00, 5975.43it/s, Materializing param=vision_model.encoder.layers.1.self_attn.k_proj.weight] | |
| Loading weights: 57%|#####7 | 227/398 [00:00<00:00, 5968.91it/s, Materializing param=vision_model.encoder.layers.1.self_attn.k_proj.weight] | |
| Loading weights: 57%|#####7 | 228/398 [00:00<00:00, 5983.20it/s, Materializing param=vision_model.encoder.layers.1.self_attn.out_proj.bias] | |
| Loading weights: 57%|#####7 | 228/398 [00:00<00:00, 5976.81it/s, Materializing param=vision_model.encoder.layers.1.self_attn.out_proj.bias] | |
| Loading weights: 58%|#####7 | 229/398 [00:00<00:00, 5990.93it/s, Materializing param=vision_model.encoder.layers.1.self_attn.out_proj.weight] | |
| Loading weights: 58%|#####7 | 229/398 [00:00<00:00, 5984.32it/s, Materializing param=vision_model.encoder.layers.1.self_attn.out_proj.weight] | |
| Loading weights: 58%|#####7 | 230/398 [00:00<00:00, 5998.01it/s, Materializing param=vision_model.encoder.layers.1.self_attn.q_proj.bias] | |
| Loading weights: 58%|#####7 | 230/398 [00:00<00:00, 5991.42it/s, Materializing param=vision_model.encoder.layers.1.self_attn.q_proj.bias] | |
| Loading weights: 58%|#####8 | 231/398 [00:00<00:00, 6005.05it/s, Materializing param=vision_model.encoder.layers.1.self_attn.q_proj.weight] | |
| Loading weights: 58%|#####8 | 231/398 [00:00<00:00, 5998.43it/s, Materializing param=vision_model.encoder.layers.1.self_attn.q_proj.weight] | |
| Loading weights: 58%|#####8 | 232/398 [00:00<00:00, 6012.08it/s, Materializing param=vision_model.encoder.layers.1.self_attn.v_proj.bias] | |
| Loading weights: 58%|#####8 | 232/398 [00:00<00:00, 6005.47it/s, Materializing param=vision_model.encoder.layers.1.self_attn.v_proj.bias] | |
| Loading weights: 59%|#####8 | 233/398 [00:00<00:00, 6019.17it/s, Materializing param=vision_model.encoder.layers.1.self_attn.v_proj.weight] | |
| Loading weights: 59%|#####8 | 233/398 [00:00<00:00, 6012.80it/s, Materializing param=vision_model.encoder.layers.1.self_attn.v_proj.weight] | |
| Loading weights: 59%|#####8 | 234/398 [00:00<00:00, 6026.59it/s, Materializing param=vision_model.encoder.layers.2.layer_norm1.bias] | |
| Loading weights: 59%|#####8 | 234/398 [00:00<00:00, 6020.83it/s, Materializing param=vision_model.encoder.layers.2.layer_norm1.bias] | |
| Loading weights: 59%|#####9 | 235/398 [00:00<00:00, 6036.19it/s, Materializing param=vision_model.encoder.layers.2.layer_norm1.weight] | |
| Loading weights: 59%|#####9 | 235/398 [00:00<00:00, 6028.18it/s, Materializing param=vision_model.encoder.layers.2.layer_norm1.weight] | |
| Loading weights: 59%|#####9 | 236/398 [00:00<00:00, 6015.28it/s, Materializing param=vision_model.encoder.layers.2.layer_norm2.bias] | |
| Loading weights: 59%|#####9 | 236/398 [00:00<00:00, 5998.43it/s, Materializing param=vision_model.encoder.layers.2.layer_norm2.bias] | |
| Loading weights: 60%|#####9 | 237/398 [00:00<00:00, 6001.96it/s, Materializing param=vision_model.encoder.layers.2.layer_norm2.weight] | |
| Loading weights: 60%|#####9 | 237/398 [00:00<00:00, 5992.77it/s, Materializing param=vision_model.encoder.layers.2.layer_norm2.weight] | |
| Loading weights: 60%|#####9 | 238/398 [00:00<00:00, 6001.88it/s, Materializing param=vision_model.encoder.layers.2.mlp.fc1.bias] | |
| Loading weights: 60%|#####9 | 238/398 [00:00<00:00, 5993.34it/s, Materializing param=vision_model.encoder.layers.2.mlp.fc1.bias] | |
| Loading weights: 60%|###### | 239/398 [00:00<00:00, 6000.43it/s, Materializing param=vision_model.encoder.layers.2.mlp.fc1.weight] | |
| Loading weights: 60%|###### | 239/398 [00:00<00:00, 5992.08it/s, Materializing param=vision_model.encoder.layers.2.mlp.fc1.weight] | |
| Loading weights: 60%|###### | 240/398 [00:00<00:00, 6000.61it/s, Materializing param=vision_model.encoder.layers.2.mlp.fc2.bias] | |
| Loading weights: 60%|###### | 240/398 [00:00<00:00, 5992.36it/s, Materializing param=vision_model.encoder.layers.2.mlp.fc2.bias] | |
| Loading weights: 61%|###### | 241/398 [00:00<00:00, 6000.72it/s, Materializing param=vision_model.encoder.layers.2.mlp.fc2.weight] | |
| Loading weights: 61%|###### | 241/398 [00:00<00:00, 5990.34it/s, Materializing param=vision_model.encoder.layers.2.mlp.fc2.weight] | |
| Loading weights: 61%|###### | 242/398 [00:00<00:00, 5997.63it/s, Materializing param=vision_model.encoder.layers.2.self_attn.k_proj.bias] | |
| Loading weights: 61%|###### | 242/398 [00:00<00:00, 5989.11it/s, Materializing param=vision_model.encoder.layers.2.self_attn.k_proj.bias] | |
| Loading weights: 61%|######1 | 243/398 [00:00<00:00, 5997.19it/s, Materializing param=vision_model.encoder.layers.2.self_attn.k_proj.weight] | |
| Loading weights: 61%|######1 | 243/398 [00:00<00:00, 5987.29it/s, Materializing param=vision_model.encoder.layers.2.self_attn.k_proj.weight] | |
| Loading weights: 61%|######1 | 244/398 [00:00<00:00, 5994.14it/s, Materializing param=vision_model.encoder.layers.2.self_attn.out_proj.bias] | |
| Loading weights: 61%|######1 | 244/398 [00:00<00:00, 5982.51it/s, Materializing param=vision_model.encoder.layers.2.self_attn.out_proj.bias] | |
| Loading weights: 62%|######1 | 245/398 [00:00<00:00, 5989.84it/s, Materializing param=vision_model.encoder.layers.2.self_attn.out_proj.weight] | |
| Loading weights: 62%|######1 | 245/398 [00:00<00:00, 5979.90it/s, Materializing param=vision_model.encoder.layers.2.self_attn.out_proj.weight] | |
| Loading weights: 62%|######1 | 246/398 [00:00<00:00, 5989.22it/s, Materializing param=vision_model.encoder.layers.2.self_attn.q_proj.bias] | |
| Loading weights: 62%|######1 | 246/398 [00:00<00:00, 5982.27it/s, Materializing param=vision_model.encoder.layers.2.self_attn.q_proj.bias] | |
| Loading weights: 62%|######2 | 247/398 [00:00<00:00, 5991.10it/s, Materializing param=vision_model.encoder.layers.2.self_attn.q_proj.weight] | |
| Loading weights: 62%|######2 | 247/398 [00:00<00:00, 5982.00it/s, Materializing param=vision_model.encoder.layers.2.self_attn.q_proj.weight] | |
| Loading weights: 62%|######2 | 248/398 [00:00<00:00, 5990.03it/s, Materializing param=vision_model.encoder.layers.2.self_attn.v_proj.bias] | |
| Loading weights: 62%|######2 | 248/398 [00:00<00:00, 5972.63it/s, Materializing param=vision_model.encoder.layers.2.self_attn.v_proj.bias] | |
| Loading weights: 63%|######2 | 249/398 [00:00<00:00, 5980.13it/s, Materializing param=vision_model.encoder.layers.2.self_attn.v_proj.weight] | |
| Loading weights: 63%|######2 | 249/398 [00:00<00:00, 5971.27it/s, Materializing param=vision_model.encoder.layers.2.self_attn.v_proj.weight] | |
| Loading weights: 63%|######2 | 250/398 [00:00<00:00, 5981.40it/s, Materializing param=vision_model.encoder.layers.3.layer_norm1.bias] | |
| Loading weights: 63%|######2 | 250/398 [00:00<00:00, 5975.75it/s, Materializing param=vision_model.encoder.layers.3.layer_norm1.bias] | |
| Loading weights: 63%|######3 | 251/398 [00:00<00:00, 5983.86it/s, Materializing param=vision_model.encoder.layers.3.layer_norm1.weight] | |
| Loading weights: 63%|######3 | 251/398 [00:00<00:00, 5975.20it/s, Materializing param=vision_model.encoder.layers.3.layer_norm1.weight] | |
| Loading weights: 63%|######3 | 252/398 [00:00<00:00, 5983.62it/s, Materializing param=vision_model.encoder.layers.3.layer_norm2.bias] | |
| Loading weights: 63%|######3 | 252/398 [00:00<00:00, 5973.81it/s, Materializing param=vision_model.encoder.layers.3.layer_norm2.bias] | |
| Loading weights: 64%|######3 | 253/398 [00:00<00:00, 5979.27it/s, Materializing param=vision_model.encoder.layers.3.layer_norm2.weight] | |
| Loading weights: 64%|######3 | 253/398 [00:00<00:00, 5969.41it/s, Materializing param=vision_model.encoder.layers.3.layer_norm2.weight] | |
| Loading weights: 64%|######3 | 254/398 [00:00<00:00, 5974.83it/s, Materializing param=vision_model.encoder.layers.3.mlp.fc1.bias] | |
| Loading weights: 64%|######3 | 254/398 [00:00<00:00, 5965.32it/s, Materializing param=vision_model.encoder.layers.3.mlp.fc1.bias] | |
| Loading weights: 64%|######4 | 255/398 [00:00<00:00, 5970.42it/s, Materializing param=vision_model.encoder.layers.3.mlp.fc1.weight] | |
| Loading weights: 64%|######4 | 255/398 [00:00<00:00, 5959.35it/s, Materializing param=vision_model.encoder.layers.3.mlp.fc1.weight] | |
| Loading weights: 64%|######4 | 256/398 [00:00<00:00, 5962.78it/s, Materializing param=vision_model.encoder.layers.3.mlp.fc2.bias] | |
| Loading weights: 64%|######4 | 256/398 [00:00<00:00, 5952.73it/s, Materializing param=vision_model.encoder.layers.3.mlp.fc2.bias] | |
| Loading weights: 65%|######4 | 257/398 [00:00<00:00, 5957.75it/s, Materializing param=vision_model.encoder.layers.3.mlp.fc2.weight] | |
| Loading weights: 65%|######4 | 257/398 [00:00<00:00, 5948.58it/s, Materializing param=vision_model.encoder.layers.3.mlp.fc2.weight] | |
| Loading weights: 65%|######4 | 258/398 [00:00<00:00, 5955.10it/s, Materializing param=vision_model.encoder.layers.3.self_attn.k_proj.bias] | |
| Loading weights: 65%|######4 | 258/398 [00:00<00:00, 5946.26it/s, Materializing param=vision_model.encoder.layers.3.self_attn.k_proj.bias] | |
| Loading weights: 65%|######5 | 259/398 [00:00<00:00, 5953.51it/s, Materializing param=vision_model.encoder.layers.3.self_attn.k_proj.weight] | |
| Loading weights: 65%|######5 | 259/398 [00:00<00:00, 5944.55it/s, Materializing param=vision_model.encoder.layers.3.self_attn.k_proj.weight] | |
| Loading weights: 65%|######5 | 260/398 [00:00<00:00, 5949.95it/s, Materializing param=vision_model.encoder.layers.3.self_attn.out_proj.bias] | |
| Loading weights: 65%|######5 | 260/398 [00:00<00:00, 5941.62it/s, Materializing param=vision_model.encoder.layers.3.self_attn.out_proj.bias] | |
| Loading weights: 66%|######5 | 261/398 [00:00<00:00, 5948.59it/s, Materializing param=vision_model.encoder.layers.3.self_attn.out_proj.weight] | |
| Loading weights: 66%|######5 | 261/398 [00:00<00:00, 5940.13it/s, Materializing param=vision_model.encoder.layers.3.self_attn.out_proj.weight] | |
| Loading weights: 66%|######5 | 262/398 [00:00<00:00, 5946.98it/s, Materializing param=vision_model.encoder.layers.3.self_attn.q_proj.bias] | |
| Loading weights: 66%|######5 | 262/398 [00:00<00:00, 5938.44it/s, Materializing param=vision_model.encoder.layers.3.self_attn.q_proj.bias] | |
| Loading weights: 66%|######6 | 263/398 [00:00<00:00, 5945.78it/s, Materializing param=vision_model.encoder.layers.3.self_attn.q_proj.weight] | |
| Loading weights: 66%|######6 | 263/398 [00:00<00:00, 5937.17it/s, Materializing param=vision_model.encoder.layers.3.self_attn.q_proj.weight] | |
| Loading weights: 66%|######6 | 264/398 [00:00<00:00, 5944.38it/s, Materializing param=vision_model.encoder.layers.3.self_attn.v_proj.bias] | |
| Loading weights: 66%|######6 | 264/398 [00:00<00:00, 5936.00it/s, Materializing param=vision_model.encoder.layers.3.self_attn.v_proj.bias] | |
| Loading weights: 67%|######6 | 265/398 [00:00<00:00, 5943.48it/s, Materializing param=vision_model.encoder.layers.3.self_attn.v_proj.weight] | |
| Loading weights: 67%|######6 | 265/398 [00:00<00:00, 5935.29it/s, Materializing param=vision_model.encoder.layers.3.self_attn.v_proj.weight] | |
| Loading weights: 67%|######6 | 266/398 [00:00<00:00, 5941.26it/s, Materializing param=vision_model.encoder.layers.4.layer_norm1.bias] | |
| Loading weights: 67%|######6 | 266/398 [00:00<00:00, 5932.95it/s, Materializing param=vision_model.encoder.layers.4.layer_norm1.bias] | |
| Loading weights: 67%|######7 | 267/398 [00:00<00:00, 5940.31it/s, Materializing param=vision_model.encoder.layers.4.layer_norm1.weight] | |
| Loading weights: 67%|######7 | 267/398 [00:00<00:00, 5932.07it/s, Materializing param=vision_model.encoder.layers.4.layer_norm1.weight] | |
| Loading weights: 67%|######7 | 268/398 [00:00<00:00, 5940.03it/s, Materializing param=vision_model.encoder.layers.4.layer_norm2.bias] | |
| Loading weights: 67%|######7 | 268/398 [00:00<00:00, 5924.66it/s, Materializing param=vision_model.encoder.layers.4.layer_norm2.bias] | |
| Loading weights: 68%|######7 | 269/398 [00:00<00:00, 5918.81it/s, Materializing param=vision_model.encoder.layers.4.layer_norm2.weight] | |
| Loading weights: 68%|######7 | 269/398 [00:00<00:00, 5909.79it/s, Materializing param=vision_model.encoder.layers.4.layer_norm2.weight] | |
| Loading weights: 68%|######7 | 270/398 [00:00<00:00, 5919.11it/s, Materializing param=vision_model.encoder.layers.4.mlp.fc1.bias] | |
| Loading weights: 68%|######7 | 270/398 [00:00<00:00, 5912.90it/s, Materializing param=vision_model.encoder.layers.4.mlp.fc1.bias] | |
| Loading weights: 68%|######8 | 271/398 [00:00<00:00, 5923.48it/s, Materializing param=vision_model.encoder.layers.4.mlp.fc1.weight] | |
| Loading weights: 68%|######8 | 271/398 [00:00<00:00, 5917.93it/s, Materializing param=vision_model.encoder.layers.4.mlp.fc1.weight] | |
| Loading weights: 68%|######8 | 272/398 [00:00<00:00, 5929.45it/s, Materializing param=vision_model.encoder.layers.4.mlp.fc2.bias] | |
| Loading weights: 68%|######8 | 272/398 [00:00<00:00, 5923.73it/s, Materializing param=vision_model.encoder.layers.4.mlp.fc2.bias] | |
| Loading weights: 69%|######8 | 273/398 [00:00<00:00, 5935.12it/s, Materializing param=vision_model.encoder.layers.4.mlp.fc2.weight] | |
| Loading weights: 69%|######8 | 273/398 [00:00<00:00, 5929.65it/s, Materializing param=vision_model.encoder.layers.4.mlp.fc2.weight] | |
| Loading weights: 69%|######8 | 274/398 [00:00<00:00, 5941.34it/s, Materializing param=vision_model.encoder.layers.4.self_attn.k_proj.bias] | |
| Loading weights: 69%|######8 | 274/398 [00:00<00:00, 5935.79it/s, Materializing param=vision_model.encoder.layers.4.self_attn.k_proj.bias] | |
| Loading weights: 69%|######9 | 275/398 [00:00<00:00, 5946.70it/s, Materializing param=vision_model.encoder.layers.4.self_attn.k_proj.weight] | |
| Loading weights: 69%|######9 | 275/398 [00:00<00:00, 5940.79it/s, Materializing param=vision_model.encoder.layers.4.self_attn.k_proj.weight] | |
| Loading weights: 69%|######9 | 276/398 [00:00<00:00, 5952.00it/s, Materializing param=vision_model.encoder.layers.4.self_attn.out_proj.bias] | |
| Loading weights: 69%|######9 | 276/398 [00:00<00:00, 5946.49it/s, Materializing param=vision_model.encoder.layers.4.self_attn.out_proj.bias] | |
| Loading weights: 70%|######9 | 277/398 [00:00<00:00, 5958.03it/s, Materializing param=vision_model.encoder.layers.4.self_attn.out_proj.weight] | |
| Loading weights: 70%|######9 | 277/398 [00:00<00:00, 5952.48it/s, Materializing param=vision_model.encoder.layers.4.self_attn.out_proj.weight] | |
| Loading weights: 70%|######9 | 278/398 [00:00<00:00, 5964.07it/s, Materializing param=vision_model.encoder.layers.4.self_attn.q_proj.bias] | |
| Loading weights: 70%|######9 | 278/398 [00:00<00:00, 5958.49it/s, Materializing param=vision_model.encoder.layers.4.self_attn.q_proj.bias] | |
| Loading weights: 70%|####### | 279/398 [00:00<00:00, 5969.92it/s, Materializing param=vision_model.encoder.layers.4.self_attn.q_proj.weight] | |
| Loading weights: 70%|####### | 279/398 [00:00<00:00, 5964.29it/s, Materializing param=vision_model.encoder.layers.4.self_attn.q_proj.weight] | |
| Loading weights: 70%|####### | 280/398 [00:00<00:00, 5975.67it/s, Materializing param=vision_model.encoder.layers.4.self_attn.v_proj.bias] | |
| Loading weights: 70%|####### | 280/398 [00:00<00:00, 5970.05it/s, Materializing param=vision_model.encoder.layers.4.self_attn.v_proj.bias] | |
| Loading weights: 71%|####### | 281/398 [00:00<00:00, 5981.31it/s, Materializing param=vision_model.encoder.layers.4.self_attn.v_proj.weight] | |
| Loading weights: 71%|####### | 281/398 [00:00<00:00, 5975.79it/s, Materializing param=vision_model.encoder.layers.4.self_attn.v_proj.weight] | |
| Loading weights: 71%|####### | 282/398 [00:00<00:00, 5986.10it/s, Materializing param=vision_model.encoder.layers.5.layer_norm1.bias] | |
| Loading weights: 71%|####### | 282/398 [00:00<00:00, 5980.68it/s, Materializing param=vision_model.encoder.layers.5.layer_norm1.bias] | |
| Loading weights: 71%|#######1 | 283/398 [00:00<00:00, 5991.77it/s, Materializing param=vision_model.encoder.layers.5.layer_norm1.weight] | |
| Loading weights: 71%|#######1 | 283/398 [00:00<00:00, 5986.30it/s, Materializing param=vision_model.encoder.layers.5.layer_norm1.weight] | |
| Loading weights: 71%|#######1 | 284/398 [00:00<00:00, 5997.75it/s, Materializing param=vision_model.encoder.layers.5.layer_norm2.bias] | |
| Loading weights: 71%|#######1 | 284/398 [00:00<00:00, 5990.21it/s, Materializing param=vision_model.encoder.layers.5.layer_norm2.bias] | |
| Loading weights: 72%|#######1 | 285/398 [00:00<00:00, 6001.13it/s, Materializing param=vision_model.encoder.layers.5.layer_norm2.weight] | |
| Loading weights: 72%|#######1 | 285/398 [00:00<00:00, 5995.71it/s, Materializing param=vision_model.encoder.layers.5.layer_norm2.weight] | |
| Loading weights: 72%|#######1 | 286/398 [00:00<00:00, 6007.26it/s, Materializing param=vision_model.encoder.layers.5.mlp.fc1.bias] | |
| Loading weights: 72%|#######1 | 286/398 [00:00<00:00, 6001.91it/s, Materializing param=vision_model.encoder.layers.5.mlp.fc1.bias] | |
| Loading weights: 72%|#######2 | 287/398 [00:00<00:00, 6013.11it/s, Materializing param=vision_model.encoder.layers.5.mlp.fc1.weight] | |
| Loading weights: 72%|#######2 | 287/398 [00:00<00:00, 6007.83it/s, Materializing param=vision_model.encoder.layers.5.mlp.fc1.weight] | |
| Loading weights: 72%|#######2 | 288/398 [00:00<00:00, 6018.97it/s, Materializing param=vision_model.encoder.layers.5.mlp.fc2.bias] | |
| Loading weights: 72%|#######2 | 288/398 [00:00<00:00, 6013.88it/s, Materializing param=vision_model.encoder.layers.5.mlp.fc2.bias] | |
| Loading weights: 73%|#######2 | 289/398 [00:00<00:00, 6025.07it/s, Materializing param=vision_model.encoder.layers.5.mlp.fc2.weight] | |
| Loading weights: 73%|#######2 | 289/398 [00:00<00:00, 6019.89it/s, Materializing param=vision_model.encoder.layers.5.mlp.fc2.weight] | |
| Loading weights: 73%|#######2 | 290/398 [00:00<00:00, 6030.87it/s, Materializing param=vision_model.encoder.layers.5.self_attn.k_proj.bias] | |
| Loading weights: 73%|#######2 | 290/398 [00:00<00:00, 6025.58it/s, Materializing param=vision_model.encoder.layers.5.self_attn.k_proj.bias] | |
| Loading weights: 73%|#######3 | 291/398 [00:00<00:00, 6036.46it/s, Materializing param=vision_model.encoder.layers.5.self_attn.k_proj.weight] | |
| Loading weights: 73%|#######3 | 291/398 [00:00<00:00, 6031.00it/s, Materializing param=vision_model.encoder.layers.5.self_attn.k_proj.weight] | |
| Loading weights: 73%|#######3 | 292/398 [00:00<00:00, 6041.70it/s, Materializing param=vision_model.encoder.layers.5.self_attn.out_proj.bias] | |
| Loading weights: 73%|#######3 | 292/398 [00:00<00:00, 6036.31it/s, Materializing param=vision_model.encoder.layers.5.self_attn.out_proj.bias] | |
| Loading weights: 74%|#######3 | 293/398 [00:00<00:00, 6047.15it/s, Materializing param=vision_model.encoder.layers.5.self_attn.out_proj.weight] | |
| Loading weights: 74%|#######3 | 293/398 [00:00<00:00, 6041.82it/s, Materializing param=vision_model.encoder.layers.5.self_attn.out_proj.weight] | |
| Loading weights: 74%|#######3 | 294/398 [00:00<00:00, 6052.92it/s, Materializing param=vision_model.encoder.layers.5.self_attn.q_proj.bias] | |
| Loading weights: 74%|#######3 | 294/398 [00:00<00:00, 6047.58it/s, Materializing param=vision_model.encoder.layers.5.self_attn.q_proj.bias] | |
| Loading weights: 74%|#######4 | 295/398 [00:00<00:00, 6058.43it/s, Materializing param=vision_model.encoder.layers.5.self_attn.q_proj.weight] | |
| Loading weights: 74%|#######4 | 295/398 [00:00<00:00, 6053.19it/s, Materializing param=vision_model.encoder.layers.5.self_attn.q_proj.weight] | |
| Loading weights: 74%|#######4 | 296/398 [00:00<00:00, 6063.83it/s, Materializing param=vision_model.encoder.layers.5.self_attn.v_proj.bias] | |
| Loading weights: 74%|#######4 | 296/398 [00:00<00:00, 6058.29it/s, Materializing param=vision_model.encoder.layers.5.self_attn.v_proj.bias] | |
| Loading weights: 75%|#######4 | 297/398 [00:00<00:00, 6068.60it/s, Materializing param=vision_model.encoder.layers.5.self_attn.v_proj.weight] | |
| Loading weights: 75%|#######4 | 297/398 [00:00<00:00, 6062.90it/s, Materializing param=vision_model.encoder.layers.5.self_attn.v_proj.weight] | |
| Loading weights: 75%|#######4 | 298/398 [00:00<00:00, 6072.74it/s, Materializing param=vision_model.encoder.layers.6.layer_norm1.bias] | |
| Loading weights: 75%|#######4 | 298/398 [00:00<00:00, 6066.31it/s, Materializing param=vision_model.encoder.layers.6.layer_norm1.bias] | |
| Loading weights: 75%|#######5 | 299/398 [00:00<00:00, 6075.93it/s, Materializing param=vision_model.encoder.layers.6.layer_norm1.weight] | |
| Loading weights: 75%|#######5 | 299/398 [00:00<00:00, 6070.37it/s, Materializing param=vision_model.encoder.layers.6.layer_norm1.weight] | |
| Loading weights: 75%|#######5 | 300/398 [00:00<00:00, 6080.58it/s, Materializing param=vision_model.encoder.layers.6.layer_norm2.bias] | |
| Loading weights: 75%|#######5 | 300/398 [00:00<00:00, 6075.12it/s, Materializing param=vision_model.encoder.layers.6.layer_norm2.bias] | |
| Loading weights: 76%|#######5 | 301/398 [00:00<00:00, 6085.24it/s, Materializing param=vision_model.encoder.layers.6.layer_norm2.weight] | |
| Loading weights: 76%|#######5 | 301/398 [00:00<00:00, 6079.67it/s, Materializing param=vision_model.encoder.layers.6.layer_norm2.weight] | |
| Loading weights: 76%|#######5 | 302/398 [00:00<00:00, 6089.81it/s, Materializing param=vision_model.encoder.layers.6.mlp.fc1.bias] | |
| Loading weights: 76%|#######5 | 302/398 [00:00<00:00, 6084.28it/s, Materializing param=vision_model.encoder.layers.6.mlp.fc1.bias] | |
| Loading weights: 76%|#######6 | 303/398 [00:00<00:00, 6093.98it/s, Materializing param=vision_model.encoder.layers.6.mlp.fc1.weight] | |
| Loading weights: 76%|#######6 | 303/398 [00:00<00:00, 6088.87it/s, Materializing param=vision_model.encoder.layers.6.mlp.fc1.weight] | |
| Loading weights: 76%|#######6 | 304/398 [00:00<00:00, 6099.17it/s, Materializing param=vision_model.encoder.layers.6.mlp.fc2.bias] | |
| Loading weights: 76%|#######6 | 304/398 [00:00<00:00, 6093.81it/s, Materializing param=vision_model.encoder.layers.6.mlp.fc2.bias] | |
| Loading weights: 77%|#######6 | 305/398 [00:00<00:00, 6104.14it/s, Materializing param=vision_model.encoder.layers.6.mlp.fc2.weight] | |
| Loading weights: 77%|#######6 | 305/398 [00:00<00:00, 6098.64it/s, Materializing param=vision_model.encoder.layers.6.mlp.fc2.weight] | |
| Loading weights: 77%|#######6 | 306/398 [00:00<00:00, 6107.74it/s, Materializing param=vision_model.encoder.layers.6.self_attn.k_proj.bias] | |
| Loading weights: 77%|#######6 | 306/398 [00:00<00:00, 6101.79it/s, Materializing param=vision_model.encoder.layers.6.self_attn.k_proj.bias] | |
| Loading weights: 77%|#######7 | 307/398 [00:00<00:00, 6111.21it/s, Materializing param=vision_model.encoder.layers.6.self_attn.k_proj.weight] | |
| Loading weights: 77%|#######7 | 307/398 [00:00<00:00, 6105.71it/s, Materializing param=vision_model.encoder.layers.6.self_attn.k_proj.weight] | |
| Loading weights: 77%|#######7 | 308/398 [00:00<00:00, 6115.33it/s, Materializing param=vision_model.encoder.layers.6.self_attn.out_proj.bias] | |
| Loading weights: 77%|#######7 | 308/398 [00:00<00:00, 6109.78it/s, Materializing param=vision_model.encoder.layers.6.self_attn.out_proj.bias] | |
| Loading weights: 78%|#######7 | 309/398 [00:00<00:00, 6119.57it/s, Materializing param=vision_model.encoder.layers.6.self_attn.out_proj.weight] | |
| Loading weights: 78%|#######7 | 309/398 [00:00<00:00, 6114.46it/s, Materializing param=vision_model.encoder.layers.6.self_attn.out_proj.weight] | |
| Loading weights: 78%|#######7 | 310/398 [00:00<00:00, 6124.28it/s, Materializing param=vision_model.encoder.layers.6.self_attn.q_proj.bias] | |
| Loading weights: 78%|#######7 | 310/398 [00:00<00:00, 6118.52it/s, Materializing param=vision_model.encoder.layers.6.self_attn.q_proj.bias] | |
| Loading weights: 78%|#######8 | 311/398 [00:00<00:00, 6127.96it/s, Materializing param=vision_model.encoder.layers.6.self_attn.q_proj.weight] | |
| Loading weights: 78%|#######8 | 311/398 [00:00<00:00, 6122.55it/s, Materializing param=vision_model.encoder.layers.6.self_attn.q_proj.weight] | |
| Loading weights: 78%|#######8 | 312/398 [00:00<00:00, 6132.34it/s, Materializing param=vision_model.encoder.layers.6.self_attn.v_proj.bias] | |
| Loading weights: 78%|#######8 | 312/398 [00:00<00:00, 6126.86it/s, Materializing param=vision_model.encoder.layers.6.self_attn.v_proj.bias] | |
| Loading weights: 79%|#######8 | 313/398 [00:00<00:00, 6136.41it/s, Materializing param=vision_model.encoder.layers.6.self_attn.v_proj.weight] | |
| Loading weights: 79%|#######8 | 313/398 [00:00<00:00, 6130.96it/s, Materializing param=vision_model.encoder.layers.6.self_attn.v_proj.weight] | |
| Loading weights: 79%|#######8 | 314/398 [00:00<00:00, 6139.97it/s, Materializing param=vision_model.encoder.layers.7.layer_norm1.bias] | |
| Loading weights: 79%|#######8 | 314/398 [00:00<00:00, 6134.74it/s, Materializing param=vision_model.encoder.layers.7.layer_norm1.bias] | |
| Loading weights: 79%|#######9 | 315/398 [00:00<00:00, 6144.37it/s, Materializing param=vision_model.encoder.layers.7.layer_norm1.weight] | |
| Loading weights: 79%|#######9 | 315/398 [00:00<00:00, 6139.09it/s, Materializing param=vision_model.encoder.layers.7.layer_norm1.weight] | |
| Loading weights: 79%|#######9 | 316/398 [00:00<00:00, 6149.21it/s, Materializing param=vision_model.encoder.layers.7.layer_norm2.bias] | |
| Loading weights: 79%|#######9 | 316/398 [00:00<00:00, 6144.02it/s, Materializing param=vision_model.encoder.layers.7.layer_norm2.bias] | |
| Loading weights: 80%|#######9 | 317/398 [00:00<00:00, 6154.05it/s, Materializing param=vision_model.encoder.layers.7.layer_norm2.weight] | |
| Loading weights: 80%|#######9 | 317/398 [00:00<00:00, 6148.78it/s, Materializing param=vision_model.encoder.layers.7.layer_norm2.weight] | |
| Loading weights: 80%|#######9 | 318/398 [00:00<00:00, 6158.41it/s, Materializing param=vision_model.encoder.layers.7.mlp.fc1.bias] | |
| Loading weights: 80%|#######9 | 318/398 [00:00<00:00, 6153.13it/s, Materializing param=vision_model.encoder.layers.7.mlp.fc1.bias] | |
| Loading weights: 80%|######## | 319/398 [00:00<00:00, 6162.50it/s, Materializing param=vision_model.encoder.layers.7.mlp.fc1.weight] | |
| Loading weights: 80%|######## | 319/398 [00:00<00:00, 6156.85it/s, Materializing param=vision_model.encoder.layers.7.mlp.fc1.weight] | |
| Loading weights: 80%|######## | 320/398 [00:00<00:00, 6166.11it/s, Materializing param=vision_model.encoder.layers.7.mlp.fc2.bias] | |
| Loading weights: 80%|######## | 320/398 [00:00<00:00, 6160.65it/s, Materializing param=vision_model.encoder.layers.7.mlp.fc2.bias] | |
| Loading weights: 81%|######## | 321/398 [00:00<00:00, 6170.10it/s, Materializing param=vision_model.encoder.layers.7.mlp.fc2.weight] | |
| Loading weights: 81%|######## | 321/398 [00:00<00:00, 6164.85it/s, Materializing param=vision_model.encoder.layers.7.mlp.fc2.weight] | |
| Loading weights: 81%|######## | 322/398 [00:00<00:00, 6174.89it/s, Materializing param=vision_model.encoder.layers.7.self_attn.k_proj.bias] | |
| Loading weights: 81%|######## | 322/398 [00:00<00:00, 6169.73it/s, Materializing param=vision_model.encoder.layers.7.self_attn.k_proj.bias] | |
| Loading weights: 81%|########1 | 323/398 [00:00<00:00, 6178.90it/s, Materializing param=vision_model.encoder.layers.7.self_attn.k_proj.weight] | |
| Loading weights: 81%|########1 | 323/398 [00:00<00:00, 6173.55it/s, Materializing param=vision_model.encoder.layers.7.self_attn.k_proj.weight] | |
| Loading weights: 81%|########1 | 324/398 [00:00<00:00, 6183.02it/s, Materializing param=vision_model.encoder.layers.7.self_attn.out_proj.bias] | |
| Loading weights: 81%|########1 | 324/398 [00:00<00:00, 6177.74it/s, Materializing param=vision_model.encoder.layers.7.self_attn.out_proj.bias] | |
| Loading weights: 82%|########1 | 325/398 [00:00<00:00, 6187.61it/s, Materializing param=vision_model.encoder.layers.7.self_attn.out_proj.weight] | |
| Loading weights: 82%|########1 | 325/398 [00:00<00:00, 6182.42it/s, Materializing param=vision_model.encoder.layers.7.self_attn.out_proj.weight] | |
| Loading weights: 82%|########1 | 326/398 [00:00<00:00, 6192.06it/s, Materializing param=vision_model.encoder.layers.7.self_attn.q_proj.bias] | |
| Loading weights: 82%|########1 | 326/398 [00:00<00:00, 6187.07it/s, Materializing param=vision_model.encoder.layers.7.self_attn.q_proj.bias] | |
| Loading weights: 82%|########2 | 327/398 [00:00<00:00, 6196.74it/s, Materializing param=vision_model.encoder.layers.7.self_attn.q_proj.weight] | |
| Loading weights: 82%|########2 | 327/398 [00:00<00:00, 6191.62it/s, Materializing param=vision_model.encoder.layers.7.self_attn.q_proj.weight] | |
| Loading weights: 82%|########2 | 328/398 [00:00<00:00, 6201.43it/s, Materializing param=vision_model.encoder.layers.7.self_attn.v_proj.bias] | |
| Loading weights: 82%|########2 | 328/398 [00:00<00:00, 6196.43it/s, Materializing param=vision_model.encoder.layers.7.self_attn.v_proj.bias] | |
| Loading weights: 83%|########2 | 329/398 [00:00<00:00, 6206.10it/s, Materializing param=vision_model.encoder.layers.7.self_attn.v_proj.weight] | |
| Loading weights: 83%|########2 | 329/398 [00:00<00:00, 6200.88it/s, Materializing param=vision_model.encoder.layers.7.self_attn.v_proj.weight] | |
| Loading weights: 83%|########2 | 330/398 [00:00<00:00, 6209.74it/s, Materializing param=vision_model.encoder.layers.8.layer_norm1.bias] | |
| Loading weights: 83%|########2 | 330/398 [00:00<00:00, 6204.17it/s, Materializing param=vision_model.encoder.layers.8.layer_norm1.bias] | |
| Loading weights: 83%|########3 | 331/398 [00:00<00:00, 6213.48it/s, Materializing param=vision_model.encoder.layers.8.layer_norm1.weight] | |
| Loading weights: 83%|########3 | 331/398 [00:00<00:00, 6208.62it/s, Materializing param=vision_model.encoder.layers.8.layer_norm1.weight] | |
| Loading weights: 83%|########3 | 332/398 [00:00<00:00, 6218.20it/s, Materializing param=vision_model.encoder.layers.8.layer_norm2.bias] | |
| Loading weights: 83%|########3 | 332/398 [00:00<00:00, 6213.28it/s, Materializing param=vision_model.encoder.layers.8.layer_norm2.bias] | |
| Loading weights: 84%|########3 | 333/398 [00:00<00:00, 6223.06it/s, Materializing param=vision_model.encoder.layers.8.layer_norm2.weight] | |
| Loading weights: 84%|########3 | 333/398 [00:00<00:00, 6218.02it/s, Materializing param=vision_model.encoder.layers.8.layer_norm2.weight] | |
| Loading weights: 84%|########3 | 334/398 [00:00<00:00, 6227.73it/s, Materializing param=vision_model.encoder.layers.8.mlp.fc1.bias] | |
| Loading weights: 84%|########3 | 334/398 [00:00<00:00, 6222.89it/s, Materializing param=vision_model.encoder.layers.8.mlp.fc1.bias] | |
| Loading weights: 84%|########4 | 335/398 [00:00<00:00, 6232.19it/s, Materializing param=vision_model.encoder.layers.8.mlp.fc1.weight] | |
| Loading weights: 84%|########4 | 335/398 [00:00<00:00, 6227.31it/s, Materializing param=vision_model.encoder.layers.8.mlp.fc1.weight] | |
| Loading weights: 84%|########4 | 336/398 [00:00<00:00, 6236.83it/s, Materializing param=vision_model.encoder.layers.8.mlp.fc2.bias] | |
| Loading weights: 84%|########4 | 336/398 [00:00<00:00, 6231.81it/s, Materializing param=vision_model.encoder.layers.8.mlp.fc2.bias] | |
| Loading weights: 85%|########4 | 337/398 [00:00<00:00, 6241.41it/s, Materializing param=vision_model.encoder.layers.8.mlp.fc2.weight] | |
| Loading weights: 85%|########4 | 337/398 [00:00<00:00, 6236.54it/s, Materializing param=vision_model.encoder.layers.8.mlp.fc2.weight] | |
| Loading weights: 85%|########4 | 338/398 [00:00<00:00, 6246.14it/s, Materializing param=vision_model.encoder.layers.8.self_attn.k_proj.bias] | |
| Loading weights: 85%|########4 | 338/398 [00:00<00:00, 6241.19it/s, Materializing param=vision_model.encoder.layers.8.self_attn.k_proj.bias] | |
| Loading weights: 85%|########5 | 339/398 [00:00<00:00, 6250.58it/s, Materializing param=vision_model.encoder.layers.8.self_attn.k_proj.weight] | |
| Loading weights: 85%|########5 | 339/398 [00:00<00:00, 6245.44it/s, Materializing param=vision_model.encoder.layers.8.self_attn.k_proj.weight] | |
| Loading weights: 85%|########5 | 340/398 [00:00<00:00, 6254.83it/s, Materializing param=vision_model.encoder.layers.8.self_attn.out_proj.bias] | |
| Loading weights: 85%|########5 | 340/398 [00:00<00:00, 6249.92it/s, Materializing param=vision_model.encoder.layers.8.self_attn.out_proj.bias] | |
| Loading weights: 86%|########5 | 341/398 [00:00<00:00, 6259.36it/s, Materializing param=vision_model.encoder.layers.8.self_attn.out_proj.weight] | |
| Loading weights: 86%|########5 | 341/398 [00:00<00:00, 6254.52it/s, Materializing param=vision_model.encoder.layers.8.self_attn.out_proj.weight] | |
| Loading weights: 86%|########5 | 342/398 [00:00<00:00, 6264.01it/s, Materializing param=vision_model.encoder.layers.8.self_attn.q_proj.bias] | |
| Loading weights: 86%|########5 | 342/398 [00:00<00:00, 6259.14it/s, Materializing param=vision_model.encoder.layers.8.self_attn.q_proj.bias] | |
| Loading weights: 86%|########6 | 343/398 [00:00<00:00, 6268.47it/s, Materializing param=vision_model.encoder.layers.8.self_attn.q_proj.weight] | |
| Loading weights: 86%|########6 | 343/398 [00:00<00:00, 6263.62it/s, Materializing param=vision_model.encoder.layers.8.self_attn.q_proj.weight] | |
| Loading weights: 86%|########6 | 344/398 [00:00<00:00, 6273.11it/s, Materializing param=vision_model.encoder.layers.8.self_attn.v_proj.bias] | |
| Loading weights: 86%|########6 | 344/398 [00:00<00:00, 6268.18it/s, Materializing param=vision_model.encoder.layers.8.self_attn.v_proj.bias] | |
| Loading weights: 87%|########6 | 345/398 [00:00<00:00, 6277.07it/s, Materializing param=vision_model.encoder.layers.8.self_attn.v_proj.weight] | |
| Loading weights: 87%|########6 | 345/398 [00:00<00:00, 6272.07it/s, Materializing param=vision_model.encoder.layers.8.self_attn.v_proj.weight] | |
| Loading weights: 87%|########6 | 346/398 [00:00<00:00, 6281.53it/s, Materializing param=vision_model.encoder.layers.9.layer_norm1.bias] | |
| Loading weights: 87%|########6 | 346/398 [00:00<00:00, 6276.02it/s, Materializing param=vision_model.encoder.layers.9.layer_norm1.bias] | |
| Loading weights: 87%|########7 | 347/398 [00:00<00:00, 6285.38it/s, Materializing param=vision_model.encoder.layers.9.layer_norm1.weight] | |
| Loading weights: 87%|########7 | 347/398 [00:00<00:00, 6280.61it/s, Materializing param=vision_model.encoder.layers.9.layer_norm1.weight] | |
| Loading weights: 87%|########7 | 348/398 [00:00<00:00, 6290.18it/s, Materializing param=vision_model.encoder.layers.9.layer_norm2.bias] | |
| Loading weights: 87%|########7 | 348/398 [00:00<00:00, 6285.47it/s, Materializing param=vision_model.encoder.layers.9.layer_norm2.bias] | |
| Loading weights: 88%|########7 | 349/398 [00:00<00:00, 6294.99it/s, Materializing param=vision_model.encoder.layers.9.layer_norm2.weight] | |
| Loading weights: 88%|########7 | 349/398 [00:00<00:00, 6290.15it/s, Materializing param=vision_model.encoder.layers.9.layer_norm2.weight] | |
| Loading weights: 88%|########7 | 350/398 [00:00<00:00, 6299.86it/s, Materializing param=vision_model.encoder.layers.9.mlp.fc1.bias] | |
| Loading weights: 88%|########7 | 350/398 [00:00<00:00, 6295.13it/s, Materializing param=vision_model.encoder.layers.9.mlp.fc1.bias] | |
| Loading weights: 88%|########8 | 351/398 [00:00<00:00, 6273.73it/s, Materializing param=vision_model.encoder.layers.9.mlp.fc1.weight] | |
| Loading weights: 88%|########8 | 351/398 [00:00<00:00, 6263.75it/s, Materializing param=vision_model.encoder.layers.9.mlp.fc1.weight] | |
| Loading weights: 88%|########8 | 352/398 [00:00<00:00, 6259.33it/s, Materializing param=vision_model.encoder.layers.9.mlp.fc2.bias] | |
| Loading weights: 88%|########8 | 352/398 [00:00<00:00, 6252.39it/s, Materializing param=vision_model.encoder.layers.9.mlp.fc2.bias] | |
| Loading weights: 89%|########8 | 353/398 [00:00<00:00, 6259.28it/s, Materializing param=vision_model.encoder.layers.9.mlp.fc2.weight] | |
| Loading weights: 89%|########8 | 353/398 [00:00<00:00, 6254.05it/s, Materializing param=vision_model.encoder.layers.9.mlp.fc2.weight] | |
| Loading weights: 89%|########8 | 354/398 [00:00<00:00, 6262.16it/s, Materializing param=vision_model.encoder.layers.9.self_attn.k_proj.bias] | |
| Loading weights: 89%|########8 | 354/398 [00:00<00:00, 6256.94it/s, Materializing param=vision_model.encoder.layers.9.self_attn.k_proj.bias] | |
| Loading weights: 89%|########9 | 355/398 [00:00<00:00, 6263.03it/s, Materializing param=vision_model.encoder.layers.9.self_attn.k_proj.weight] | |
| Loading weights: 89%|########9 | 355/398 [00:00<00:00, 6254.79it/s, Materializing param=vision_model.encoder.layers.9.self_attn.k_proj.weight] | |
| Loading weights: 89%|########9 | 356/398 [00:00<00:00, 6257.35it/s, Materializing param=vision_model.encoder.layers.9.self_attn.out_proj.bias] | |
| Loading weights: 89%|########9 | 356/398 [00:00<00:00, 6250.69it/s, Materializing param=vision_model.encoder.layers.9.self_attn.out_proj.bias] | |
| Loading weights: 90%|########9 | 357/398 [00:00<00:00, 6255.47it/s, Materializing param=vision_model.encoder.layers.9.self_attn.out_proj.weight] | |
| Loading weights: 90%|########9 | 357/398 [00:00<00:00, 6249.29it/s, Materializing param=vision_model.encoder.layers.9.self_attn.out_proj.weight] | |
| Loading weights: 90%|########9 | 358/398 [00:00<00:00, 6253.35it/s, Materializing param=vision_model.encoder.layers.9.self_attn.q_proj.bias] | |
| Loading weights: 90%|########9 | 358/398 [00:00<00:00, 6245.83it/s, Materializing param=vision_model.encoder.layers.9.self_attn.q_proj.bias] | |
| Loading weights: 90%|######### | 359/398 [00:00<00:00, 6251.42it/s, Materializing param=vision_model.encoder.layers.9.self_attn.q_proj.weight] | |
| Loading weights: 90%|######### | 359/398 [00:00<00:00, 6244.55it/s, Materializing param=vision_model.encoder.layers.9.self_attn.q_proj.weight] | |
| Loading weights: 90%|######### | 360/398 [00:00<00:00, 6249.33it/s, Materializing param=vision_model.encoder.layers.9.self_attn.v_proj.bias] | |
| Loading weights: 90%|######### | 360/398 [00:00<00:00, 6242.92it/s, Materializing param=vision_model.encoder.layers.9.self_attn.v_proj.bias] | |
| Loading weights: 91%|######### | 361/398 [00:00<00:00, 6247.29it/s, Materializing param=vision_model.encoder.layers.9.self_attn.v_proj.weight] | |
| Loading weights: 91%|######### | 361/398 [00:00<00:00, 6240.85it/s, Materializing param=vision_model.encoder.layers.9.self_attn.v_proj.weight] | |
| Loading weights: 91%|######### | 362/398 [00:00<00:00, 6246.48it/s, Materializing param=vision_model.encoder.layers.10.layer_norm1.bias] | |
| Loading weights: 91%|######### | 362/398 [00:00<00:00, 6242.09it/s, Materializing param=vision_model.encoder.layers.10.layer_norm1.bias] | |
| Loading weights: 91%|#########1| 363/398 [00:00<00:00, 6248.34it/s, Materializing param=vision_model.encoder.layers.10.layer_norm1.weight] | |
| Loading weights: 91%|#########1| 363/398 [00:00<00:00, 6241.34it/s, Materializing param=vision_model.encoder.layers.10.layer_norm1.weight] | |
| Loading weights: 91%|#########1| 364/398 [00:00<00:00, 6245.43it/s, Materializing param=vision_model.encoder.layers.10.layer_norm2.bias] | |
| Loading weights: 91%|#########1| 364/398 [00:00<00:00, 6238.41it/s, Materializing param=vision_model.encoder.layers.10.layer_norm2.bias] | |
| Loading weights: 92%|#########1| 365/398 [00:00<00:00, 6241.80it/s, Materializing param=vision_model.encoder.layers.10.layer_norm2.weight] | |
| Loading weights: 92%|#########1| 365/398 [00:00<00:00, 6233.72it/s, Materializing param=vision_model.encoder.layers.10.layer_norm2.weight] | |
| Loading weights: 92%|#########1| 366/398 [00:00<00:00, 6236.96it/s, Materializing param=vision_model.encoder.layers.10.mlp.fc1.bias] | |
| Loading weights: 92%|#########1| 366/398 [00:00<00:00, 6229.64it/s, Materializing param=vision_model.encoder.layers.10.mlp.fc1.bias] | |
| Loading weights: 92%|#########2| 367/398 [00:00<00:00, 6233.39it/s, Materializing param=vision_model.encoder.layers.10.mlp.fc1.weight] | |
| Loading weights: 92%|#########2| 367/398 [00:00<00:00, 6226.50it/s, Materializing param=vision_model.encoder.layers.10.mlp.fc1.weight] | |
| Loading weights: 92%|#########2| 368/398 [00:00<00:00, 6231.12it/s, Materializing param=vision_model.encoder.layers.10.mlp.fc2.bias] | |
| Loading weights: 92%|#########2| 368/398 [00:00<00:00, 6225.39it/s, Materializing param=vision_model.encoder.layers.10.mlp.fc2.bias] | |
| Loading weights: 93%|#########2| 369/398 [00:00<00:00, 6230.89it/s, Materializing param=vision_model.encoder.layers.10.mlp.fc2.weight] | |
| Loading weights: 93%|#########2| 369/398 [00:00<00:00, 6225.03it/s, Materializing param=vision_model.encoder.layers.10.mlp.fc2.weight] | |
| Loading weights: 93%|#########2| 370/398 [00:00<00:00, 6229.40it/s, Materializing param=vision_model.encoder.layers.10.self_attn.k_proj.bias] | |
| Loading weights: 93%|#########2| 370/398 [00:00<00:00, 6223.48it/s, Materializing param=vision_model.encoder.layers.10.self_attn.k_proj.bias] | |
| Loading weights: 93%|#########3| 371/398 [00:00<00:00, 6229.26it/s, Materializing param=vision_model.encoder.layers.10.self_attn.k_proj.weight] | |
| Loading weights: 93%|#########3| 371/398 [00:00<00:00, 6223.33it/s, Materializing param=vision_model.encoder.layers.10.self_attn.k_proj.weight] | |
| Loading weights: 93%|#########3| 372/398 [00:00<00:00, 6229.46it/s, Materializing param=vision_model.encoder.layers.10.self_attn.out_proj.bias] | |
| Loading weights: 93%|#########3| 372/398 [00:00<00:00, 6223.55it/s, Materializing param=vision_model.encoder.layers.10.self_attn.out_proj.bias] | |
| Loading weights: 94%|#########3| 373/398 [00:00<00:00, 6229.87it/s, Materializing param=vision_model.encoder.layers.10.self_attn.out_proj.weight] | |
| Loading weights: 94%|#########3| 373/398 [00:00<00:00, 6224.12it/s, Materializing param=vision_model.encoder.layers.10.self_attn.out_proj.weight] | |
| Loading weights: 94%|#########3| 374/398 [00:00<00:00, 6231.16it/s, Materializing param=vision_model.encoder.layers.10.self_attn.q_proj.bias] | |
| Loading weights: 94%|#########3| 374/398 [00:00<00:00, 6226.12it/s, Materializing param=vision_model.encoder.layers.10.self_attn.q_proj.bias] | |
| Loading weights: 94%|#########4| 375/398 [00:00<00:00, 6231.88it/s, Materializing param=vision_model.encoder.layers.10.self_attn.q_proj.weight] | |
| Loading weights: 94%|#########4| 375/398 [00:00<00:00, 6225.15it/s, Materializing param=vision_model.encoder.layers.10.self_attn.q_proj.weight] | |
| Loading weights: 94%|#########4| 376/398 [00:00<00:00, 6231.41it/s, Materializing param=vision_model.encoder.layers.10.self_attn.v_proj.bias] | |
| Loading weights: 94%|#########4| 376/398 [00:00<00:00, 6225.46it/s, Materializing param=vision_model.encoder.layers.10.self_attn.v_proj.bias] | |
| Loading weights: 95%|#########4| 377/398 [00:00<00:00, 6230.46it/s, Materializing param=vision_model.encoder.layers.10.self_attn.v_proj.weight] | |
| Loading weights: 95%|#########4| 377/398 [00:00<00:00, 6224.30it/s, Materializing param=vision_model.encoder.layers.10.self_attn.v_proj.weight] | |
| Loading weights: 95%|#########4| 378/398 [00:00<00:00, 6231.25it/s, Materializing param=vision_model.encoder.layers.11.layer_norm1.bias] | |
| Loading weights: 95%|#########4| 378/398 [00:00<00:00, 6226.82it/s, Materializing param=vision_model.encoder.layers.11.layer_norm1.bias] | |
| Loading weights: 95%|#########5| 379/398 [00:00<00:00, 6233.91it/s, Materializing param=vision_model.encoder.layers.11.layer_norm1.weight] | |
| Loading weights: 95%|#########5| 379/398 [00:00<00:00, 6227.42it/s, Materializing param=vision_model.encoder.layers.11.layer_norm1.weight] | |
| Loading weights: 95%|#########5| 380/398 [00:00<00:00, 6219.70it/s, Materializing param=vision_model.encoder.layers.11.layer_norm2.bias] | |
| Loading weights: 95%|#########5| 380/398 [00:00<00:00, 6210.97it/s, Materializing param=vision_model.encoder.layers.11.layer_norm2.bias] | |
| Loading weights: 96%|#########5| 381/398 [00:00<00:00, 6215.74it/s, Materializing param=vision_model.encoder.layers.11.layer_norm2.weight] | |
| Loading weights: 96%|#########5| 381/398 [00:00<00:00, 6210.48it/s, Materializing param=vision_model.encoder.layers.11.layer_norm2.weight] | |
| Loading weights: 96%|#########5| 382/398 [00:00<00:00, 6217.93it/s, Materializing param=vision_model.encoder.layers.11.mlp.fc1.bias] | |
| Loading weights: 96%|#########5| 382/398 [00:00<00:00, 6213.37it/s, Materializing param=vision_model.encoder.layers.11.mlp.fc1.bias] | |
| Loading weights: 96%|#########6| 383/398 [00:00<00:00, 6220.98it/s, Materializing param=vision_model.encoder.layers.11.mlp.fc1.weight] | |
| Loading weights: 96%|#########6| 383/398 [00:00<00:00, 6216.62it/s, Materializing param=vision_model.encoder.layers.11.mlp.fc1.weight] | |
| Loading weights: 96%|#########6| 384/398 [00:00<00:00, 6224.71it/s, Materializing param=vision_model.encoder.layers.11.mlp.fc2.bias] | |
| Loading weights: 96%|#########6| 384/398 [00:00<00:00, 6220.31it/s, Materializing param=vision_model.encoder.layers.11.mlp.fc2.bias] | |
| Loading weights: 97%|#########6| 385/398 [00:00<00:00, 6226.70it/s, Materializing param=vision_model.encoder.layers.11.mlp.fc2.weight] | |
| Loading weights: 97%|#########6| 385/398 [00:00<00:00, 6222.19it/s, Materializing param=vision_model.encoder.layers.11.mlp.fc2.weight] | |
| Loading weights: 97%|#########6| 386/398 [00:00<00:00, 6230.19it/s, Materializing param=vision_model.encoder.layers.11.self_attn.k_proj.bias] | |
| Loading weights: 97%|#########6| 386/398 [00:00<00:00, 6225.71it/s, Materializing param=vision_model.encoder.layers.11.self_attn.k_proj.bias] | |
| Loading weights: 97%|#########7| 387/398 [00:00<00:00, 6233.73it/s, Materializing param=vision_model.encoder.layers.11.self_attn.k_proj.weight] | |
| Loading weights: 97%|#########7| 387/398 [00:00<00:00, 6229.28it/s, Materializing param=vision_model.encoder.layers.11.self_attn.k_proj.weight] | |
| Loading weights: 97%|#########7| 388/398 [00:00<00:00, 6237.36it/s, Materializing param=vision_model.encoder.layers.11.self_attn.out_proj.bias] | |
| Loading weights: 97%|#########7| 388/398 [00:00<00:00, 6232.37it/s, Materializing param=vision_model.encoder.layers.11.self_attn.out_proj.bias] | |
| Loading weights: 98%|#########7| 389/398 [00:00<00:00, 6240.55it/s, Materializing param=vision_model.encoder.layers.11.self_attn.out_proj.weight] | |
| Loading weights: 98%|#########7| 389/398 [00:00<00:00, 6236.30it/s, Materializing param=vision_model.encoder.layers.11.self_attn.out_proj.weight] | |
| Loading weights: 98%|#########7| 390/398 [00:00<00:00, 6244.36it/s, Materializing param=vision_model.encoder.layers.11.self_attn.q_proj.bias] | |
| Loading weights: 98%|#########7| 390/398 [00:00<00:00, 6240.12it/s, Materializing param=vision_model.encoder.layers.11.self_attn.q_proj.bias] | |
| Loading weights: 98%|#########8| 391/398 [00:00<00:00, 6247.66it/s, Materializing param=vision_model.encoder.layers.11.self_attn.q_proj.weight] | |
| Loading weights: 98%|#########8| 391/398 [00:00<00:00, 6243.07it/s, Materializing param=vision_model.encoder.layers.11.self_attn.q_proj.weight] | |
| Loading weights: 98%|#########8| 392/398 [00:00<00:00, 6251.02it/s, Materializing param=vision_model.encoder.layers.11.self_attn.v_proj.bias] | |
| Loading weights: 98%|#########8| 392/398 [00:00<00:00, 6246.74it/s, Materializing param=vision_model.encoder.layers.11.self_attn.v_proj.bias] | |
| Loading weights: 99%|#########8| 393/398 [00:00<00:00, 6254.67it/s, Materializing param=vision_model.encoder.layers.11.self_attn.v_proj.weight] | |
| Loading weights: 99%|#########8| 393/398 [00:00<00:00, 6250.42it/s, Materializing param=vision_model.encoder.layers.11.self_attn.v_proj.weight] | |
| Loading weights: 99%|#########8| 394/398 [00:00<00:00, 6258.59it/s, Materializing param=vision_model.post_layernorm.bias] | |
| Loading weights: 99%|#########8| 394/398 [00:00<00:00, 6254.35it/s, Materializing param=vision_model.post_layernorm.bias] | |
| Loading weights: 99%|#########9| 395/398 [00:00<00:00, 6263.11it/s, Materializing param=vision_model.post_layernorm.weight] | |
| Loading weights: 99%|#########9| 395/398 [00:00<00:00, 6258.52it/s, Materializing param=vision_model.post_layernorm.weight] | |
| Loading weights: 99%|#########9| 396/398 [00:00<00:00, 6267.38it/s, Materializing param=vision_model.pre_layrnorm.bias] | |
| Loading weights: 99%|#########9| 396/398 [00:00<00:00, 6263.41it/s, Materializing param=vision_model.pre_layrnorm.bias] | |
| Loading weights: 100%|#########9| 397/398 [00:00<00:00, 6272.32it/s, Materializing param=vision_model.pre_layrnorm.weight] | |
| Loading weights: 100%|#########9| 397/398 [00:00<00:00, 6268.26it/s, Materializing param=vision_model.pre_layrnorm.weight] | |
| Loading weights: 100%|##########| 398/398 [00:00<00:00, 6277.06it/s, Materializing param=visual_projection.weight] | |
| Loading weights: 100%|##########| 398/398 [00:00<00:00, 6273.14it/s, Materializing param=visual_projection.weight] | |
| Loading weights: 100%|##########| 398/398 [00:00<00:00, 6263.84it/s, Materializing param=visual_projection.weight] | |
| CLIPModel LOAD REPORT from: openai/clip-vit-base-patch32 | |
| Key | Status | | | |
| -------------------------------------+------------+--+- | |
| vision_model.embeddings.position_ids | UNEXPECTED | | | |
| text_model.embeddings.position_ids | UNEXPECTED | | | |
| Notes: | |
| - UNEXPECTED :can be ignored when loading from different task/architecture; not ok if you expect identical arch. | |
| The image processor of type `CLIPImageProcessor` is now loaded as a fast processor by default, even if the model checkpoint was saved with a slow processor. This is a breaking change and may produce slightly different outputs. To continue using the slow processor, instantiate this class with `use_fast=False`. | |
| Running search_text... | |
| Error caught in test script! | |
| Traceback (most recent call last): | |
| File "E:\GitHub\BOOTH-Lens\backend\app\routers\search.py", line 100, in search_text | |
| results = vector_db.search_similar( | |
| vector, | |
| ...<3 lines>... | |
| colors=query_data.colors | |
| ) | |
| File "E:\GitHub\BOOTH-Lens\backend\app\services\vector_db.py", line 179, in search_similar | |
| raw_results = self.client.query_points( | |
| ~~~~~~~~~~~~~~~~~~~~~~~~^ | |
| collection_name=self.collection_name, | |
| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | |
| ...<3 lines>... | |
| with_payload=True | |
| ^^^^^^^^^^^^^^^^^ | |
| ).points | |
| ^ | |
| File "C:\Users\tyari\AppData\Roaming\Python\Python314\site-packages\qdrant_client\qdrant_client.py", line 423, in query_points | |
| return self._client.query_points( | |
| ~~~~~~~~~~~~~~~~~~~~~~~~~^ | |
| collection_name=collection_name, | |
| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | |
| ...<14 lines>... | |
| **kwargs, | |
| ^^^^^^^^^ | |
| ) | |
| ^ | |
| File "C:\Users\tyari\AppData\Roaming\Python\Python314\site-packages\qdrant_client\qdrant_remote.py", line 538, in query_points | |
| query_result = self.http.search_api.query_points( | |
| collection_name=collection_name, | |
| ...<2 lines>... | |
| query_request=query_request, | |
| ) | |
| File "C:\Users\tyari\AppData\Roaming\Python\Python314\site-packages\qdrant_client\http\api\search_api.py", line 783, in query_points | |
| return self._build_for_query_points( | |
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~^ | |
| collection_name=collection_name, | |
| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | |
| ...<2 lines>... | |
| query_request=query_request, | |
| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | |
| ) | |
| ^ | |
| File "C:\Users\tyari\AppData\Roaming\Python\Python314\site-packages\qdrant_client\http\api\search_api.py", line 181, in _build_for_query_points | |
| return self.api_client.request( | |
| ~~~~~~~~~~~~~~~~~~~~~~~^ | |
| type_=m.InlineResponse20021, | |
| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | |
| ...<5 lines>... | |
| content=body, | |
| ^^^^^^^^^^^^^ | |
| ) | |
| ^ | |
| File "C:\Users\tyari\AppData\Roaming\Python\Python314\site-packages\qdrant_client\http\api_client.py", line 95, in request | |
| return self.send(request, type_) | |
| ~~~~~~~~~^^^^^^^^^^^^^^^^ | |
| File "C:\Users\tyari\AppData\Roaming\Python\Python314\site-packages\qdrant_client\http\api_client.py", line 130, in send | |
| raise UnexpectedResponse.for_response(response) | |
| qdrant_client.http.exceptions.UnexpectedResponse: Unexpected Response: 400 (Bad Request) | |
| Raw response content: | |
| b'{"status":{"error":"Bad request: Index required but not found for \\"shopName\\" of one of the following types: [keyword]. Help: Create an index for this key or use a different filter."},"time":0.000 ...' | |
| During handling of the above exception, another exception occurred: | |
| Traceback (most recent call last): | |
| File "E:\GitHub\BOOTH-Lens\backend\test_search.py", line 17, in main | |
| res = await search_text(q, i, v, user) | |
| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | |
| File "E:\GitHub\BOOTH-Lens\backend\app\routers\search.py", line 118, in search_text | |
| raise HTTPException(status_code=500, detail=str(e)) | |
| fastapi.exceptions.HTTPException: 500: Unexpected Response: 400 (Bad Request) | |
| Raw response content: | |
| b'{"status":{"error":"Bad request: Index required but not found for \\"shopName\\" of one of the following types: [keyword]. Help: Create an index for this key or use a different filter."},"time":0.000 ...' | |
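
The 400 response above states that filtering on `shopName` needs a keyword payload index and suggests creating one. A minimal sketch of that step follows, assuming `client` is the same `qdrant_client.QdrantClient` instance that `vector_db.py` uses. The helper name `ensure_keyword_index` and the collection name in the usage comment are hypothetical, since neither appears in the log. Passing the schema as the string `"keyword"` assumes the installed client accepts that form; `models.PayloadSchemaType.KEYWORD` is the enum equivalent.

```python
def ensure_keyword_index(client, collection_name, field_name="shopName"):
    """Create a keyword payload index so `field_name` can be used in filters.

    `client` is assumed to be a qdrant_client.QdrantClient (or any object
    exposing the same `create_payload_index` method).
    """
    return client.create_payload_index(
        collection_name=collection_name,
        field_name=field_name,
        # Assumption: the string form of the schema type is accepted here;
        # qdrant_client.models.PayloadSchemaType.KEYWORD is the enum form.
        field_schema="keyword",
    )


# Hypothetical usage (URL and collection name are placeholders, not from the log):
#   from qdrant_client import QdrantClient
#   client = QdrantClient(url="http://localhost:6333")
#   ensure_keyword_index(client, "my_collection")
```

Running this once per collection, before the first filtered `query_points` call, should be enough. Any other payload field used in a filter may need its own index if the server reports the same error for it.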