| Initializing dependencies... | |
| Loading weights: 0%| | 0/398 [00:00<?, ?it/s] | |
| Loading weights: 0%| | 1/398 [00:00<00:00, 15477.14it/s, Materializing param=logit_scale] | |
| Loading weights: 0%| | 1/398 [00:00<00:00, 5675.65it/s, Materializing param=logit_scale] | |
| Loading weights: 1%| | 2/398 [00:00<00:00, 5870.26it/s, Materializing param=text_model.embeddings.position_embedding.weight] | |
| Loading weights: 1%| | 2/398 [00:00<00:00, 5056.42it/s, Materializing param=text_model.embeddings.position_embedding.weight] | |
| Loading weights: 1%| | 3/398 [00:00<00:00, 5545.58it/s, Materializing param=text_model.embeddings.token_embedding.weight] | |
| Loading weights: 1%| | 3/398 [00:00<00:00, 5098.42it/s, Materializing param=text_model.embeddings.token_embedding.weight] | |
| Loading weights: 1%|1 | 4/398 [00:00<00:00, 5765.37it/s, Materializing param=text_model.encoder.layers.0.layer_norm1.bias] | |
| Loading weights: 1%|1 | 4/398 [00:00<00:00, 5392.87it/s, Materializing param=text_model.encoder.layers.0.layer_norm1.bias] | |
| Loading weights: 1%|1 | 5/398 [00:00<00:00, 5930.86it/s, Materializing param=text_model.encoder.layers.0.layer_norm1.weight] | |
| Loading weights: 1%|1 | 5/398 [00:00<00:00, 5675.65it/s, Materializing param=text_model.encoder.layers.0.layer_norm1.weight] | |
| Loading weights: 2%|1 | 6/398 [00:00<00:00, 6159.04it/s, Materializing param=text_model.encoder.layers.0.layer_norm2.bias] | |
| Loading weights: 2%|1 | 6/398 [00:00<00:00, 5878.49it/s, Materializing param=text_model.encoder.layers.0.layer_norm2.bias] | |
| Loading weights: 2%|1 | 7/398 [00:00<00:00, 6316.72it/s, Materializing param=text_model.encoder.layers.0.layer_norm2.weight] | |
| Loading weights: 2%|1 | 7/398 [00:00<00:00, 6111.60it/s, Materializing param=text_model.encoder.layers.0.layer_norm2.weight] | |
| Loading weights: 2%|2 | 8/398 [00:00<00:00, 6511.63it/s, Materializing param=text_model.encoder.layers.0.mlp.fc1.bias] | |
| Loading weights: 2%|2 | 8/398 [00:00<00:00, 6325.06it/s, Materializing param=text_model.encoder.layers.0.mlp.fc1.bias] | |
| Loading weights: 2%|2 | 9/398 [00:00<00:00, 6530.92it/s, Materializing param=text_model.encoder.layers.0.mlp.fc1.weight] | |
| Loading weights: 2%|2 | 9/398 [00:00<00:00, 6155.02it/s, Materializing param=text_model.encoder.layers.0.mlp.fc1.weight] | |
| Loading weights: 3%|2 | 10/398 [00:00<00:00, 6323.39it/s, Materializing param=text_model.encoder.layers.0.mlp.fc2.bias] | |
| Loading weights: 3%|2 | 10/398 [00:00<00:00, 6086.64it/s, Materializing param=text_model.encoder.layers.0.mlp.fc2.bias] | |
| Loading weights: 3%|2 | 11/398 [00:00<00:00, 6269.51it/s, Materializing param=text_model.encoder.layers.0.mlp.fc2.weight] | |
| Loading weights: 3%|2 | 11/398 [00:00<00:00, 6058.75it/s, Materializing param=text_model.encoder.layers.0.mlp.fc2.weight] | |
| Loading weights: 3%|3 | 12/398 [00:00<00:00, 6230.71it/s, Materializing param=text_model.encoder.layers.0.self_attn.k_proj.bias] | |
| Loading weights: 3%|3 | 12/398 [00:00<00:00, 6042.94it/s, Materializing param=text_model.encoder.layers.0.self_attn.k_proj.bias] | |
| Loading weights: 3%|3 | 13/398 [00:00<00:00, 6155.56it/s, Materializing param=text_model.encoder.layers.0.self_attn.k_proj.weight] | |
| Loading weights: 3%|3 | 13/398 [00:00<00:00, 5950.02it/s, Materializing param=text_model.encoder.layers.0.self_attn.k_proj.weight] | |
| Loading weights: 4%|3 | 14/398 [00:00<00:00, 6046.78it/s, Materializing param=text_model.encoder.layers.0.self_attn.out_proj.bias] | |
| Loading weights: 4%|3 | 14/398 [00:00<00:00, 5872.61it/s, Materializing param=text_model.encoder.layers.0.self_attn.out_proj.bias] | |
| Loading weights: 4%|3 | 15/398 [00:00<00:00, 5985.02it/s, Materializing param=text_model.encoder.layers.0.self_attn.out_proj.weight] | |
| Loading weights: 4%|3 | 15/398 [00:00<00:00, 5829.74it/s, Materializing param=text_model.encoder.layers.0.self_attn.out_proj.weight] | |
| Loading weights: 4%|4 | 16/398 [00:00<00:00, 5945.15it/s, Materializing param=text_model.encoder.layers.0.self_attn.q_proj.bias] | |
| Loading weights: 4%|4 | 16/398 [00:00<00:00, 5802.25it/s, Materializing param=text_model.encoder.layers.0.self_attn.q_proj.bias] | |
| Loading weights: 4%|4 | 17/398 [00:00<00:00, 5921.21it/s, Materializing param=text_model.encoder.layers.0.self_attn.q_proj.weight] | |
| Loading weights: 4%|4 | 17/398 [00:00<00:00, 5814.50it/s, Materializing param=text_model.encoder.layers.0.self_attn.q_proj.weight] | |
| Loading weights: 5%|4 | 18/398 [00:00<00:00, 5973.85it/s, Materializing param=text_model.encoder.layers.0.self_attn.v_proj.bias] | |
| Loading weights: 5%|4 | 18/398 [00:00<00:00, 5890.42it/s, Materializing param=text_model.encoder.layers.0.self_attn.v_proj.bias] | |
| Loading weights: 5%|4 | 19/398 [00:00<00:00, 6061.13it/s, Materializing param=text_model.encoder.layers.0.self_attn.v_proj.weight] | |
| Loading weights: 5%|4 | 19/398 [00:00<00:00, 5979.72it/s, Materializing param=text_model.encoder.layers.0.self_attn.v_proj.weight] | |
| Loading weights: 5%|5 | 20/398 [00:00<00:00, 6144.15it/s, Materializing param=text_model.encoder.layers.1.layer_norm1.bias] | |
| Loading weights: 5%|5 | 20/398 [00:00<00:00, 6065.08it/s, Materializing param=text_model.encoder.layers.1.layer_norm1.bias] | |
| Loading weights: 5%|5 | 21/398 [00:00<00:00, 6220.81it/s, Materializing param=text_model.encoder.layers.1.layer_norm1.weight] | |
| Loading weights: 5%|5 | 21/398 [00:00<00:00, 6147.86it/s, Materializing param=text_model.encoder.layers.1.layer_norm1.weight] | |
| Loading weights: 6%|5 | 22/398 [00:00<00:00, 6288.74it/s, Materializing param=text_model.encoder.layers.1.layer_norm2.bias] | |
| Loading weights: 6%|5 | 22/398 [00:00<00:00, 6215.46it/s, Materializing param=text_model.encoder.layers.1.layer_norm2.bias] | |
| Loading weights: 6%|5 | 23/398 [00:00<00:00, 6372.64it/s, Materializing param=text_model.encoder.layers.1.layer_norm2.weight] | |
| Loading weights: 6%|5 | 23/398 [00:00<00:00, 6302.28it/s, Materializing param=text_model.encoder.layers.1.layer_norm2.weight] | |
| Loading weights: 6%|6 | 24/398 [00:00<00:00, 6446.99it/s, Materializing param=text_model.encoder.layers.1.mlp.fc1.bias] | |
| Loading weights: 6%|6 | 24/398 [00:00<00:00, 6377.96it/s, Materializing param=text_model.encoder.layers.1.mlp.fc1.bias] | |
| Loading weights: 6%|6 | 25/398 [00:00<00:00, 6508.45it/s, Materializing param=text_model.encoder.layers.1.mlp.fc1.weight] | |
| Loading weights: 6%|6 | 25/398 [00:00<00:00, 6439.70it/s, Materializing param=text_model.encoder.layers.1.mlp.fc1.weight] | |
| Loading weights: 7%|6 | 26/398 [00:00<00:00, 6569.00it/s, Materializing param=text_model.encoder.layers.1.mlp.fc2.bias] | |
| Loading weights: 7%|6 | 26/398 [00:00<00:00, 6499.31it/s, Materializing param=text_model.encoder.layers.1.mlp.fc2.bias] | |
| Loading weights: 7%|6 | 27/398 [00:00<00:00, 6620.26it/s, Materializing param=text_model.encoder.layers.1.mlp.fc2.weight] | |
| Loading weights: 7%|6 | 27/398 [00:00<00:00, 6548.29it/s, Materializing param=text_model.encoder.layers.1.mlp.fc2.weight] | |
| Loading weights: 7%|7 | 28/398 [00:00<00:00, 6663.67it/s, Materializing param=text_model.encoder.layers.1.self_attn.k_proj.bias] | |
| Loading weights: 7%|7 | 28/398 [00:00<00:00, 6598.89it/s, Materializing param=text_model.encoder.layers.1.self_attn.k_proj.bias] | |
| Loading weights: 7%|7 | 29/398 [00:00<00:00, 6711.63it/s, Materializing param=text_model.encoder.layers.1.self_attn.k_proj.weight] | |
| Loading weights: 7%|7 | 29/398 [00:00<00:00, 6642.72it/s, Materializing param=text_model.encoder.layers.1.self_attn.k_proj.weight] | |
| Loading weights: 8%|7 | 30/398 [00:00<00:00, 6749.40it/s, Materializing param=text_model.encoder.layers.1.self_attn.out_proj.bias] | |
| Loading weights: 8%|7 | 30/398 [00:00<00:00, 6686.99it/s, Materializing param=text_model.encoder.layers.1.self_attn.out_proj.bias] | |
| Loading weights: 8%|7 | 31/398 [00:00<00:00, 6794.35it/s, Materializing param=text_model.encoder.layers.1.self_attn.out_proj.weight] | |
| Loading weights: 8%|7 | 31/398 [00:00<00:00, 6732.78it/s, Materializing param=text_model.encoder.layers.1.self_attn.out_proj.weight] | |
| Loading weights: 8%|8 | 32/398 [00:00<00:00, 6838.42it/s, Materializing param=text_model.encoder.layers.1.self_attn.q_proj.bias] | |
| Loading weights: 8%|8 | 32/398 [00:00<00:00, 6776.62it/s, Materializing param=text_model.encoder.layers.1.self_attn.q_proj.bias] | |
| Loading weights: 8%|8 | 33/398 [00:00<00:00, 6878.98it/s, Materializing param=text_model.encoder.layers.1.self_attn.q_proj.weight] | |
| Loading weights: 8%|8 | 33/398 [00:00<00:00, 6820.01it/s, Materializing param=text_model.encoder.layers.1.self_attn.q_proj.weight] | |
| Loading weights: 9%|8 | 34/398 [00:00<00:00, 6918.94it/s, Materializing param=text_model.encoder.layers.1.self_attn.v_proj.bias] | |
| Loading weights: 9%|8 | 34/398 [00:00<00:00, 6860.69it/s, Materializing param=text_model.encoder.layers.1.self_attn.v_proj.bias] | |
| Loading weights: 9%|8 | 35/398 [00:00<00:00, 6954.74it/s, Materializing param=text_model.encoder.layers.1.self_attn.v_proj.weight] | |
| Loading weights: 9%|8 | 35/398 [00:00<00:00, 6897.23it/s, Materializing param=text_model.encoder.layers.1.self_attn.v_proj.weight] | |
| Loading weights: 9%|9 | 36/398 [00:00<00:00, 6990.18it/s, Materializing param=text_model.encoder.layers.2.layer_norm1.bias] | |
| Loading weights: 9%|9 | 36/398 [00:00<00:00, 6934.33it/s, Materializing param=text_model.encoder.layers.2.layer_norm1.bias] | |
| Loading weights: 9%|9 | 37/398 [00:00<00:00, 7022.46it/s, Materializing param=text_model.encoder.layers.2.layer_norm1.weight] | |
| Loading weights: 9%|9 | 37/398 [00:00<00:00, 6967.60it/s, Materializing param=text_model.encoder.layers.2.layer_norm1.weight] | |
| Loading weights: 10%|9 | 38/398 [00:00<00:00, 7055.49it/s, Materializing param=text_model.encoder.layers.2.layer_norm2.bias] | |
| Loading weights: 10%|9 | 38/398 [00:00<00:00, 7002.48it/s, Materializing param=text_model.encoder.layers.2.layer_norm2.bias] | |
| Loading weights: 10%|9 | 39/398 [00:00<00:00, 7079.15it/s, Materializing param=text_model.encoder.layers.2.layer_norm2.weight] | |
| Loading weights: 10%|9 | 39/398 [00:00<00:00, 7026.54it/s, Materializing param=text_model.encoder.layers.2.layer_norm2.weight] | |
| Loading weights: 10%|# | 40/398 [00:00<00:00, 7113.81it/s, Materializing param=text_model.encoder.layers.2.mlp.fc1.bias] | |
| Loading weights: 10%|# | 40/398 [00:00<00:00, 7061.42it/s, Materializing param=text_model.encoder.layers.2.mlp.fc1.bias] | |
| Loading weights: 10%|# | 41/398 [00:00<00:00, 7132.28it/s, Materializing param=text_model.encoder.layers.2.mlp.fc1.weight] | |
| Loading weights: 10%|# | 41/398 [00:00<00:00, 7080.89it/s, Materializing param=text_model.encoder.layers.2.mlp.fc1.weight] | |
| Loading weights: 11%|# | 42/398 [00:00<00:00, 7162.17it/s, Materializing param=text_model.encoder.layers.2.mlp.fc2.bias] | |
| Loading weights: 11%|# | 42/398 [00:00<00:00, 7115.88it/s, Materializing param=text_model.encoder.layers.2.mlp.fc2.bias] | |
| Loading weights: 11%|# | 43/398 [00:00<00:00, 7202.39it/s, Materializing param=text_model.encoder.layers.2.mlp.fc2.weight] | |
| Loading weights: 11%|# | 43/398 [00:00<00:00, 7159.22it/s, Materializing param=text_model.encoder.layers.2.mlp.fc2.weight] | |
| Loading weights: 11%|#1 | 44/398 [00:00<00:00, 7237.51it/s, Materializing param=text_model.encoder.layers.2.self_attn.k_proj.bias] | |
| Loading weights: 11%|#1 | 44/398 [00:00<00:00, 7191.82it/s, Materializing param=text_model.encoder.layers.2.self_attn.k_proj.bias] | |
| Loading weights: 11%|#1 | 45/398 [00:00<00:00, 7271.12it/s, Materializing param=text_model.encoder.layers.2.self_attn.k_proj.weight] | |
| Loading weights: 11%|#1 | 45/398 [00:00<00:00, 7226.02it/s, Materializing param=text_model.encoder.layers.2.self_attn.k_proj.weight] | |
| Loading weights: 12%|#1 | 46/398 [00:00<00:00, 7306.04it/s, Materializing param=text_model.encoder.layers.2.self_attn.out_proj.bias] | |
| Loading weights: 12%|#1 | 46/398 [00:00<00:00, 7261.77it/s, Materializing param=text_model.encoder.layers.2.self_attn.out_proj.bias] | |
| Loading weights: 12%|#1 | 47/398 [00:00<00:00, 7340.89it/s, Materializing param=text_model.encoder.layers.2.self_attn.out_proj.weight] | |
| Loading weights: 12%|#1 | 47/398 [00:00<00:00, 7298.49it/s, Materializing param=text_model.encoder.layers.2.self_attn.out_proj.weight] | |
| Loading weights: 12%|#2 | 48/398 [00:00<00:00, 7375.95it/s, Materializing param=text_model.encoder.layers.2.self_attn.q_proj.bias] | |
| Loading weights: 12%|#2 | 48/398 [00:00<00:00, 7334.03it/s, Materializing param=text_model.encoder.layers.2.self_attn.q_proj.bias] | |
| Loading weights: 12%|#2 | 49/398 [00:00<00:00, 7409.36it/s, Materializing param=text_model.encoder.layers.2.self_attn.q_proj.weight] | |
| Loading weights: 12%|#2 | 49/398 [00:00<00:00, 7368.45it/s, Materializing param=text_model.encoder.layers.2.self_attn.q_proj.weight] | |
| Loading weights: 13%|#2 | 50/398 [00:00<00:00, 7444.63it/s, Materializing param=text_model.encoder.layers.2.self_attn.v_proj.bias] | |
| Loading weights: 13%|#2 | 50/398 [00:00<00:00, 7403.11it/s, Materializing param=text_model.encoder.layers.2.self_attn.v_proj.bias] | |
| Loading weights: 13%|#2 | 51/398 [00:00<00:00, 7477.00it/s, Materializing param=text_model.encoder.layers.2.self_attn.v_proj.weight] | |
| Loading weights: 13%|#2 | 51/398 [00:00<00:00, 7438.00it/s, Materializing param=text_model.encoder.layers.2.self_attn.v_proj.weight] | |
| Loading weights: 13%|#3 | 52/398 [00:00<00:00, 7510.72it/s, Materializing param=text_model.encoder.layers.3.layer_norm1.bias] | |
| Loading weights: 13%|#3 | 52/398 [00:00<00:00, 7471.10it/s, Materializing param=text_model.encoder.layers.3.layer_norm1.bias] | |
| Loading weights: 13%|#3 | 53/398 [00:00<00:00, 7540.90it/s, Materializing param=text_model.encoder.layers.3.layer_norm1.weight] | |
| Loading weights: 13%|#3 | 53/398 [00:00<00:00, 7501.20it/s, Materializing param=text_model.encoder.layers.3.layer_norm1.weight] | |
| Loading weights: 14%|#3 | 54/398 [00:00<00:00, 7570.95it/s, Materializing param=text_model.encoder.layers.3.layer_norm2.bias] | |
| Loading weights: 14%|#3 | 54/398 [00:00<00:00, 7532.92it/s, Materializing param=text_model.encoder.layers.3.layer_norm2.bias] | |
| Loading weights: 14%|#3 | 55/398 [00:00<00:00, 7604.64it/s, Materializing param=text_model.encoder.layers.3.layer_norm2.weight] | |
| Loading weights: 14%|#3 | 55/398 [00:00<00:00, 7566.97it/s, Materializing param=text_model.encoder.layers.3.layer_norm2.weight] | |
| Loading weights: 14%|#4 | 56/398 [00:00<00:00, 7636.92it/s, Materializing param=text_model.encoder.layers.3.mlp.fc1.bias] | |
| Loading weights: 14%|#4 | 56/398 [00:00<00:00, 5629.67it/s, Materializing param=text_model.encoder.layers.3.mlp.fc1.bias] | |
| Loading weights: 14%|#4 | 57/398 [00:00<00:00, 5626.49it/s, Materializing param=text_model.encoder.layers.3.mlp.fc1.weight] | |
| Loading weights: 14%|#4 | 57/398 [00:00<00:00, 5595.15it/s, Materializing param=text_model.encoder.layers.3.mlp.fc1.weight] | |
| Loading weights: 15%|#4 | 58/398 [00:00<00:00, 5642.87it/s, Materializing param=text_model.encoder.layers.3.mlp.fc2.bias] | |
| Loading weights: 15%|#4 | 58/398 [00:00<00:00, 5618.11it/s, Materializing param=text_model.encoder.layers.3.mlp.fc2.bias] | |
| Loading weights: 15%|#4 | 59/398 [00:00<00:00, 5655.28it/s, Materializing param=text_model.encoder.layers.3.mlp.fc2.weight] | |
| Loading weights: 15%|#4 | 59/398 [00:00<00:00, 5630.83it/s, Materializing param=text_model.encoder.layers.3.mlp.fc2.weight] | |
| Loading weights: 15%|#5 | 60/398 [00:00<00:00, 5683.72it/s, Materializing param=text_model.encoder.layers.3.self_attn.k_proj.bias] | |
| Loading weights: 15%|#5 | 60/398 [00:00<00:00, 5659.82it/s, Materializing param=text_model.encoder.layers.3.self_attn.k_proj.bias] | |
| Loading weights: 15%|#5 | 61/398 [00:00<00:00, 5708.96it/s, Materializing param=text_model.encoder.layers.3.self_attn.k_proj.weight] | |
| Loading weights: 15%|#5 | 61/398 [00:00<00:00, 5684.35it/s, Materializing param=text_model.encoder.layers.3.self_attn.k_proj.weight] | |
| Loading weights: 16%|#5 | 62/398 [00:00<00:00, 5644.60it/s, Materializing param=text_model.encoder.layers.3.self_attn.out_proj.bias] | |
| Loading weights: 16%|#5 | 62/398 [00:00<00:00, 5608.81it/s, Materializing param=text_model.encoder.layers.3.self_attn.out_proj.bias] | |
| Loading weights: 16%|#5 | 63/398 [00:00<00:00, 5612.48it/s, Materializing param=text_model.encoder.layers.3.self_attn.out_proj.weight] | |
| Loading weights: 16%|#5 | 63/398 [00:00<00:00, 5566.37it/s, Materializing param=text_model.encoder.layers.3.self_attn.out_proj.weight] | |
| Loading weights: 16%|#6 | 64/398 [00:00<00:00, 5586.82it/s, Materializing param=text_model.encoder.layers.3.self_attn.q_proj.bias] | |
| Loading weights: 16%|#6 | 64/398 [00:00<00:00, 5559.05it/s, Materializing param=text_model.encoder.layers.3.self_attn.q_proj.bias] | |
| Loading weights: 16%|#6 | 65/398 [00:00<00:00, 5595.27it/s, Materializing param=text_model.encoder.layers.3.self_attn.q_proj.weight] | |
| Loading weights: 16%|#6 | 65/398 [00:00<00:00, 5568.65it/s, Materializing param=text_model.encoder.layers.3.self_attn.q_proj.weight] | |
| Loading weights: 17%|#6 | 66/398 [00:00<00:00, 5606.68it/s, Materializing param=text_model.encoder.layers.3.self_attn.v_proj.bias] | |
| Loading weights: 17%|#6 | 66/398 [00:00<00:00, 5583.94it/s, Materializing param=text_model.encoder.layers.3.self_attn.v_proj.bias] | |
| Loading weights: 17%|#6 | 67/398 [00:00<00:00, 5624.19it/s, Materializing param=text_model.encoder.layers.3.self_attn.v_proj.weight] | |
| Loading weights: 17%|#6 | 67/398 [00:00<00:00, 5601.88it/s, Materializing param=text_model.encoder.layers.3.self_attn.v_proj.weight] | |
| Loading weights: 17%|#7 | 68/398 [00:00<00:00, 5643.98it/s, Materializing param=text_model.encoder.layers.4.layer_norm1.bias] | |
| Loading weights: 17%|#7 | 68/398 [00:00<00:00, 5621.73it/s, Materializing param=text_model.encoder.layers.4.layer_norm1.bias] | |
| Loading weights: 17%|#7 | 69/398 [00:00<00:00, 5664.54it/s, Materializing param=text_model.encoder.layers.4.layer_norm1.weight] | |
| Loading weights: 17%|#7 | 69/398 [00:00<00:00, 5644.21it/s, Materializing param=text_model.encoder.layers.4.layer_norm1.weight] | |
| Loading weights: 18%|#7 | 70/398 [00:00<00:00, 5688.62it/s, Materializing param=text_model.encoder.layers.4.layer_norm2.bias] | |
| Loading weights: 18%|#7 | 70/398 [00:00<00:00, 5669.51it/s, Materializing param=text_model.encoder.layers.4.layer_norm2.bias] | |
| Loading weights: 18%|#7 | 71/398 [00:00<00:00, 5715.08it/s, Materializing param=text_model.encoder.layers.4.layer_norm2.weight] | |
| Loading weights: 18%|#7 | 71/398 [00:00<00:00, 5694.64it/s, Materializing param=text_model.encoder.layers.4.layer_norm2.weight] | |
| Loading weights: 18%|#8 | 72/398 [00:00<00:00, 5739.51it/s, Materializing param=text_model.encoder.layers.4.mlp.fc1.bias] | |
| Loading weights: 18%|#8 | 72/398 [00:00<00:00, 5720.48it/s, Materializing param=text_model.encoder.layers.4.mlp.fc1.bias] | |
| Loading weights: 18%|#8 | 73/398 [00:00<00:00, 5762.49it/s, Materializing param=text_model.encoder.layers.4.mlp.fc1.weight] | |
| Loading weights: 18%|#8 | 73/398 [00:00<00:00, 5742.82it/s, Materializing param=text_model.encoder.layers.4.mlp.fc1.weight] | |
| Loading weights: 19%|#8 | 74/398 [00:00<00:00, 5769.65it/s, Materializing param=text_model.encoder.layers.4.mlp.fc2.bias] | |
| Loading weights: 19%|#8 | 74/398 [00:00<00:00, 5739.67it/s, Materializing param=text_model.encoder.layers.4.mlp.fc2.bias] | |
| Loading weights: 19%|#8 | 75/398 [00:00<00:00, 5764.47it/s, Materializing param=text_model.encoder.layers.4.mlp.fc2.weight] | |
| Loading weights: 19%|#8 | 75/398 [00:00<00:00, 5737.55it/s, Materializing param=text_model.encoder.layers.4.mlp.fc2.weight] | |
| Loading weights: 19%|#9 | 76/398 [00:00<00:00, 5767.97it/s, Materializing param=text_model.encoder.layers.4.self_attn.k_proj.bias] | |
| Loading weights: 19%|#9 | 76/398 [00:00<00:00, 5747.07it/s, Materializing param=text_model.encoder.layers.4.self_attn.k_proj.bias] | |
| Loading weights: 19%|#9 | 77/398 [00:00<00:00, 5781.83it/s, Materializing param=text_model.encoder.layers.4.self_attn.k_proj.weight] | |
| Loading weights: 19%|#9 | 77/398 [00:00<00:00, 5756.07it/s, Materializing param=text_model.encoder.layers.4.self_attn.k_proj.weight] | |
| Loading weights: 20%|#9 | 78/398 [00:00<00:00, 5786.47it/s, Materializing param=text_model.encoder.layers.4.self_attn.out_proj.bias] | |
| Loading weights: 20%|#9 | 78/398 [00:00<00:00, 5767.40it/s, Materializing param=text_model.encoder.layers.4.self_attn.out_proj.bias] | |
| Loading weights: 20%|#9 | 79/398 [00:00<00:00, 5799.93it/s, Materializing param=text_model.encoder.layers.4.self_attn.out_proj.weight] | |
| Loading weights: 20%|#9 | 79/398 [00:00<00:00, 5776.57it/s, Materializing param=text_model.encoder.layers.4.self_attn.out_proj.weight] | |
| Loading weights: 20%|## | 80/398 [00:00<00:00, 5807.17it/s, Materializing param=text_model.encoder.layers.4.self_attn.q_proj.bias] | |
| Loading weights: 20%|## | 80/398 [00:00<00:00, 5787.94it/s, Materializing param=text_model.encoder.layers.4.self_attn.q_proj.bias] | |
| Loading weights: 20%|## | 81/398 [00:00<00:00, 5823.92it/s, Materializing param=text_model.encoder.layers.4.self_attn.q_proj.weight] | |
| Loading weights: 20%|## | 81/398 [00:00<00:00, 5803.63it/s, Materializing param=text_model.encoder.layers.4.self_attn.q_proj.weight] | |
| Loading weights: 21%|## | 82/398 [00:00<00:00, 5833.52it/s, Materializing param=text_model.encoder.layers.4.self_attn.v_proj.bias] | |
| Loading weights: 21%|## | 82/398 [00:00<00:00, 5808.89it/s, Materializing param=text_model.encoder.layers.4.self_attn.v_proj.bias] | |
| Loading weights: 21%|## | 83/398 [00:00<00:00, 5832.94it/s, Materializing param=text_model.encoder.layers.4.self_attn.v_proj.weight] | |
| Loading weights: 21%|## | 83/398 [00:00<00:00, 5808.03it/s, Materializing param=text_model.encoder.layers.4.self_attn.v_proj.weight] | |
| Loading weights: 21%|##1 | 84/398 [00:00<00:00, 5832.08it/s, Materializing param=text_model.encoder.layers.5.layer_norm1.bias] | |
| Loading weights: 21%|##1 | 84/398 [00:00<00:00, 5809.67it/s, Materializing param=text_model.encoder.layers.5.layer_norm1.bias] | |
| Loading weights: 21%|##1 | 85/398 [00:00<00:00, 5836.39it/s, Materializing param=text_model.encoder.layers.5.layer_norm1.weight] | |
| Loading weights: 21%|##1 | 85/398 [00:00<00:00, 5813.36it/s, Materializing param=text_model.encoder.layers.5.layer_norm1.weight] | |
| Loading weights: 22%|##1 | 86/398 [00:00<00:00, 5835.60it/s, Materializing param=text_model.encoder.layers.5.layer_norm2.bias] | |
| Loading weights: 22%|##1 | 86/398 [00:00<00:00, 5810.88it/s, Materializing param=text_model.encoder.layers.5.layer_norm2.bias] | |
| Loading weights: 22%|##1 | 87/398 [00:00<00:00, 5829.98it/s, Materializing param=text_model.encoder.layers.5.layer_norm2.weight] | |
| Loading weights: 22%|##1 | 87/398 [00:00<00:00, 5802.45it/s, Materializing param=text_model.encoder.layers.5.layer_norm2.weight] | |
| Loading weights: 22%|##2 | 88/398 [00:00<00:00, 5821.56it/s, Materializing param=text_model.encoder.layers.5.mlp.fc1.bias] | |
| Loading weights: 22%|##2 | 88/398 [00:00<00:00, 5796.15it/s, Materializing param=text_model.encoder.layers.5.mlp.fc1.bias] | |
| Loading weights: 22%|##2 | 89/398 [00:00<00:00, 5816.62it/s, Materializing param=text_model.encoder.layers.5.mlp.fc1.weight] | |
| Loading weights: 22%|##2 | 89/398 [00:00<00:00, 5795.67it/s, Materializing param=text_model.encoder.layers.5.mlp.fc1.weight] | |
| Loading weights: 23%|##2 | 90/398 [00:00<00:00, 5821.38it/s, Materializing param=text_model.encoder.layers.5.mlp.fc2.bias] | |
| Loading weights: 23%|##2 | 90/398 [00:00<00:00, 5801.70it/s, Materializing param=text_model.encoder.layers.5.mlp.fc2.bias] | |
| Loading weights: 23%|##2 | 91/398 [00:00<00:00, 5828.27it/s, Materializing param=text_model.encoder.layers.5.mlp.fc2.weight] | |
| Loading weights: 23%|##2 | 91/398 [00:00<00:00, 5809.29it/s, Materializing param=text_model.encoder.layers.5.mlp.fc2.weight] | |
| Loading weights: 23%|##3 | 92/398 [00:00<00:00, 5828.50it/s, Materializing param=text_model.encoder.layers.5.self_attn.k_proj.bias] | |
| Loading weights: 23%|##3 | 92/398 [00:00<00:00, 5807.89it/s, Materializing param=text_model.encoder.layers.5.self_attn.k_proj.bias] | |
| Loading weights: 23%|##3 | 93/398 [00:00<00:00, 5830.82it/s, Materializing param=text_model.encoder.layers.5.self_attn.k_proj.weight] | |
| Loading weights: 23%|##3 | 93/398 [00:00<00:00, 5809.54it/s, Materializing param=text_model.encoder.layers.5.self_attn.k_proj.weight] | |
| Loading weights: 24%|##3 | 94/398 [00:00<00:00, 5830.25it/s, Materializing param=text_model.encoder.layers.5.self_attn.out_proj.bias] | |
| Loading weights: 24%|##3 | 94/398 [00:00<00:00, 5809.88it/s, Materializing param=text_model.encoder.layers.5.self_attn.out_proj.bias] | |
| Loading weights: 24%|##3 | 95/398 [00:00<00:00, 5832.50it/s, Materializing param=text_model.encoder.layers.5.self_attn.out_proj.weight] | |
| Loading weights: 24%|##3 | 95/398 [00:00<00:00, 5812.59it/s, Materializing param=text_model.encoder.layers.5.self_attn.out_proj.weight] | |
| Loading weights: 24%|##4 | 96/398 [00:00<00:00, 5834.12it/s, Materializing param=text_model.encoder.layers.5.self_attn.q_proj.bias] | |
| Loading weights: 24%|##4 | 96/398 [00:00<00:00, 5813.73it/s, Materializing param=text_model.encoder.layers.5.self_attn.q_proj.bias] | |
| Loading weights: 24%|##4 | 97/398 [00:00<00:00, 5835.78it/s, Materializing param=text_model.encoder.layers.5.self_attn.q_proj.weight] | |
| Loading weights: 24%|##4 | 97/398 [00:00<00:00, 5815.76it/s, Materializing param=text_model.encoder.layers.5.self_attn.q_proj.weight] | |
| Loading weights: 25%|##4 | 98/398 [00:00<00:00, 5838.16it/s, Materializing param=text_model.encoder.layers.5.self_attn.v_proj.bias] | |
| Loading weights: 25%|##4 | 98/398 [00:00<00:00, 5812.08it/s, Materializing param=text_model.encoder.layers.5.self_attn.v_proj.bias] | |
| Loading weights: 25%|##4 | 99/398 [00:00<00:00, 5834.02it/s, Materializing param=text_model.encoder.layers.5.self_attn.v_proj.weight] | |
| Loading weights: 25%|##4 | 99/398 [00:00<00:00, 5814.49it/s, Materializing param=text_model.encoder.layers.5.self_attn.v_proj.weight] | |
| Loading weights: 25%|##5 | 100/398 [00:00<00:00, 5840.18it/s, Materializing param=text_model.encoder.layers.6.layer_norm1.bias] | |
| Loading weights: 25%|##5 | 100/398 [00:00<00:00, 5825.42it/s, Materializing param=text_model.encoder.layers.6.layer_norm1.bias] | |
| Loading weights: 25%|##5 | 101/398 [00:00<00:00, 5849.88it/s, Materializing param=text_model.encoder.layers.6.layer_norm1.weight] | |
| Loading weights: 25%|##5 | 101/398 [00:00<00:00, 5831.76it/s, Materializing param=text_model.encoder.layers.6.layer_norm1.weight] | |
| Loading weights: 26%|##5 | 102/398 [00:00<00:00, 5854.36it/s, Materializing param=text_model.encoder.layers.6.layer_norm2.bias] | |
| Loading weights: 26%|##5 | 102/398 [00:00<00:00, 5835.35it/s, Materializing param=text_model.encoder.layers.6.layer_norm2.bias] | |
| Loading weights: 26%|##5 | 103/398 [00:00<00:00, 5857.25it/s, Materializing param=text_model.encoder.layers.6.layer_norm2.weight] | |
| Loading weights: 26%|##5 | 103/398 [00:00<00:00, 5839.44it/s, Materializing param=text_model.encoder.layers.6.layer_norm2.weight] | |
| Loading weights: 26%|##6 | 104/398 [00:00<00:00, 5855.29it/s, Materializing param=text_model.encoder.layers.6.mlp.fc1.bias] | |
| Loading weights: 26%|##6 | 104/398 [00:00<00:00, 5836.33it/s, Materializing param=text_model.encoder.layers.6.mlp.fc1.bias] | |
| Loading weights: 26%|##6 | 105/398 [00:00<00:00, 5856.25it/s, Materializing param=text_model.encoder.layers.6.mlp.fc1.weight] | |
| Loading weights: 26%|##6 | 105/398 [00:00<00:00, 5838.09it/s, Materializing param=text_model.encoder.layers.6.mlp.fc1.weight] | |
| Loading weights: 27%|##6 | 106/398 [00:00<00:00, 5857.43it/s, Materializing param=text_model.encoder.layers.6.mlp.fc2.bias] | |
| Loading weights: 27%|##6 | 106/398 [00:00<00:00, 5841.11it/s, Materializing param=text_model.encoder.layers.6.mlp.fc2.bias] | |
| Loading weights: 27%|##6 | 107/398 [00:00<00:00, 5868.69it/s, Materializing param=text_model.encoder.layers.6.mlp.fc2.weight] | |
| Loading weights: 27%|##6 | 107/398 [00:00<00:00, 5855.44it/s, Materializing param=text_model.encoder.layers.6.mlp.fc2.weight] | |
| Loading weights: 27%|##7 | 108/398 [00:00<00:00, 5879.87it/s, Materializing param=text_model.encoder.layers.6.self_attn.k_proj.bias] | |
| Loading weights: 27%|##7 | 108/398 [00:00<00:00, 5863.58it/s, Materializing param=text_model.encoder.layers.6.self_attn.k_proj.bias] | |
| Loading weights: 27%|##7 | 109/398 [00:00<00:00, 5884.51it/s, Materializing param=text_model.encoder.layers.6.self_attn.k_proj.weight] | |
| Loading weights: 27%|##7 | 109/398 [00:00<00:00, 5867.06it/s, Materializing param=text_model.encoder.layers.6.self_attn.k_proj.weight] | |
| Loading weights: 28%|##7 | 110/398 [00:00<00:00, 5887.04it/s, Materializing param=text_model.encoder.layers.6.self_attn.out_proj.bias] | |
| Loading weights: 28%|##7 | 110/398 [00:00<00:00, 5867.50it/s, Materializing param=text_model.encoder.layers.6.self_attn.out_proj.bias] | |
| Loading weights: 28%|##7 | 111/398 [00:00<00:00, 5882.69it/s, Materializing param=text_model.encoder.layers.6.self_attn.out_proj.weight] | |
| Loading weights: 28%|##7 | 111/398 [00:00<00:00, 5865.72it/s, Materializing param=text_model.encoder.layers.6.self_attn.out_proj.weight] | |
| Loading weights: 28%|##8 | 112/398 [00:00<00:00, 5890.06it/s, Materializing param=text_model.encoder.layers.6.self_attn.q_proj.bias] | |
| Loading weights: 28%|##8 | 112/398 [00:00<00:00, 5876.07it/s, Materializing param=text_model.encoder.layers.6.self_attn.q_proj.bias] | |
| Loading weights: 28%|##8 | 113/398 [00:00<00:00, 5902.47it/s, Materializing param=text_model.encoder.layers.6.self_attn.q_proj.weight] | |
| Loading weights: 28%|##8 | 113/398 [00:00<00:00, 5889.19it/s, Materializing param=text_model.encoder.layers.6.self_attn.q_proj.weight] | |
| Loading weights: 29%|##8 | 114/398 [00:00<00:00, 5917.12it/s, Materializing param=text_model.encoder.layers.6.self_attn.v_proj.bias] | |
| Loading weights: 29%|##8 | 114/398 [00:00<00:00, 5904.41it/s, Materializing param=text_model.encoder.layers.6.self_attn.v_proj.bias] | |
| Loading weights: 29%|##8 | 115/398 [00:00<00:00, 5932.17it/s, Materializing param=text_model.encoder.layers.6.self_attn.v_proj.weight] | |
| Loading weights: 29%|##8 | 115/398 [00:00<00:00, 5919.29it/s, Materializing param=text_model.encoder.layers.6.self_attn.v_proj.weight] | |
| Loading weights: 29%|##9 | 116/398 [00:00<00:00, 5946.46it/s, Materializing param=text_model.encoder.layers.7.layer_norm1.bias] | |
| Loading weights: 29%|##9 | 116/398 [00:00<00:00, 5933.26it/s, Materializing param=text_model.encoder.layers.7.layer_norm1.bias] | |
| Loading weights: 29%|##9 | 117/398 [00:00<00:00, 5960.64it/s, Materializing param=text_model.encoder.layers.7.layer_norm1.weight] | |
| Loading weights: 29%|##9 | 117/398 [00:00<00:00, 5947.64it/s, Materializing param=text_model.encoder.layers.7.layer_norm1.weight] | |
| Loading weights: 30%|##9 | 118/398 [00:00<00:00, 5972.12it/s, Materializing param=text_model.encoder.layers.7.layer_norm2.bias] | |
| Loading weights: 30%|##9 | 118/398 [00:00<00:00, 5959.40it/s, Materializing param=text_model.encoder.layers.7.layer_norm2.bias] | |
| Loading weights: 30%|##9 | 119/398 [00:00<00:00, 5986.26it/s, Materializing param=text_model.encoder.layers.7.layer_norm2.weight] | |
| Loading weights: 30%|##9 | 119/398 [00:00<00:00, 5973.36it/s, Materializing param=text_model.encoder.layers.7.layer_norm2.weight] | |
| Loading weights: 30%|### | 120/398 [00:00<00:00, 6001.29it/s, Materializing param=text_model.encoder.layers.7.mlp.fc1.bias] | |
| Loading weights: 30%|### | 120/398 [00:00<00:00, 5989.15it/s, Materializing param=text_model.encoder.layers.7.mlp.fc1.bias] | |
| Loading weights: 30%|### | 121/398 [00:00<00:00, 6015.73it/s, Materializing param=text_model.encoder.layers.7.mlp.fc1.weight] | |
| Loading weights: 30%|### | 121/398 [00:00<00:00, 6003.56it/s, Materializing param=text_model.encoder.layers.7.mlp.fc1.weight] | |
| Loading weights: 31%|### | 122/398 [00:00<00:00, 6029.42it/s, Materializing param=text_model.encoder.layers.7.mlp.fc2.bias] | |
| Loading weights: 31%|### | 122/398 [00:00<00:00, 6016.73it/s, Materializing param=text_model.encoder.layers.7.mlp.fc2.bias] | |
| Loading weights: 31%|### | 123/398 [00:00<00:00, 6042.53it/s, Materializing param=text_model.encoder.layers.7.mlp.fc2.weight] | |
| Loading weights: 31%|### | 123/398 [00:00<00:00, 6030.38it/s, Materializing param=text_model.encoder.layers.7.mlp.fc2.weight] | |
| Loading weights: 31%|###1 | 124/398 [00:00<00:00, 6056.47it/s, Materializing param=text_model.encoder.layers.7.self_attn.k_proj.bias] | |
| Loading weights: 31%|###1 | 124/398 [00:00<00:00, 6043.24it/s, Materializing param=text_model.encoder.layers.7.self_attn.k_proj.bias] | |
| Loading weights: 31%|###1 | 125/398 [00:00<00:00, 6068.29it/s, Materializing param=text_model.encoder.layers.7.self_attn.k_proj.weight] | |
| Loading weights: 31%|###1 | 125/398 [00:00<00:00, 6055.74it/s, Materializing param=text_model.encoder.layers.7.self_attn.k_proj.weight] | |
| Loading weights: 32%|###1 | 126/398 [00:00<00:00, 6080.73it/s, Materializing param=text_model.encoder.layers.7.self_attn.out_proj.bias] | |
| Loading weights: 32%|###1 | 126/398 [00:00<00:00, 6068.23it/s, Materializing param=text_model.encoder.layers.7.self_attn.out_proj.bias] | |
| Loading weights: 32%|###1 | 127/398 [00:00<00:00, 6092.89it/s, Materializing param=text_model.encoder.layers.7.self_attn.out_proj.weight] | |
| Loading weights: 32%|###1 | 127/398 [00:00<00:00, 6079.60it/s, Materializing param=text_model.encoder.layers.7.self_attn.out_proj.weight] | |
| Loading weights: 32%|###2 | 128/398 [00:00<00:00, 6103.79it/s, Materializing param=text_model.encoder.layers.7.self_attn.q_proj.bias] | |
| Loading weights: 32%|###2 | 128/398 [00:00<00:00, 6091.46it/s, Materializing param=text_model.encoder.layers.7.self_attn.q_proj.bias] | |
| Loading weights: 32%|###2 | 129/398 [00:00<00:00, 6116.15it/s, Materializing param=text_model.encoder.layers.7.self_attn.q_proj.weight] | |
| Loading weights: 32%|###2 | 129/398 [00:00<00:00, 6103.39it/s, Materializing param=text_model.encoder.layers.7.self_attn.q_proj.weight] | |
| Loading weights: 33%|###2 | 130/398 [00:00<00:00, 6128.03it/s, Materializing param=text_model.encoder.layers.7.self_attn.v_proj.bias] | |
| Loading weights: 33%|###2 | 130/398 [00:00<00:00, 6115.24it/s, Materializing param=text_model.encoder.layers.7.self_attn.v_proj.bias] | |
| Loading weights: 33%|###2 | 131/398 [00:00<00:00, 6138.39it/s, Materializing param=text_model.encoder.layers.7.self_attn.v_proj.weight] | |
| Loading weights: 33%|###2 | 131/398 [00:00<00:00, 6125.32it/s, Materializing param=text_model.encoder.layers.7.self_attn.v_proj.weight] | |
| Loading weights: 33%|###3 | 132/398 [00:00<00:00, 6149.53it/s, Materializing param=text_model.encoder.layers.8.layer_norm1.bias] | |
| Loading weights: 33%|###3 | 132/398 [00:00<00:00, 6137.19it/s, Materializing param=text_model.encoder.layers.8.layer_norm1.bias] | |
| Loading weights: 33%|###3 | 133/398 [00:00<00:00, 6160.47it/s, Materializing param=text_model.encoder.layers.8.layer_norm1.weight] | |
| Loading weights: 33%|###3 | 133/398 [00:00<00:00, 6148.65it/s, Materializing param=text_model.encoder.layers.8.layer_norm1.weight] | |
| Loading weights: 34%|###3 | 134/398 [00:00<00:00, 6170.67it/s, Materializing param=text_model.encoder.layers.8.layer_norm2.bias] | |
| Loading weights: 34%|###3 | 134/398 [00:00<00:00, 6158.77it/s, Materializing param=text_model.encoder.layers.8.layer_norm2.bias] | |
| Loading weights: 34%|###3 | 135/398 [00:00<00:00, 6180.95it/s, Materializing param=text_model.encoder.layers.8.layer_norm2.weight] | |
| Loading weights: 34%|###3 | 135/398 [00:00<00:00, 6157.97it/s, Materializing param=text_model.encoder.layers.8.layer_norm2.weight] | |
| Loading weights: 34%|###4 | 136/398 [00:00<00:00, 6182.60it/s, Materializing param=text_model.encoder.layers.8.mlp.fc1.bias] | |
| Loading weights: 34%|###4 | 136/398 [00:00<00:00, 6171.10it/s, Materializing param=text_model.encoder.layers.8.mlp.fc1.bias] | |
| Loading weights: 34%|###4 | 137/398 [00:00<00:00, 6194.89it/s, Materializing param=text_model.encoder.layers.8.mlp.fc1.weight] | |
| Loading weights: 34%|###4 | 137/398 [00:00<00:00, 6184.03it/s, Materializing param=text_model.encoder.layers.8.mlp.fc1.weight] | |
| Loading weights: 35%|###4 | 138/398 [00:00<00:00, 6207.59it/s, Materializing param=text_model.encoder.layers.8.mlp.fc2.bias] | |
| Loading weights: 35%|###4 | 138/398 [00:00<00:00, 6195.96it/s, Materializing param=text_model.encoder.layers.8.mlp.fc2.bias] | |
| Loading weights: 35%|###4 | 139/398 [00:00<00:00, 6219.62it/s, Materializing param=text_model.encoder.layers.8.mlp.fc2.weight] | |
| Loading weights: 35%|###4 | 139/398 [00:00<00:00, 6208.82it/s, Materializing param=text_model.encoder.layers.8.mlp.fc2.weight] | |
| Loading weights: 35%|###5 | 140/398 [00:00<00:00, 6233.11it/s, Materializing param=text_model.encoder.layers.8.self_attn.k_proj.bias] | |
| Loading weights: 35%|###5 | 140/398 [00:00<00:00, 6221.68it/s, Materializing param=text_model.encoder.layers.8.self_attn.k_proj.bias] | |
| Loading weights: 35%|###5 | 141/398 [00:00<00:00, 6245.08it/s, Materializing param=text_model.encoder.layers.8.self_attn.k_proj.weight] | |
| Loading weights: 35%|###5 | 141/398 [00:00<00:00, 6232.91it/s, Materializing param=text_model.encoder.layers.8.self_attn.k_proj.weight] | |
| Loading weights: 36%|###5 | 142/398 [00:00<00:00, 6255.68it/s, Materializing param=text_model.encoder.layers.8.self_attn.out_proj.bias] | |
| Loading weights: 36%|###5 | 142/398 [00:00<00:00, 6244.21it/s, Materializing param=text_model.encoder.layers.8.self_attn.out_proj.bias] | |
| Loading weights: 36%|###5 | 143/398 [00:00<00:00, 6267.22it/s, Materializing param=text_model.encoder.layers.8.self_attn.out_proj.weight] | |
| Loading weights: 36%|###5 | 143/398 [00:00<00:00, 6242.76it/s, Materializing param=text_model.encoder.layers.8.self_attn.out_proj.weight] | |
| Loading weights: 36%|###6 | 144/398 [00:00<00:00, 6239.59it/s, Materializing param=text_model.encoder.layers.8.self_attn.q_proj.bias] | |
| Loading weights: 36%|###6 | 144/398 [00:00<00:00, 6224.48it/s, Materializing param=text_model.encoder.layers.8.self_attn.q_proj.bias] | |
| Loading weights: 36%|###6 | 145/398 [00:00<00:00, 6243.06it/s, Materializing param=text_model.encoder.layers.8.self_attn.q_proj.weight] | |
| Loading weights: 36%|###6 | 145/398 [00:00<00:00, 6230.91it/s, Materializing param=text_model.encoder.layers.8.self_attn.q_proj.weight] | |
| Loading weights: 37%|###6 | 146/398 [00:00<00:00, 6251.91it/s, Materializing param=text_model.encoder.layers.8.self_attn.v_proj.bias] | |
| Loading weights: 37%|###6 | 146/398 [00:00<00:00, 6240.19it/s, Materializing param=text_model.encoder.layers.8.self_attn.v_proj.bias] | |
| Loading weights: 37%|###6 | 147/398 [00:00<00:00, 6261.55it/s, Materializing param=text_model.encoder.layers.8.self_attn.v_proj.weight] | |
| Loading weights: 37%|###6 | 147/398 [00:00<00:00, 6249.75it/s, Materializing param=text_model.encoder.layers.8.self_attn.v_proj.weight] | |
| Loading weights: 37%|###7 | 148/398 [00:00<00:00, 6271.16it/s, Materializing param=text_model.encoder.layers.9.layer_norm1.bias] | |
| Loading weights: 37%|###7 | 148/398 [00:00<00:00, 6259.52it/s, Materializing param=text_model.encoder.layers.9.layer_norm1.bias] | |
| Loading weights: 37%|###7 | 149/398 [00:00<00:00, 6280.41it/s, Materializing param=text_model.encoder.layers.9.layer_norm1.weight] | |
| Loading weights: 37%|###7 | 149/398 [00:00<00:00, 6268.88it/s, Materializing param=text_model.encoder.layers.9.layer_norm1.weight] | |
| Loading weights: 38%|###7 | 150/398 [00:00<00:00, 6289.82it/s, Materializing param=text_model.encoder.layers.9.layer_norm2.bias] | |
| Loading weights: 38%|###7 | 150/398 [00:00<00:00, 6278.15it/s, Materializing param=text_model.encoder.layers.9.layer_norm2.bias] | |
| Loading weights: 38%|###7 | 151/398 [00:00<00:00, 6299.26it/s, Materializing param=text_model.encoder.layers.9.layer_norm2.weight] | |
| Loading weights: 38%|###7 | 151/398 [00:00<00:00, 6277.16it/s, Materializing param=text_model.encoder.layers.9.layer_norm2.weight] | |
| Loading weights: 38%|###8 | 152/398 [00:00<00:00, 6274.94it/s, Materializing param=text_model.encoder.layers.9.mlp.fc1.bias] | |
| Loading weights: 38%|###8 | 152/398 [00:00<00:00, 6257.64it/s, Materializing param=text_model.encoder.layers.9.mlp.fc1.bias] | |
| Loading weights: 38%|###8 | 153/398 [00:00<00:00, 6265.41it/s, Materializing param=text_model.encoder.layers.9.mlp.fc1.weight] | |
| Loading weights: 38%|###8 | 153/398 [00:00<00:00, 6248.09it/s, Materializing param=text_model.encoder.layers.9.mlp.fc1.weight] | |
| Loading weights: 39%|###8 | 154/398 [00:00<00:00, 6259.73it/s, Materializing param=text_model.encoder.layers.9.mlp.fc2.bias] | |
| Loading weights: 39%|###8 | 154/398 [00:00<00:00, 6244.18it/s, Materializing param=text_model.encoder.layers.9.mlp.fc2.bias] | |
| Loading weights: 39%|###8 | 155/398 [00:00<00:00, 6256.96it/s, Materializing param=text_model.encoder.layers.9.mlp.fc2.weight] | |
| Loading weights: 39%|###8 | 155/398 [00:00<00:00, 6242.54it/s, Materializing param=text_model.encoder.layers.9.mlp.fc2.weight] | |
| Loading weights: 39%|###9 | 156/398 [00:00<00:00, 6255.79it/s, Materializing param=text_model.encoder.layers.9.self_attn.k_proj.bias] | |
| Loading weights: 39%|###9 | 156/398 [00:00<00:00, 6241.05it/s, Materializing param=text_model.encoder.layers.9.self_attn.k_proj.bias] | |
| Loading weights: 39%|###9 | 157/398 [00:00<00:00, 6252.96it/s, Materializing param=text_model.encoder.layers.9.self_attn.k_proj.weight] | |
| Loading weights: 39%|###9 | 157/398 [00:00<00:00, 6237.21it/s, Materializing param=text_model.encoder.layers.9.self_attn.k_proj.weight] | |
| Loading weights: 40%|###9 | 158/398 [00:00<00:00, 6249.47it/s, Materializing param=text_model.encoder.layers.9.self_attn.out_proj.bias] | |
| Loading weights: 40%|###9 | 158/398 [00:00<00:00, 6235.12it/s, Materializing param=text_model.encoder.layers.9.self_attn.out_proj.bias] | |
| Loading weights: 40%|###9 | 159/398 [00:00<00:00, 6246.20it/s, Materializing param=text_model.encoder.layers.9.self_attn.out_proj.weight] | |
| Loading weights: 40%|###9 | 159/398 [00:00<00:00, 6230.68it/s, Materializing param=text_model.encoder.layers.9.self_attn.out_proj.weight] | |
| Loading weights: 40%|#### | 160/398 [00:00<00:00, 6243.15it/s, Materializing param=text_model.encoder.layers.9.self_attn.q_proj.bias] | |
| Loading weights: 40%|#### | 160/398 [00:00<00:00, 6229.59it/s, Materializing param=text_model.encoder.layers.9.self_attn.q_proj.bias] | |
| Loading weights: 40%|#### | 161/398 [00:00<00:00, 6239.85it/s, Materializing param=text_model.encoder.layers.9.self_attn.q_proj.weight] | |
| Loading weights: 40%|#### | 161/398 [00:00<00:00, 6226.16it/s, Materializing param=text_model.encoder.layers.9.self_attn.q_proj.weight] | |
| Loading weights: 41%|#### | 162/398 [00:00<00:00, 6239.92it/s, Materializing param=text_model.encoder.layers.9.self_attn.v_proj.bias] | |
| Loading weights: 41%|#### | 162/398 [00:00<00:00, 6226.31it/s, Materializing param=text_model.encoder.layers.9.self_attn.v_proj.bias] | |
| Loading weights: 41%|#### | 163/398 [00:00<00:00, 6236.57it/s, Materializing param=text_model.encoder.layers.9.self_attn.v_proj.weight] | |
| Loading weights: 41%|#### | 163/398 [00:00<00:00, 6221.76it/s, Materializing param=text_model.encoder.layers.9.self_attn.v_proj.weight] | |
| Loading weights: 41%|####1 | 164/398 [00:00<00:00, 6233.55it/s, Materializing param=text_model.encoder.layers.10.layer_norm1.bias] | |
| Loading weights: 41%|####1 | 164/398 [00:00<00:00, 6219.57it/s, Materializing param=text_model.encoder.layers.10.layer_norm1.bias] | |
| Loading weights: 41%|####1 | 165/398 [00:00<00:00, 6232.25it/s, Materializing param=text_model.encoder.layers.10.layer_norm1.weight] | |
| Loading weights: 41%|####1 | 165/398 [00:00<00:00, 6217.97it/s, Materializing param=text_model.encoder.layers.10.layer_norm1.weight] | |
| Loading weights: 42%|####1 | 166/398 [00:00<00:00, 6230.63it/s, Materializing param=text_model.encoder.layers.10.layer_norm2.bias] | |
| Loading weights: 42%|####1 | 166/398 [00:00<00:00, 6216.17it/s, Materializing param=text_model.encoder.layers.10.layer_norm2.bias] | |
| Loading weights: 42%|####1 | 167/398 [00:00<00:00, 6228.59it/s, Materializing param=text_model.encoder.layers.10.layer_norm2.weight] | |
| Loading weights: 42%|####1 | 167/398 [00:00<00:00, 6214.67it/s, Materializing param=text_model.encoder.layers.10.layer_norm2.weight] | |
| Loading weights: 42%|####2 | 168/398 [00:00<00:00, 6227.24it/s, Materializing param=text_model.encoder.layers.10.mlp.fc1.bias] | |
| Loading weights: 42%|####2 | 168/398 [00:00<00:00, 6213.56it/s, Materializing param=text_model.encoder.layers.10.mlp.fc1.bias] | |
| Loading weights: 42%|####2 | 169/398 [00:00<00:00, 6225.41it/s, Materializing param=text_model.encoder.layers.10.mlp.fc1.weight] | |
| Loading weights: 42%|####2 | 169/398 [00:00<00:00, 6211.82it/s, Materializing param=text_model.encoder.layers.10.mlp.fc1.weight] | |
| Loading weights: 43%|####2 | 170/398 [00:00<00:00, 6229.42it/s, Materializing param=text_model.encoder.layers.10.mlp.fc2.bias] | |
| Loading weights: 43%|####2 | 170/398 [00:00<00:00, 6219.47it/s, Materializing param=text_model.encoder.layers.10.mlp.fc2.bias] | |
| Loading weights: 43%|####2 | 171/398 [00:00<00:00, 6237.67it/s, Materializing param=text_model.encoder.layers.10.mlp.fc2.weight] | |
| Loading weights: 43%|####2 | 171/398 [00:00<00:00, 6227.38it/s, Materializing param=text_model.encoder.layers.10.mlp.fc2.weight] | |
| Loading weights: 43%|####3 | 172/398 [00:00<00:00, 6217.96it/s, Materializing param=text_model.encoder.layers.10.self_attn.k_proj.bias] | |
| Loading weights: 43%|####3 | 172/398 [00:00<00:00, 6194.58it/s, Materializing param=text_model.encoder.layers.10.self_attn.k_proj.bias] | |
| Loading weights: 43%|####3 | 173/398 [00:00<00:00, 6199.13it/s, Materializing param=text_model.encoder.layers.10.self_attn.k_proj.weight] | |
| Loading weights: 43%|####3 | 173/398 [00:00<00:00, 6187.71it/s, Materializing param=text_model.encoder.layers.10.self_attn.k_proj.weight] | |
| Loading weights: 44%|####3 | 174/398 [00:00<00:00, 6193.32it/s, Materializing param=text_model.encoder.layers.10.self_attn.out_proj.bias] | |
| Loading weights: 44%|####3 | 174/398 [00:00<00:00, 6182.15it/s, Materializing param=text_model.encoder.layers.10.self_attn.out_proj.bias] | |
| Loading weights: 44%|####3 | 175/398 [00:00<00:00, 6195.32it/s, Materializing param=text_model.encoder.layers.10.self_attn.out_proj.weight] | |
| Loading weights: 44%|####3 | 175/398 [00:00<00:00, 6184.83it/s, Materializing param=text_model.encoder.layers.10.self_attn.out_proj.weight] | |
| Loading weights: 44%|####4 | 176/398 [00:00<00:00, 6201.15it/s, Materializing param=text_model.encoder.layers.10.self_attn.q_proj.bias] | |
| Loading weights: 44%|####4 | 176/398 [00:00<00:00, 6190.39it/s, Materializing param=text_model.encoder.layers.10.self_attn.q_proj.bias] | |
| Loading weights: 44%|####4 | 177/398 [00:00<00:00, 6185.10it/s, Materializing param=text_model.encoder.layers.10.self_attn.q_proj.weight] | |
| Loading weights: 44%|####4 | 177/398 [00:00<00:00, 6171.73it/s, Materializing param=text_model.encoder.layers.10.self_attn.q_proj.weight] | |
| Loading weights: 45%|####4 | 178/398 [00:00<00:00, 6185.78it/s, Materializing param=text_model.encoder.layers.10.self_attn.v_proj.bias] | |
| Loading weights: 45%|####4 | 178/398 [00:00<00:00, 6175.19it/s, Materializing param=text_model.encoder.layers.10.self_attn.v_proj.bias] | |
| Loading weights: 45%|####4 | 179/398 [00:00<00:00, 6191.14it/s, Materializing param=text_model.encoder.layers.10.self_attn.v_proj.weight] | |
| Loading weights: 45%|####4 | 179/398 [00:00<00:00, 6181.70it/s, Materializing param=text_model.encoder.layers.10.self_attn.v_proj.weight] | |
| Loading weights: 45%|####5 | 180/398 [00:00<00:00, 6198.63it/s, Materializing param=text_model.encoder.layers.11.layer_norm1.bias] | |
| Loading weights: 45%|####5 | 180/398 [00:00<00:00, 6189.33it/s, Materializing param=text_model.encoder.layers.11.layer_norm1.bias] | |
| Loading weights: 45%|####5 | 181/398 [00:00<00:00, 6204.59it/s, Materializing param=text_model.encoder.layers.11.layer_norm1.weight] | |
| Loading weights: 45%|####5 | 181/398 [00:00<00:00, 6192.90it/s, Materializing param=text_model.encoder.layers.11.layer_norm1.weight] | |
| Loading weights: 46%|####5 | 182/398 [00:00<00:00, 6203.68it/s, Materializing param=text_model.encoder.layers.11.layer_norm2.bias] | |
| Loading weights: 46%|####5 | 182/398 [00:00<00:00, 6190.70it/s, Materializing param=text_model.encoder.layers.11.layer_norm2.bias] | |
| Loading weights: 46%|####5 | 183/398 [00:00<00:00, 6203.09it/s, Materializing param=text_model.encoder.layers.11.layer_norm2.weight] | |
| Loading weights: 46%|####5 | 183/398 [00:00<00:00, 6190.88it/s, Materializing param=text_model.encoder.layers.11.layer_norm2.weight] | |
| Loading weights: 46%|####6 | 184/398 [00:00<00:00, 6203.69it/s, Materializing param=text_model.encoder.layers.11.mlp.fc1.bias] | |
| Loading weights: 46%|####6 | 184/398 [00:00<00:00, 6191.90it/s, Materializing param=text_model.encoder.layers.11.mlp.fc1.bias] | |
| Loading weights: 46%|####6 | 185/398 [00:00<00:00, 6202.46it/s, Materializing param=text_model.encoder.layers.11.mlp.fc1.weight] | |
| Loading weights: 46%|####6 | 185/398 [00:00<00:00, 6191.13it/s, Materializing param=text_model.encoder.layers.11.mlp.fc1.weight] | |
| Loading weights: 47%|####6 | 186/398 [00:00<00:00, 6200.79it/s, Materializing param=text_model.encoder.layers.11.mlp.fc2.bias] | |
| Loading weights: 47%|####6 | 186/398 [00:00<00:00, 6188.69it/s, Materializing param=text_model.encoder.layers.11.mlp.fc2.bias] | |
| Loading weights: 47%|####6 | 187/398 [00:00<00:00, 6199.34it/s, Materializing param=text_model.encoder.layers.11.mlp.fc2.weight] | |
| Loading weights: 47%|####6 | 187/398 [00:00<00:00, 6187.02it/s, Materializing param=text_model.encoder.layers.11.mlp.fc2.weight] | |
| Loading weights: 47%|####7 | 188/398 [00:00<00:00, 6199.13it/s, Materializing param=text_model.encoder.layers.11.self_attn.k_proj.bias] | |
| Loading weights: 47%|####7 | 188/398 [00:00<00:00, 6187.79it/s, Materializing param=text_model.encoder.layers.11.self_attn.k_proj.bias] | |
| Loading weights: 47%|####7 | 189/398 [00:00<00:00, 6199.79it/s, Materializing param=text_model.encoder.layers.11.self_attn.k_proj.weight] | |
| Loading weights: 47%|####7 | 189/398 [00:00<00:00, 6186.14it/s, Materializing param=text_model.encoder.layers.11.self_attn.k_proj.weight] | |
| Loading weights: 48%|####7 | 190/398 [00:00<00:00, 6193.74it/s, Materializing param=text_model.encoder.layers.11.self_attn.out_proj.bias] | |
| Loading weights: 48%|####7 | 190/398 [00:00<00:00, 6181.97it/s, Materializing param=text_model.encoder.layers.11.self_attn.out_proj.bias] | |
| Loading weights: 48%|####7 | 191/398 [00:00<00:00, 6193.46it/s, Materializing param=text_model.encoder.layers.11.self_attn.out_proj.weight] | |
| Loading weights: 48%|####7 | 191/398 [00:00<00:00, 6181.18it/s, Materializing param=text_model.encoder.layers.11.self_attn.out_proj.weight] | |
| Loading weights: 48%|####8 | 192/398 [00:00<00:00, 6190.47it/s, Materializing param=text_model.encoder.layers.11.self_attn.q_proj.bias] | |
| Loading weights: 48%|####8 | 192/398 [00:00<00:00, 6178.93it/s, Materializing param=text_model.encoder.layers.11.self_attn.q_proj.bias] | |
| Loading weights: 48%|####8 | 193/398 [00:00<00:00, 6190.64it/s, Materializing param=text_model.encoder.layers.11.self_attn.q_proj.weight] | |
| Loading weights: 48%|####8 | 193/398 [00:00<00:00, 6177.56it/s, Materializing param=text_model.encoder.layers.11.self_attn.q_proj.weight] | |
| Loading weights: 49%|####8 | 194/398 [00:00<00:00, 6187.75it/s, Materializing param=text_model.encoder.layers.11.self_attn.v_proj.bias] | |
| Loading weights: 49%|####8 | 194/398 [00:00<00:00, 6176.15it/s, Materializing param=text_model.encoder.layers.11.self_attn.v_proj.bias] | |
| Loading weights: 49%|####8 | 195/398 [00:00<00:00, 6186.76it/s, Materializing param=text_model.encoder.layers.11.self_attn.v_proj.weight] | |
| Loading weights: 49%|####8 | 195/398 [00:00<00:00, 6175.64it/s, Materializing param=text_model.encoder.layers.11.self_attn.v_proj.weight] | |
| Loading weights: 49%|####9 | 196/398 [00:00<00:00, 6186.43it/s, Materializing param=text_model.final_layer_norm.bias] | |
| Loading weights: 49%|####9 | 196/398 [00:00<00:00, 6175.93it/s, Materializing param=text_model.final_layer_norm.bias] | |
| Loading weights: 49%|####9 | 197/398 [00:00<00:00, 6188.00it/s, Materializing param=text_model.final_layer_norm.weight] | |
| Loading weights: 49%|####9 | 197/398 [00:00<00:00, 6177.41it/s, Materializing param=text_model.final_layer_norm.weight] | |
| Loading weights: 50%|####9 | 198/398 [00:00<00:00, 6189.52it/s, Materializing param=text_projection.weight] | |
| Loading weights: 50%|####9 | 198/398 [00:00<00:00, 6178.88it/s, Materializing param=text_projection.weight] | |
| Loading weights: 50%|##### | 199/398 [00:00<00:00, 6192.58it/s, Materializing param=vision_model.embeddings.class_embedding] | |
| Loading weights: 50%|##### | 199/398 [00:00<00:00, 6183.36it/s, Materializing param=vision_model.embeddings.class_embedding] | |
| Loading weights: 50%|##### | 200/398 [00:00<00:00, 6197.07it/s, Materializing param=vision_model.embeddings.patch_embedding.weight] | |
| Loading weights: 50%|##### | 200/398 [00:00<00:00, 6187.98it/s, Materializing param=vision_model.embeddings.patch_embedding.weight] | |
| Loading weights: 51%|##### | 201/398 [00:00<00:00, 6201.44it/s, Materializing param=vision_model.embeddings.position_embedding.weight] | |
| Loading weights: 51%|##### | 201/398 [00:00<00:00, 6186.11it/s, Materializing param=vision_model.embeddings.position_embedding.weight] | |
| Loading weights: 51%|##### | 202/398 [00:00<00:00, 6195.47it/s, Materializing param=vision_model.encoder.layers.0.layer_norm1.bias] | |
| Loading weights: 51%|##### | 202/398 [00:00<00:00, 6184.89it/s, Materializing param=vision_model.encoder.layers.0.layer_norm1.bias] | |
| Loading weights: 51%|#####1 | 203/398 [00:00<00:00, 6195.29it/s, Materializing param=vision_model.encoder.layers.0.layer_norm1.weight] | |
| Loading weights: 51%|#####1 | 203/398 [00:00<00:00, 6184.40it/s, Materializing param=vision_model.encoder.layers.0.layer_norm1.weight] | |
| Loading weights: 51%|#####1 | 204/398 [00:00<00:00, 6193.81it/s, Materializing param=vision_model.encoder.layers.0.layer_norm2.bias] | |
| Loading weights: 51%|#####1 | 204/398 [00:00<00:00, 6183.12it/s, Materializing param=vision_model.encoder.layers.0.layer_norm2.bias] | |
| Loading weights: 52%|#####1 | 205/398 [00:00<00:00, 6191.81it/s, Materializing param=vision_model.encoder.layers.0.layer_norm2.weight] | |
| Loading weights: 52%|#####1 | 205/398 [00:00<00:00, 6180.55it/s, Materializing param=vision_model.encoder.layers.0.layer_norm2.weight] | |
| Loading weights: 52%|#####1 | 206/398 [00:00<00:00, 6190.50it/s, Materializing param=vision_model.encoder.layers.0.mlp.fc1.bias] | |
| Loading weights: 52%|#####1 | 206/398 [00:00<00:00, 6179.39it/s, Materializing param=vision_model.encoder.layers.0.mlp.fc1.bias] | |
| Loading weights: 52%|#####2 | 207/398 [00:00<00:00, 6188.27it/s, Materializing param=vision_model.encoder.layers.0.mlp.fc1.weight] | |
| Loading weights: 52%|#####2 | 207/398 [00:00<00:00, 6177.79it/s, Materializing param=vision_model.encoder.layers.0.mlp.fc1.weight] | |
| Loading weights: 52%|#####2 | 208/398 [00:00<00:00, 6189.36it/s, Materializing param=vision_model.encoder.layers.0.mlp.fc2.bias] | |
| Loading weights: 52%|#####2 | 208/398 [00:00<00:00, 6179.41it/s, Materializing param=vision_model.encoder.layers.0.mlp.fc2.bias] | |
| Loading weights: 53%|#####2 | 209/398 [00:00<00:00, 6191.01it/s, Materializing param=vision_model.encoder.layers.0.mlp.fc2.weight] | |
| Loading weights: 53%|#####2 | 209/398 [00:00<00:00, 6180.31it/s, Materializing param=vision_model.encoder.layers.0.mlp.fc2.weight] | |
| Loading weights: 53%|#####2 | 210/398 [00:00<00:00, 6190.42it/s, Materializing param=vision_model.encoder.layers.0.self_attn.k_proj.bias] | |
| Loading weights: 53%|#####2 | 210/398 [00:00<00:00, 6179.65it/s, Materializing param=vision_model.encoder.layers.0.self_attn.k_proj.bias] | |
| Loading weights: 53%|#####3 | 211/398 [00:00<00:00, 6188.28it/s, Materializing param=vision_model.encoder.layers.0.self_attn.k_proj.weight] | |
| Loading weights: 53%|#####3 | 211/398 [00:00<00:00, 6173.73it/s, Materializing param=vision_model.encoder.layers.0.self_attn.k_proj.weight] | |
| Loading weights: 53%|#####3 | 212/398 [00:00<00:00, 6164.93it/s, Materializing param=vision_model.encoder.layers.0.self_attn.out_proj.bias] | |
| Loading weights: 53%|#####3 | 212/398 [00:00<00:00, 6151.45it/s, Materializing param=vision_model.encoder.layers.0.self_attn.out_proj.bias] | |
| Loading weights: 54%|#####3 | 213/398 [00:00<00:00, 6157.04it/s, Materializing param=vision_model.encoder.layers.0.self_attn.out_proj.weight] | |
| Loading weights: 54%|#####3 | 213/398 [00:00<00:00, 6144.80it/s, Materializing param=vision_model.encoder.layers.0.self_attn.out_proj.weight] | |
| Loading weights: 54%|#####3 | 214/398 [00:00<00:00, 6151.27it/s, Materializing param=vision_model.encoder.layers.0.self_attn.q_proj.bias] | |
| Loading weights: 54%|#####3 | 214/398 [00:00<00:00, 6140.75it/s, Materializing param=vision_model.encoder.layers.0.self_attn.q_proj.bias] | |
| Loading weights: 54%|#####4 | 215/398 [00:00<00:00, 6152.78it/s, Materializing param=vision_model.encoder.layers.0.self_attn.q_proj.weight] | |
| Loading weights: 54%|#####4 | 215/398 [00:00<00:00, 6144.56it/s, Materializing param=vision_model.encoder.layers.0.self_attn.q_proj.weight] | |
| Loading weights: 54%|#####4 | 216/398 [00:00<00:00, 6158.58it/s, Materializing param=vision_model.encoder.layers.0.self_attn.v_proj.bias] | |
| Loading weights: 54%|#####4 | 216/398 [00:00<00:00, 6150.97it/s, Materializing param=vision_model.encoder.layers.0.self_attn.v_proj.bias] | |
| Loading weights: 55%|#####4 | 217/398 [00:00<00:00, 6165.46it/s, Materializing param=vision_model.encoder.layers.0.self_attn.v_proj.weight] | |
| Loading weights: 55%|#####4 | 217/398 [00:00<00:00, 6157.87it/s, Materializing param=vision_model.encoder.layers.0.self_attn.v_proj.weight] | |
| Loading weights: 55%|#####4 | 218/398 [00:00<00:00, 6172.26it/s, Materializing param=vision_model.encoder.layers.1.layer_norm1.bias] | |
| Loading weights: 55%|#####4 | 218/398 [00:00<00:00, 6164.77it/s, Materializing param=vision_model.encoder.layers.1.layer_norm1.bias] | |
| Loading weights: 55%|#####5 | 219/398 [00:00<00:00, 6178.96it/s, Materializing param=vision_model.encoder.layers.1.layer_norm1.weight] | |
| Loading weights: 55%|#####5 | 219/398 [00:00<00:00, 6171.45it/s, Materializing param=vision_model.encoder.layers.1.layer_norm1.weight] | |
| Loading weights: 55%|#####5 | 220/398 [00:00<00:00, 6186.16it/s, Materializing param=vision_model.encoder.layers.1.layer_norm2.bias] | |
| Loading weights: 55%|#####5 | 220/398 [00:00<00:00, 6178.75it/s, Materializing param=vision_model.encoder.layers.1.layer_norm2.bias] | |
| Loading weights: 56%|#####5 | 221/398 [00:00<00:00, 6192.94it/s, Materializing param=vision_model.encoder.layers.1.layer_norm2.weight] | |
| Loading weights: 56%|#####5 | 221/398 [00:00<00:00, 6184.31it/s, Materializing param=vision_model.encoder.layers.1.layer_norm2.weight] | |
| Loading weights: 56%|#####5 | 222/398 [00:00<00:00, 6198.73it/s, Materializing param=vision_model.encoder.layers.1.mlp.fc1.bias] | |
| Loading weights: 56%|#####5 | 222/398 [00:00<00:00, 6191.31it/s, Materializing param=vision_model.encoder.layers.1.mlp.fc1.bias] | |
| Loading weights: 56%|#####6 | 223/398 [00:00<00:00, 6204.63it/s, Materializing param=vision_model.encoder.layers.1.mlp.fc1.weight] | |
| Loading weights: 56%|#####6 | 223/398 [00:00<00:00, 6196.86it/s, Materializing param=vision_model.encoder.layers.1.mlp.fc1.weight] | |
| Loading weights: 56%|#####6 | 224/398 [00:00<00:00, 6210.83it/s, Materializing param=vision_model.encoder.layers.1.mlp.fc2.bias] | |
| Loading weights: 56%|#####6 | 224/398 [00:00<00:00, 6203.40it/s, Materializing param=vision_model.encoder.layers.1.mlp.fc2.bias] | |
| Loading weights: 57%|#####6 | 225/398 [00:00<00:00, 6216.49it/s, Materializing param=vision_model.encoder.layers.1.mlp.fc2.weight] | |
| Loading weights: 57%|#####6 | 225/398 [00:00<00:00, 6209.08it/s, Materializing param=vision_model.encoder.layers.1.mlp.fc2.weight] | |
| Loading weights: 57%|#####6 | 226/398 [00:00<00:00, 6223.00it/s, Materializing param=vision_model.encoder.layers.1.self_attn.k_proj.bias] | |
| Loading weights: 57%|#####6 | 226/398 [00:00<00:00, 6215.49it/s, Materializing param=vision_model.encoder.layers.1.self_attn.k_proj.bias] | |
| Loading weights: 57%|#####7 | 227/398 [00:00<00:00, 6228.34it/s, Materializing param=vision_model.encoder.layers.1.self_attn.k_proj.weight] | |
| Loading weights: 57%|#####7 | 227/398 [00:00<00:00, 6220.97it/s, Materializing param=vision_model.encoder.layers.1.self_attn.k_proj.weight] | |
| Loading weights: 57%|#####7 | 228/398 [00:00<00:00, 6234.93it/s, Materializing param=vision_model.encoder.layers.1.self_attn.out_proj.bias] | |
| Loading weights: 57%|#####7 | 228/398 [00:00<00:00, 6227.62it/s, Materializing param=vision_model.encoder.layers.1.self_attn.out_proj.bias] | |
| Loading weights: 58%|#####7 | 229/398 [00:00<00:00, 6240.59it/s, Materializing param=vision_model.encoder.layers.1.self_attn.out_proj.weight] | |
| Loading weights: 58%|#####7 | 229/398 [00:00<00:00, 6233.42it/s, Materializing param=vision_model.encoder.layers.1.self_attn.out_proj.weight] | |
| Loading weights: 58%|#####7 | 230/398 [00:00<00:00, 6247.02it/s, Materializing param=vision_model.encoder.layers.1.self_attn.q_proj.bias] | |
| Loading weights: 58%|#####7 | 230/398 [00:00<00:00, 6239.55it/s, Materializing param=vision_model.encoder.layers.1.self_attn.q_proj.bias] | |
| Loading weights: 58%|#####8 | 231/398 [00:00<00:00, 6252.72it/s, Materializing param=vision_model.encoder.layers.1.self_attn.q_proj.weight] | |
| Loading weights: 58%|#####8 | 231/398 [00:00<00:00, 6245.10it/s, Materializing param=vision_model.encoder.layers.1.self_attn.q_proj.weight] | |
| Loading weights: 58%|#####8 | 232/398 [00:00<00:00, 6258.58it/s, Materializing param=vision_model.encoder.layers.1.self_attn.v_proj.bias] | |
| Loading weights: 58%|#####8 | 232/398 [00:00<00:00, 6251.23it/s, Materializing param=vision_model.encoder.layers.1.self_attn.v_proj.bias] | |
| Loading weights: 59%|#####8 | 233/398 [00:00<00:00, 6264.49it/s, Materializing param=vision_model.encoder.layers.1.self_attn.v_proj.weight] | |
| Loading weights: 59%|#####8 | 233/398 [00:00<00:00, 6257.31it/s, Materializing param=vision_model.encoder.layers.1.self_attn.v_proj.weight] | |
| Loading weights: 59%|#####8 | 234/398 [00:00<00:00, 6270.95it/s, Materializing param=vision_model.encoder.layers.2.layer_norm1.bias] | |
| Loading weights: 59%|#####8 | 234/398 [00:00<00:00, 6263.83it/s, Materializing param=vision_model.encoder.layers.2.layer_norm1.bias] | |
| Loading weights: 59%|#####9 | 235/398 [00:00<00:00, 6277.02it/s, Materializing param=vision_model.encoder.layers.2.layer_norm1.weight] | |
| Loading weights: 59%|#####9 | 235/398 [00:00<00:00, 6269.79it/s, Materializing param=vision_model.encoder.layers.2.layer_norm1.weight] | |
| Loading weights: 59%|#####9 | 236/398 [00:00<00:00, 6283.64it/s, Materializing param=vision_model.encoder.layers.2.layer_norm2.bias] | |
| Loading weights: 59%|#####9 | 236/398 [00:00<00:00, 6276.31it/s, Materializing param=vision_model.encoder.layers.2.layer_norm2.bias] | |
| Loading weights: 60%|#####9 | 237/398 [00:00<00:00, 6289.51it/s, Materializing param=vision_model.encoder.layers.2.layer_norm2.weight] | |
| Loading weights: 60%|#####9 | 237/398 [00:00<00:00, 6281.32it/s, Materializing param=vision_model.encoder.layers.2.layer_norm2.weight] | |
| Loading weights: 60%|#####9 | 238/398 [00:00<00:00, 6294.46it/s, Materializing param=vision_model.encoder.layers.2.mlp.fc1.bias] | |
| Loading weights: 60%|#####9 | 238/398 [00:00<00:00, 6285.74it/s, Materializing param=vision_model.encoder.layers.2.mlp.fc1.bias] | |
| Loading weights: 60%|###### | 239/398 [00:00<00:00, 6298.70it/s, Materializing param=vision_model.encoder.layers.2.mlp.fc1.weight] | |
| Loading weights: 60%|###### | 239/398 [00:00<00:00, 6291.43it/s, Materializing param=vision_model.encoder.layers.2.mlp.fc1.weight] | |
| Loading weights: 60%|###### | 240/398 [00:00<00:00, 6304.62it/s, Materializing param=vision_model.encoder.layers.2.mlp.fc2.bias] | |
| Loading weights: 60%|###### | 240/398 [00:00<00:00, 6292.83it/s, Materializing param=vision_model.encoder.layers.2.mlp.fc2.bias] | |
| Loading weights: 61%|###### | 241/398 [00:00<00:00, 6297.91it/s, Materializing param=vision_model.encoder.layers.2.mlp.fc2.weight] | |
| Loading weights: 61%|###### | 241/398 [00:00<00:00, 6289.64it/s, Materializing param=vision_model.encoder.layers.2.mlp.fc2.weight] | |
| Loading weights: 61%|###### | 242/398 [00:00<00:00, 6300.57it/s, Materializing param=vision_model.encoder.layers.2.self_attn.k_proj.bias] | |
| Loading weights: 61%|###### | 242/398 [00:00<00:00, 6293.19it/s, Materializing param=vision_model.encoder.layers.2.self_attn.k_proj.bias] | |
| Loading weights: 61%|######1 | 243/398 [00:00<00:00, 6304.81it/s, Materializing param=vision_model.encoder.layers.2.self_attn.k_proj.weight] | |
| Loading weights: 61%|######1 | 243/398 [00:00<00:00, 6297.13it/s, Materializing param=vision_model.encoder.layers.2.self_attn.k_proj.weight] | |
| Loading weights: 61%|######1 | 244/398 [00:00<00:00, 6307.81it/s, Materializing param=vision_model.encoder.layers.2.self_attn.out_proj.bias] | |
| Loading weights: 61%|######1 | 244/398 [00:00<00:00, 6300.43it/s, Materializing param=vision_model.encoder.layers.2.self_attn.out_proj.bias] | |
| Loading weights: 62%|######1 | 245/398 [00:00<00:00, 6312.49it/s, Materializing param=vision_model.encoder.layers.2.self_attn.out_proj.weight] | |
| Loading weights: 62%|######1 | 245/398 [00:00<00:00, 6305.33it/s, Materializing param=vision_model.encoder.layers.2.self_attn.out_proj.weight] | |
| Loading weights: 62%|######1 | 246/398 [00:00<00:00, 6317.23it/s, Materializing param=vision_model.encoder.layers.2.self_attn.q_proj.bias] | |
| Loading weights: 62%|######1 | 246/398 [00:00<00:00, 6309.92it/s, Materializing param=vision_model.encoder.layers.2.self_attn.q_proj.bias] | |
| Loading weights: 62%|######2 | 247/398 [00:00<00:00, 6322.43it/s, Materializing param=vision_model.encoder.layers.2.self_attn.q_proj.weight] | |
| Loading weights: 62%|######2 | 247/398 [00:00<00:00, 6315.57it/s, Materializing param=vision_model.encoder.layers.2.self_attn.q_proj.weight] | |
| Loading weights: 62%|######2 | 248/398 [00:00<00:00, 6327.91it/s, Materializing param=vision_model.encoder.layers.2.self_attn.v_proj.bias] | |
| Loading weights: 62%|######2 | 248/398 [00:00<00:00, 6320.83it/s, Materializing param=vision_model.encoder.layers.2.self_attn.v_proj.bias] | |
| Loading weights: 63%|######2 | 249/398 [00:00<00:00, 6332.69it/s, Materializing param=vision_model.encoder.layers.2.self_attn.v_proj.weight] | |
| Loading weights: 63%|######2 | 249/398 [00:00<00:00, 6325.48it/s, Materializing param=vision_model.encoder.layers.2.self_attn.v_proj.weight] | |
| Loading weights: 63%|######2 | 250/398 [00:00<00:00, 6338.03it/s, Materializing param=vision_model.encoder.layers.3.layer_norm1.bias] | |
| Loading weights: 63%|######2 | 250/398 [00:00<00:00, 6331.02it/s, Materializing param=vision_model.encoder.layers.3.layer_norm1.bias] | |
| Loading weights: 63%|######3 | 251/398 [00:00<00:00, 6343.37it/s, Materializing param=vision_model.encoder.layers.3.layer_norm1.weight] | |
| Loading weights: 63%|######3 | 251/398 [00:00<00:00, 6336.30it/s, Materializing param=vision_model.encoder.layers.3.layer_norm1.weight] | |
| Loading weights: 63%|######3 | 252/398 [00:00<00:00, 6348.10it/s, Materializing param=vision_model.encoder.layers.3.layer_norm2.bias] | |
| Loading weights: 63%|######3 | 252/398 [00:00<00:00, 6340.56it/s, Materializing param=vision_model.encoder.layers.3.layer_norm2.bias] | |
| Loading weights: 64%|######3 | 253/398 [00:00<00:00, 6349.42it/s, Materializing param=vision_model.encoder.layers.3.layer_norm2.weight] | |
| Loading weights: 64%|######3 | 253/398 [00:00<00:00, 6341.52it/s, Materializing param=vision_model.encoder.layers.3.layer_norm2.weight] | |
| Loading weights: 64%|######3 | 254/398 [00:00<00:00, 6353.26it/s, Materializing param=vision_model.encoder.layers.3.mlp.fc1.bias] | |
| Loading weights: 64%|######3 | 254/398 [00:00<00:00, 6346.41it/s, Materializing param=vision_model.encoder.layers.3.mlp.fc1.bias] | |
| Loading weights: 64%|######4 | 255/398 [00:00<00:00, 6358.03it/s, Materializing param=vision_model.encoder.layers.3.mlp.fc1.weight] | |
| Loading weights: 64%|######4 | 255/398 [00:00<00:00, 6351.27it/s, Materializing param=vision_model.encoder.layers.3.mlp.fc1.weight] | |
| Loading weights: 64%|######4 | 256/398 [00:00<00:00, 6363.52it/s, Materializing param=vision_model.encoder.layers.3.mlp.fc2.bias] | |
| Loading weights: 64%|######4 | 256/398 [00:00<00:00, 6356.51it/s, Materializing param=vision_model.encoder.layers.3.mlp.fc2.bias] | |
| Loading weights: 65%|######4 | 257/398 [00:00<00:00, 6368.18it/s, Materializing param=vision_model.encoder.layers.3.mlp.fc2.weight] | |
| Loading weights: 65%|######4 | 257/398 [00:00<00:00, 6361.27it/s, Materializing param=vision_model.encoder.layers.3.mlp.fc2.weight] | |
| Loading weights: 65%|######4 | 258/398 [00:00<00:00, 6373.80it/s, Materializing param=vision_model.encoder.layers.3.self_attn.k_proj.bias] | |
| Loading weights: 65%|######4 | 258/398 [00:00<00:00, 6366.97it/s, Materializing param=vision_model.encoder.layers.3.self_attn.k_proj.bias] | |
| Loading weights: 65%|######5 | 259/398 [00:00<00:00, 6379.15it/s, Materializing param=vision_model.encoder.layers.3.self_attn.k_proj.weight] | |
| Loading weights: 65%|######5 | 259/398 [00:00<00:00, 6371.59it/s, Materializing param=vision_model.encoder.layers.3.self_attn.k_proj.weight] | |
| Loading weights: 65%|######5 | 260/398 [00:00<00:00, 6384.02it/s, Materializing param=vision_model.encoder.layers.3.self_attn.out_proj.bias] | |
| Loading weights: 65%|######5 | 260/398 [00:00<00:00, 6377.45it/s, Materializing param=vision_model.encoder.layers.3.self_attn.out_proj.bias] | |
| Loading weights: 66%|######5 | 261/398 [00:00<00:00, 6383.35it/s, Materializing param=vision_model.encoder.layers.3.self_attn.out_proj.weight] | |
| Loading weights: 66%|######5 | 261/398 [00:00<00:00, 6372.13it/s, Materializing param=vision_model.encoder.layers.3.self_attn.out_proj.weight] | |
| Loading weights: 66%|######5 | 262/398 [00:00<00:00, 6377.65it/s, Materializing param=vision_model.encoder.layers.3.self_attn.q_proj.bias] | |
| Loading weights: 66%|######5 | 262/398 [00:00<00:00, 6370.15it/s, Materializing param=vision_model.encoder.layers.3.self_attn.q_proj.bias] | |
| Loading weights: 66%|######6 | 263/398 [00:00<00:00, 6377.75it/s, Materializing param=vision_model.encoder.layers.3.self_attn.q_proj.weight] | |
| Loading weights: 66%|######6 | 263/398 [00:00<00:00, 6368.77it/s, Materializing param=vision_model.encoder.layers.3.self_attn.q_proj.weight] | |
| Loading weights: 66%|######6 | 264/398 [00:00<00:00, 6376.19it/s, Materializing param=vision_model.encoder.layers.3.self_attn.v_proj.bias] | |
| Loading weights: 66%|######6 | 264/398 [00:00<00:00, 6366.81it/s, Materializing param=vision_model.encoder.layers.3.self_attn.v_proj.bias] | |
| Loading weights: 67%|######6 | 265/398 [00:00<00:00, 6373.34it/s, Materializing param=vision_model.encoder.layers.3.self_attn.v_proj.weight] | |
| Loading weights: 67%|######6 | 265/398 [00:00<00:00, 6363.41it/s, Materializing param=vision_model.encoder.layers.3.self_attn.v_proj.weight] | |
| Loading weights: 67%|######6 | 266/398 [00:00<00:00, 6369.15it/s, Materializing param=vision_model.encoder.layers.4.layer_norm1.bias] | |
| Loading weights: 67%|######6 | 266/398 [00:00<00:00, 6358.95it/s, Materializing param=vision_model.encoder.layers.4.layer_norm1.bias] | |
| Loading weights: 67%|######7 | 267/398 [00:00<00:00, 6365.23it/s, Materializing param=vision_model.encoder.layers.4.layer_norm1.weight] | |
| Loading weights: 67%|######7 | 267/398 [00:00<00:00, 6355.91it/s, Materializing param=vision_model.encoder.layers.4.layer_norm1.weight] | |
| Loading weights: 67%|######7 | 268/398 [00:00<00:00, 6364.00it/s, Materializing param=vision_model.encoder.layers.4.layer_norm2.bias] | |
| Loading weights: 67%|######7 | 268/398 [00:00<00:00, 6355.29it/s, Materializing param=vision_model.encoder.layers.4.layer_norm2.bias] | |
| Loading weights: 68%|######7 | 269/398 [00:00<00:00, 6360.81it/s, Materializing param=vision_model.encoder.layers.4.layer_norm2.weight] | |
| Loading weights: 68%|######7 | 269/398 [00:00<00:00, 6351.57it/s, Materializing param=vision_model.encoder.layers.4.layer_norm2.weight] | |
| Loading weights: 68%|######7 | 270/398 [00:00<00:00, 6358.04it/s, Materializing param=vision_model.encoder.layers.4.mlp.fc1.bias] | |
| Loading weights: 68%|######7 | 270/398 [00:00<00:00, 6349.13it/s, Materializing param=vision_model.encoder.layers.4.mlp.fc1.bias] | |
| Loading weights: 68%|######8 | 271/398 [00:00<00:00, 6354.62it/s, Materializing param=vision_model.encoder.layers.4.mlp.fc1.weight] | |
| Loading weights: 68%|######8 | 271/398 [00:00<00:00, 6347.48it/s, Materializing param=vision_model.encoder.layers.4.mlp.fc1.weight] | |
| Loading weights: 68%|######8 | 272/398 [00:00<00:00, 6358.62it/s, Materializing param=vision_model.encoder.layers.4.mlp.fc2.bias] | |
| Loading weights: 68%|######8 | 272/398 [00:00<00:00, 6352.56it/s, Materializing param=vision_model.encoder.layers.4.mlp.fc2.bias] | |
| Loading weights: 69%|######8 | 273/398 [00:00<00:00, 6363.94it/s, Materializing param=vision_model.encoder.layers.4.mlp.fc2.weight] | |
| Loading weights: 69%|######8 | 273/398 [00:00<00:00, 6357.65it/s, Materializing param=vision_model.encoder.layers.4.mlp.fc2.weight] | |
| Loading weights: 69%|######8 | 274/398 [00:00<00:00, 6369.09it/s, Materializing param=vision_model.encoder.layers.4.self_attn.k_proj.bias] | |
| Loading weights: 69%|######8 | 274/398 [00:00<00:00, 6362.85it/s, Materializing param=vision_model.encoder.layers.4.self_attn.k_proj.bias] | |
| Loading weights: 69%|######9 | 275/398 [00:00<00:00, 6374.25it/s, Materializing param=vision_model.encoder.layers.4.self_attn.k_proj.weight] | |
| Loading weights: 69%|######9 | 275/398 [00:00<00:00, 6368.13it/s, Materializing param=vision_model.encoder.layers.4.self_attn.k_proj.weight] | |
| Loading weights: 69%|######9 | 276/398 [00:00<00:00, 6379.70it/s, Materializing param=vision_model.encoder.layers.4.self_attn.out_proj.bias] | |
| Loading weights: 69%|######9 | 276/398 [00:00<00:00, 6373.41it/s, Materializing param=vision_model.encoder.layers.4.self_attn.out_proj.bias] | |
| Loading weights: 70%|######9 | 277/398 [00:00<00:00, 6384.90it/s, Materializing param=vision_model.encoder.layers.4.self_attn.out_proj.weight] | |
| Loading weights: 70%|######9 | 277/398 [00:00<00:00, 6378.70it/s, Materializing param=vision_model.encoder.layers.4.self_attn.out_proj.weight] | |
| Loading weights: 70%|######9 | 278/398 [00:00<00:00, 6390.15it/s, Materializing param=vision_model.encoder.layers.4.self_attn.q_proj.bias] | |
| Loading weights: 70%|######9 | 278/398 [00:00<00:00, 6383.78it/s, Materializing param=vision_model.encoder.layers.4.self_attn.q_proj.bias] | |
| Loading weights: 70%|####### | 279/398 [00:00<00:00, 6395.54it/s, Materializing param=vision_model.encoder.layers.4.self_attn.q_proj.weight] | |
| Loading weights: 70%|####### | 279/398 [00:00<00:00, 6389.25it/s, Materializing param=vision_model.encoder.layers.4.self_attn.q_proj.weight] | |
| Loading weights: 70%|####### | 280/398 [00:00<00:00, 6400.62it/s, Materializing param=vision_model.encoder.layers.4.self_attn.v_proj.bias] | |
| Loading weights: 70%|####### | 280/398 [00:00<00:00, 6394.45it/s, Materializing param=vision_model.encoder.layers.4.self_attn.v_proj.bias] | |
| Loading weights: 71%|####### | 281/398 [00:00<00:00, 6406.06it/s, Materializing param=vision_model.encoder.layers.4.self_attn.v_proj.weight] | |
| Loading weights: 71%|####### | 281/398 [00:00<00:00, 6399.94it/s, Materializing param=vision_model.encoder.layers.4.self_attn.v_proj.weight] | |
| Loading weights: 71%|####### | 282/398 [00:00<00:00, 6411.33it/s, Materializing param=vision_model.encoder.layers.5.layer_norm1.bias] | |
| Loading weights: 71%|####### | 282/398 [00:00<00:00, 6405.25it/s, Materializing param=vision_model.encoder.layers.5.layer_norm1.bias] | |
| Loading weights: 71%|#######1 | 283/398 [00:00<00:00, 6415.60it/s, Materializing param=vision_model.encoder.layers.5.layer_norm1.weight] | |
| Loading weights: 71%|#######1 | 283/398 [00:00<00:00, 6397.55it/s, Materializing param=vision_model.encoder.layers.5.layer_norm1.weight] | |
| Loading weights: 71%|#######1 | 284/398 [00:00<00:00, 6382.01it/s, Materializing param=vision_model.encoder.layers.5.layer_norm2.bias] | |
| Loading weights: 71%|#######1 | 284/398 [00:00<00:00, 6364.75it/s, Materializing param=vision_model.encoder.layers.5.layer_norm2.bias] | |
| Loading weights: 72%|#######1 | 285/398 [00:00<00:00, 6368.55it/s, Materializing param=vision_model.encoder.layers.5.layer_norm2.weight] | |
| Loading weights: 72%|#######1 | 285/398 [00:00<00:00, 6360.75it/s, Materializing param=vision_model.encoder.layers.5.layer_norm2.weight] | |
| Loading weights: 72%|#######1 | 286/398 [00:00<00:00, 6368.33it/s, Materializing param=vision_model.encoder.layers.5.mlp.fc1.bias] | |
| Loading weights: 72%|#######1 | 286/398 [00:00<00:00, 6361.38it/s, Materializing param=vision_model.encoder.layers.5.mlp.fc1.bias] | |
| Loading weights: 72%|#######2 | 287/398 [00:00<00:00, 6370.14it/s, Materializing param=vision_model.encoder.layers.5.mlp.fc1.weight] | |
| Loading weights: 72%|#######2 | 287/398 [00:00<00:00, 6363.47it/s, Materializing param=vision_model.encoder.layers.5.mlp.fc1.weight] | |
| Loading weights: 72%|#######2 | 288/398 [00:00<00:00, 6370.15it/s, Materializing param=vision_model.encoder.layers.5.mlp.fc2.bias] | |
| Loading weights: 72%|#######2 | 288/398 [00:00<00:00, 6363.44it/s, Materializing param=vision_model.encoder.layers.5.mlp.fc2.bias] | |
| Loading weights: 73%|#######2 | 289/398 [00:00<00:00, 6373.45it/s, Materializing param=vision_model.encoder.layers.5.mlp.fc2.weight] | |
| Loading weights: 73%|#######2 | 289/398 [00:00<00:00, 6367.16it/s, Materializing param=vision_model.encoder.layers.5.mlp.fc2.weight] | |
| Loading weights: 73%|#######2 | 290/398 [00:00<00:00, 6377.90it/s, Materializing param=vision_model.encoder.layers.5.self_attn.k_proj.bias] | |
| Loading weights: 73%|#######2 | 290/398 [00:00<00:00, 6371.58it/s, Materializing param=vision_model.encoder.layers.5.self_attn.k_proj.bias] | |
| Loading weights: 73%|#######3 | 291/398 [00:00<00:00, 6381.49it/s, Materializing param=vision_model.encoder.layers.5.self_attn.k_proj.weight] | |
| Loading weights: 73%|#######3 | 291/398 [00:00<00:00, 6372.62it/s, Materializing param=vision_model.encoder.layers.5.self_attn.k_proj.weight] | |
| Loading weights: 73%|#######3 | 292/398 [00:00<00:00, 6382.19it/s, Materializing param=vision_model.encoder.layers.5.self_attn.out_proj.bias] | |
| Loading weights: 73%|#######3 | 292/398 [00:00<00:00, 6373.13it/s, Materializing param=vision_model.encoder.layers.5.self_attn.out_proj.bias] | |
| Loading weights: 74%|#######3 | 293/398 [00:00<00:00, 6377.40it/s, Materializing param=vision_model.encoder.layers.5.self_attn.out_proj.weight] | |
| Loading weights: 74%|#######3 | 293/398 [00:00<00:00, 6368.71it/s, Materializing param=vision_model.encoder.layers.5.self_attn.out_proj.weight] | |
| Loading weights: 74%|#######3 | 294/398 [00:00<00:00, 6375.05it/s, Materializing param=vision_model.encoder.layers.5.self_attn.q_proj.bias] | |
| Loading weights: 74%|#######3 | 294/398 [00:00<00:00, 6367.21it/s, Materializing param=vision_model.encoder.layers.5.self_attn.q_proj.bias] | |
| Loading weights: 74%|#######4 | 295/398 [00:00<00:00, 6375.08it/s, Materializing param=vision_model.encoder.layers.5.self_attn.q_proj.weight] | |
| Loading weights: 74%|#######4 | 295/398 [00:00<00:00, 6367.73it/s, Materializing param=vision_model.encoder.layers.5.self_attn.q_proj.weight] | |
| Loading weights: 74%|#######4 | 296/398 [00:00<00:00, 6375.50it/s, Materializing param=vision_model.encoder.layers.5.self_attn.v_proj.bias] | |
| Loading weights: 74%|#######4 | 296/398 [00:00<00:00, 6367.82it/s, Materializing param=vision_model.encoder.layers.5.self_attn.v_proj.bias] | |
| Loading weights: 75%|#######4 | 297/398 [00:00<00:00, 6375.76it/s, Materializing param=vision_model.encoder.layers.5.self_attn.v_proj.weight] | |
| Loading weights: 75%|#######4 | 297/398 [00:00<00:00, 6368.10it/s, Materializing param=vision_model.encoder.layers.5.self_attn.v_proj.weight] | |
| Loading weights: 75%|#######4 | 298/398 [00:00<00:00, 6375.66it/s, Materializing param=vision_model.encoder.layers.6.layer_norm1.bias] | |
| Loading weights: 75%|#######4 | 298/398 [00:00<00:00, 6368.12it/s, Materializing param=vision_model.encoder.layers.6.layer_norm1.bias] | |
| Loading weights: 75%|#######5 | 299/398 [00:00<00:00, 6374.74it/s, Materializing param=vision_model.encoder.layers.6.layer_norm1.weight] | |
| Loading weights: 75%|#######5 | 299/398 [00:00<00:00, 6367.40it/s, Materializing param=vision_model.encoder.layers.6.layer_norm1.weight] | |
| Loading weights: 75%|#######5 | 300/398 [00:00<00:00, 6375.26it/s, Materializing param=vision_model.encoder.layers.6.layer_norm2.bias] | |
| Loading weights: 75%|#######5 | 300/398 [00:00<00:00, 6367.71it/s, Materializing param=vision_model.encoder.layers.6.layer_norm2.bias] | |
| Loading weights: 76%|#######5 | 301/398 [00:00<00:00, 6375.87it/s, Materializing param=vision_model.encoder.layers.6.layer_norm2.weight] | |
| Loading weights: 76%|#######5 | 301/398 [00:00<00:00, 6369.08it/s, Materializing param=vision_model.encoder.layers.6.layer_norm2.weight] | |
| Loading weights: 76%|#######5 | 302/398 [00:00<00:00, 6378.11it/s, Materializing param=vision_model.encoder.layers.6.mlp.fc1.bias] | |
| Loading weights: 76%|#######5 | 302/398 [00:00<00:00, 6371.98it/s, Materializing param=vision_model.encoder.layers.6.mlp.fc1.bias] | |
| Loading weights: 76%|#######6 | 303/398 [00:00<00:00, 6380.56it/s, Materializing param=vision_model.encoder.layers.6.mlp.fc1.weight] | |
| Loading weights: 76%|#######6 | 303/398 [00:00<00:00, 6373.33it/s, Materializing param=vision_model.encoder.layers.6.mlp.fc1.weight] | |
| Loading weights: 76%|#######6 | 304/398 [00:00<00:00, 6380.38it/s, Materializing param=vision_model.encoder.layers.6.mlp.fc2.bias] | |
| Loading weights: 76%|#######6 | 304/398 [00:00<00:00, 6373.43it/s, Materializing param=vision_model.encoder.layers.6.mlp.fc2.bias] | |
| Loading weights: 77%|#######6 | 305/398 [00:00<00:00, 6378.93it/s, Materializing param=vision_model.encoder.layers.6.mlp.fc2.weight] | |
| Loading weights: 77%|#######6 | 305/398 [00:00<00:00, 6371.59it/s, Materializing param=vision_model.encoder.layers.6.mlp.fc2.weight] | |
| Loading weights: 77%|#######6 | 306/398 [00:00<00:00, 6379.11it/s, Materializing param=vision_model.encoder.layers.6.self_attn.k_proj.bias] | |
| Loading weights: 77%|#######6 | 306/398 [00:00<00:00, 6372.17it/s, Materializing param=vision_model.encoder.layers.6.self_attn.k_proj.bias] | |
| Loading weights: 77%|#######7 | 307/398 [00:00<00:00, 6379.75it/s, Materializing param=vision_model.encoder.layers.6.self_attn.k_proj.weight] | |
| Loading weights: 77%|#######7 | 307/398 [00:00<00:00, 6372.93it/s, Materializing param=vision_model.encoder.layers.6.self_attn.k_proj.weight] | |
| Loading weights: 77%|#######7 | 308/398 [00:00<00:00, 6379.71it/s, Materializing param=vision_model.encoder.layers.6.self_attn.out_proj.bias] | |
| Loading weights: 77%|#######7 | 308/398 [00:00<00:00, 5074.20it/s, Materializing param=vision_model.encoder.layers.6.self_attn.out_proj.bias] | |
| Loading weights: 78%|#######7 | 309/398 [00:00<00:00, 5046.00it/s, Materializing param=vision_model.encoder.layers.6.self_attn.out_proj.weight] | |
| Loading weights: 78%|#######7 | 309/398 [00:00<00:00, 5036.86it/s, Materializing param=vision_model.encoder.layers.6.self_attn.out_proj.weight] | |
| Loading weights: 78%|#######7 | 310/398 [00:00<00:00, 5040.98it/s, Materializing param=vision_model.encoder.layers.6.self_attn.q_proj.bias] | |
| Loading weights: 78%|#######7 | 310/398 [00:00<00:00, 5036.58it/s, Materializing param=vision_model.encoder.layers.6.self_attn.q_proj.bias] | |
| Loading weights: 78%|#######8 | 311/398 [00:00<00:00, 5041.76it/s, Materializing param=vision_model.encoder.layers.6.self_attn.q_proj.weight] | |
| Loading weights: 78%|#######8 | 311/398 [00:00<00:00, 5036.36it/s, Materializing param=vision_model.encoder.layers.6.self_attn.q_proj.weight] | |
| Loading weights: 78%|#######8 | 312/398 [00:00<00:00, 5040.30it/s, Materializing param=vision_model.encoder.layers.6.self_attn.v_proj.bias] | |
| Loading weights: 78%|#######8 | 312/398 [00:00<00:00, 5035.20it/s, Materializing param=vision_model.encoder.layers.6.self_attn.v_proj.bias] | |
| Loading weights: 79%|#######8 | 313/398 [00:00<00:00, 5039.47it/s, Materializing param=vision_model.encoder.layers.6.self_attn.v_proj.weight] | |
| Loading weights: 79%|#######8 | 313/398 [00:00<00:00, 5034.56it/s, Materializing param=vision_model.encoder.layers.6.self_attn.v_proj.weight] | |
| Loading weights: 79%|#######8 | 314/398 [00:00<00:00, 5041.60it/s, Materializing param=vision_model.encoder.layers.7.layer_norm1.bias] | |
| Loading weights: 79%|#######8 | 314/398 [00:00<00:00, 5036.83it/s, Materializing param=vision_model.encoder.layers.7.layer_norm1.bias] | |
| Loading weights: 79%|#######9 | 315/398 [00:00<00:00, 5042.37it/s, Materializing param=vision_model.encoder.layers.7.layer_norm1.weight] | |
| Loading weights: 79%|#######9 | 315/398 [00:00<00:00, 5037.16it/s, Materializing param=vision_model.encoder.layers.7.layer_norm1.weight] | |
| Loading weights: 79%|#######9 | 316/398 [00:00<00:00, 5043.24it/s, Materializing param=vision_model.encoder.layers.7.layer_norm2.bias] | |
| Loading weights: 79%|#######9 | 316/398 [00:00<00:00, 5038.26it/s, Materializing param=vision_model.encoder.layers.7.layer_norm2.bias] | |
| Loading weights: 80%|#######9 | 317/398 [00:00<00:00, 5043.97it/s, Materializing param=vision_model.encoder.layers.7.layer_norm2.weight] | |
| Loading weights: 80%|#######9 | 317/398 [00:00<00:00, 5038.21it/s, Materializing param=vision_model.encoder.layers.7.layer_norm2.weight] | |
| Loading weights: 80%|#######9 | 318/398 [00:00<00:00, 5043.67it/s, Materializing param=vision_model.encoder.layers.7.mlp.fc1.bias] | |
| Loading weights: 80%|#######9 | 318/398 [00:00<00:00, 5038.43it/s, Materializing param=vision_model.encoder.layers.7.mlp.fc1.bias] | |
| Loading weights: 80%|######## | 319/398 [00:00<00:00, 5044.08it/s, Materializing param=vision_model.encoder.layers.7.mlp.fc1.weight] | |
| Loading weights: 80%|######## | 319/398 [00:00<00:00, 5037.87it/s, Materializing param=vision_model.encoder.layers.7.mlp.fc1.weight] | |
| Loading weights: 80%|######## | 320/398 [00:00<00:00, 5043.20it/s, Materializing param=vision_model.encoder.layers.7.mlp.fc2.bias] | |
| Loading weights: 80%|######## | 320/398 [00:00<00:00, 5037.92it/s, Materializing param=vision_model.encoder.layers.7.mlp.fc2.bias] | |
| Loading weights: 81%|######## | 321/398 [00:00<00:00, 5043.59it/s, Materializing param=vision_model.encoder.layers.7.mlp.fc2.weight] | |
| Loading weights: 81%|######## | 321/398 [00:00<00:00, 5038.33it/s, Materializing param=vision_model.encoder.layers.7.mlp.fc2.weight] | |
| Loading weights: 81%|######## | 322/398 [00:00<00:00, 5044.34it/s, Materializing param=vision_model.encoder.layers.7.self_attn.k_proj.bias] | |
| Loading weights: 81%|######## | 322/398 [00:00<00:00, 5039.09it/s, Materializing param=vision_model.encoder.layers.7.self_attn.k_proj.bias] | |
| Loading weights: 81%|########1 | 323/398 [00:00<00:00, 5044.33it/s, Materializing param=vision_model.encoder.layers.7.self_attn.k_proj.weight] | |
| Loading weights: 81%|########1 | 323/398 [00:00<00:00, 5039.37it/s, Materializing param=vision_model.encoder.layers.7.self_attn.k_proj.weight] | |
| Loading weights: 81%|########1 | 324/398 [00:00<00:00, 5045.93it/s, Materializing param=vision_model.encoder.layers.7.self_attn.out_proj.bias] | |
| Loading weights: 81%|########1 | 324/398 [00:00<00:00, 5040.88it/s, Materializing param=vision_model.encoder.layers.7.self_attn.out_proj.bias] | |
| Loading weights: 82%|########1 | 325/398 [00:00<00:00, 5046.36it/s, Materializing param=vision_model.encoder.layers.7.self_attn.out_proj.weight] | |
| Loading weights: 82%|########1 | 325/398 [00:00<00:00, 5041.45it/s, Materializing param=vision_model.encoder.layers.7.self_attn.out_proj.weight] | |
| Loading weights: 82%|########1 | 326/398 [00:00<00:00, 5048.02it/s, Materializing param=vision_model.encoder.layers.7.self_attn.q_proj.bias] | |
| Loading weights: 82%|########1 | 326/398 [00:00<00:00, 5043.35it/s, Materializing param=vision_model.encoder.layers.7.self_attn.q_proj.bias] | |
| Loading weights: 82%|########2 | 327/398 [00:00<00:00, 5050.01it/s, Materializing param=vision_model.encoder.layers.7.self_attn.q_proj.weight] | |
| Loading weights: 82%|########2 | 327/398 [00:00<00:00, 5045.31it/s, Materializing param=vision_model.encoder.layers.7.self_attn.q_proj.weight] | |
| Loading weights: 82%|########2 | 328/398 [00:00<00:00, 5052.38it/s, Materializing param=vision_model.encoder.layers.7.self_attn.v_proj.bias] | |
| Loading weights: 82%|########2 | 328/398 [00:00<00:00, 5047.80it/s, Materializing param=vision_model.encoder.layers.7.self_attn.v_proj.bias] | |
| Loading weights: 83%|########2 | 329/398 [00:00<00:00, 5055.34it/s, Materializing param=vision_model.encoder.layers.7.self_attn.v_proj.weight] | |
| Loading weights: 83%|########2 | 329/398 [00:00<00:00, 5051.58it/s, Materializing param=vision_model.encoder.layers.7.self_attn.v_proj.weight] | |
| Loading weights: 83%|########2 | 330/398 [00:00<00:00, 5058.90it/s, Materializing param=vision_model.encoder.layers.8.layer_norm1.bias] | |
| Loading weights: 83%|########2 | 330/398 [00:00<00:00, 5054.39it/s, Materializing param=vision_model.encoder.layers.8.layer_norm1.bias] | |
| Loading weights: 83%|########3 | 331/398 [00:00<00:00, 5059.81it/s, Materializing param=vision_model.encoder.layers.8.layer_norm1.weight] | |
| Loading weights: 83%|########3 | 331/398 [00:00<00:00, 5054.70it/s, Materializing param=vision_model.encoder.layers.8.layer_norm1.weight] | |
| Loading weights: 83%|########3 | 332/398 [00:00<00:00, 5060.39it/s, Materializing param=vision_model.encoder.layers.8.layer_norm2.bias] | |
| Loading weights: 83%|########3 | 332/398 [00:00<00:00, 5055.51it/s, Materializing param=vision_model.encoder.layers.8.layer_norm2.bias] | |
| Loading weights: 84%|########3 | 333/398 [00:00<00:00, 5061.86it/s, Materializing param=vision_model.encoder.layers.8.layer_norm2.weight] | |
| Loading weights: 84%|########3 | 333/398 [00:00<00:00, 5057.09it/s, Materializing param=vision_model.encoder.layers.8.layer_norm2.weight] | |
| Loading weights: 84%|########3 | 334/398 [00:00<00:00, 5063.46it/s, Materializing param=vision_model.encoder.layers.8.mlp.fc1.bias] | |
| Loading weights: 84%|########3 | 334/398 [00:00<00:00, 5058.76it/s, Materializing param=vision_model.encoder.layers.8.mlp.fc1.bias] | |
| Loading weights: 84%|########4 | 335/398 [00:00<00:00, 5064.93it/s, Materializing param=vision_model.encoder.layers.8.mlp.fc1.weight] | |
| Loading weights: 84%|########4 | 335/398 [00:00<00:00, 5060.20it/s, Materializing param=vision_model.encoder.layers.8.mlp.fc1.weight] | |
| Loading weights: 84%|########4 | 336/398 [00:00<00:00, 5066.55it/s, Materializing param=vision_model.encoder.layers.8.mlp.fc2.bias] | |
| Loading weights: 84%|########4 | 336/398 [00:00<00:00, 5061.89it/s, Materializing param=vision_model.encoder.layers.8.mlp.fc2.bias] | |
| Loading weights: 85%|########4 | 337/398 [00:00<00:00, 5068.33it/s, Materializing param=vision_model.encoder.layers.8.mlp.fc2.weight] | |
| Loading weights: 85%|########4 | 337/398 [00:00<00:00, 5063.46it/s, Materializing param=vision_model.encoder.layers.8.mlp.fc2.weight] | |
| Loading weights: 85%|########4 | 338/398 [00:00<00:00, 5069.42it/s, Materializing param=vision_model.encoder.layers.8.self_attn.k_proj.bias] | |
| Loading weights: 85%|########4 | 338/398 [00:00<00:00, 5066.25it/s, Materializing param=vision_model.encoder.layers.8.self_attn.k_proj.bias] | |
| Loading weights: 85%|########5 | 339/398 [00:00<00:00, 5074.50it/s, Materializing param=vision_model.encoder.layers.8.self_attn.k_proj.weight] | |
| Loading weights: 85%|########5 | 339/398 [00:00<00:00, 5071.75it/s, Materializing param=vision_model.encoder.layers.8.self_attn.k_proj.weight] | |
| Loading weights: 85%|########5 | 340/398 [00:00<00:00, 5080.49it/s, Materializing param=vision_model.encoder.layers.8.self_attn.out_proj.bias] | |
| Loading weights: 85%|########5 | 340/398 [00:00<00:00, 5077.71it/s, Materializing param=vision_model.encoder.layers.8.self_attn.out_proj.bias] | |
| Loading weights: 86%|########5 | 341/398 [00:00<00:00, 5085.88it/s, Materializing param=vision_model.encoder.layers.8.self_attn.out_proj.weight] | |
| Loading weights: 86%|########5 | 341/398 [00:00<00:00, 5083.01it/s, Materializing param=vision_model.encoder.layers.8.self_attn.out_proj.weight] | |
| Loading weights: 86%|########5 | 342/398 [00:00<00:00, 5090.86it/s, Materializing param=vision_model.encoder.layers.8.self_attn.q_proj.bias] | |
| Loading weights: 86%|########5 | 342/398 [00:00<00:00, 5088.06it/s, Materializing param=vision_model.encoder.layers.8.self_attn.q_proj.bias] | |
| Loading weights: 86%|########6 | 343/398 [00:00<00:00, 5096.43it/s, Materializing param=vision_model.encoder.layers.8.self_attn.q_proj.weight] | |
| Loading weights: 86%|########6 | 343/398 [00:00<00:00, 5093.06it/s, Materializing param=vision_model.encoder.layers.8.self_attn.q_proj.weight] | |
| Loading weights: 86%|########6 | 344/398 [00:00<00:00, 5100.94it/s, Materializing param=vision_model.encoder.layers.8.self_attn.v_proj.bias] | |
| Loading weights: 86%|########6 | 344/398 [00:00<00:00, 5095.73it/s, Materializing param=vision_model.encoder.layers.8.self_attn.v_proj.bias] | |
| Loading weights: 87%|########6 | 345/398 [00:00<00:00, 5102.60it/s, Materializing param=vision_model.encoder.layers.8.self_attn.v_proj.weight] | |
| Loading weights: 87%|########6 | 345/398 [00:00<00:00, 5098.64it/s, Materializing param=vision_model.encoder.layers.8.self_attn.v_proj.weight] | |
| Loading weights: 87%|########6 | 346/398 [00:00<00:00, 5106.56it/s, Materializing param=vision_model.encoder.layers.9.layer_norm1.bias] | |
| Loading weights: 87%|########6 | 346/398 [00:00<00:00, 5103.71it/s, Materializing param=vision_model.encoder.layers.9.layer_norm1.bias] | |
| Loading weights: 87%|########7 | 347/398 [00:00<00:00, 5112.22it/s, Materializing param=vision_model.encoder.layers.9.layer_norm1.weight] | |
| Loading weights: 87%|########7 | 347/398 [00:00<00:00, 5109.51it/s, Materializing param=vision_model.encoder.layers.9.layer_norm1.weight] | |
| Loading weights: 87%|########7 | 348/398 [00:00<00:00, 5117.66it/s, Materializing param=vision_model.encoder.layers.9.layer_norm2.bias] | |
| Loading weights: 87%|########7 | 348/398 [00:00<00:00, 5114.83it/s, Materializing param=vision_model.encoder.layers.9.layer_norm2.bias] | |
| Loading weights: 88%|########7 | 349/398 [00:00<00:00, 5123.02it/s, Materializing param=vision_model.encoder.layers.9.layer_norm2.weight] | |
| Loading weights: 88%|########7 | 349/398 [00:00<00:00, 5118.81it/s, Materializing param=vision_model.encoder.layers.9.layer_norm2.weight] | |
| Loading weights: 88%|########7 | 350/398 [00:00<00:00, 5122.34it/s, Materializing param=vision_model.encoder.layers.9.mlp.fc1.bias] | |
| Loading weights: 88%|########7 | 350/398 [00:00<00:00, 5117.34it/s, Materializing param=vision_model.encoder.layers.9.mlp.fc1.bias] | |
| Loading weights: 88%|########8 | 351/398 [00:00<00:00, 5123.32it/s, Materializing param=vision_model.encoder.layers.9.mlp.fc1.weight] | |
| Loading weights: 88%|########8 | 351/398 [00:00<00:00, 5118.65it/s, Materializing param=vision_model.encoder.layers.9.mlp.fc1.weight] | |
| Loading weights: 88%|########8 | 352/398 [00:00<00:00, 5123.77it/s, Materializing param=vision_model.encoder.layers.9.mlp.fc2.bias] | |
| Loading weights: 88%|########8 | 352/398 [00:00<00:00, 5118.39it/s, Materializing param=vision_model.encoder.layers.9.mlp.fc2.bias] | |
| Loading weights: 89%|########8 | 353/398 [00:00<00:00, 5122.92it/s, Materializing param=vision_model.encoder.layers.9.mlp.fc2.weight] | |
| Loading weights: 89%|########8 | 353/398 [00:00<00:00, 5118.24it/s, Materializing param=vision_model.encoder.layers.9.mlp.fc2.weight] | |
| Loading weights: 89%|########8 | 354/398 [00:00<00:00, 5123.95it/s, Materializing param=vision_model.encoder.layers.9.self_attn.k_proj.bias] | |
| Loading weights: 89%|########8 | 354/398 [00:00<00:00, 5119.18it/s, Materializing param=vision_model.encoder.layers.9.self_attn.k_proj.bias] | |
| Loading weights: 89%|########9 | 355/398 [00:00<00:00, 5125.36it/s, Materializing param=vision_model.encoder.layers.9.self_attn.k_proj.weight] | |
| Loading weights: 89%|########9 | 355/398 [00:00<00:00, 5121.62it/s, Materializing param=vision_model.encoder.layers.9.self_attn.k_proj.weight] | |
| Loading weights: 89%|########9 | 356/398 [00:00<00:00, 5128.53it/s, Materializing param=vision_model.encoder.layers.9.self_attn.out_proj.bias] | |
| Loading weights: 89%|########9 | 356/398 [00:00<00:00, 5124.11it/s, Materializing param=vision_model.encoder.layers.9.self_attn.out_proj.bias] | |
| Loading weights: 90%|########9 | 357/398 [00:00<00:00, 5130.15it/s, Materializing param=vision_model.encoder.layers.9.self_attn.out_proj.weight] | |
| Loading weights: 90%|########9 | 357/398 [00:00<00:00, 5126.26it/s, Materializing param=vision_model.encoder.layers.9.self_attn.out_proj.weight] | |
| Loading weights: 90%|########9 | 358/398 [00:00<00:00, 5134.24it/s, Materializing param=vision_model.encoder.layers.9.self_attn.q_proj.bias] | |
| Loading weights: 90%|########9 | 358/398 [00:00<00:00, 5130.77it/s, Materializing param=vision_model.encoder.layers.9.self_attn.q_proj.bias] | |
| Loading weights: 90%|######### | 359/398 [00:00<00:00, 5138.27it/s, Materializing param=vision_model.encoder.layers.9.self_attn.q_proj.weight] | |
| Loading weights: 90%|######### | 359/398 [00:00<00:00, 5135.05it/s, Materializing param=vision_model.encoder.layers.9.self_attn.q_proj.weight] | |
| Loading weights: 90%|######### | 360/398 [00:00<00:00, 5143.44it/s, Materializing param=vision_model.encoder.layers.9.self_attn.v_proj.bias] | |
| Loading weights: 90%|######### | 360/398 [00:00<00:00, 5140.29it/s, Materializing param=vision_model.encoder.layers.9.self_attn.v_proj.bias] | |
| Loading weights: 91%|######### | 361/398 [00:00<00:00, 5148.47it/s, Materializing param=vision_model.encoder.layers.9.self_attn.v_proj.weight] | |
| Loading weights: 91%|######### | 361/398 [00:00<00:00, 5145.25it/s, Materializing param=vision_model.encoder.layers.9.self_attn.v_proj.weight] | |
| Loading weights: 91%|######### | 362/398 [00:00<00:00, 5153.62it/s, Materializing param=vision_model.encoder.layers.10.layer_norm1.bias] | |
| Loading weights: 91%|######### | 362/398 [00:00<00:00, 5150.45it/s, Materializing param=vision_model.encoder.layers.10.layer_norm1.bias] | |
| Loading weights: 91%|#########1| 363/398 [00:00<00:00, 5158.77it/s, Materializing param=vision_model.encoder.layers.10.layer_norm1.weight] | |
| Loading weights: 91%|#########1| 363/398 [00:00<00:00, 5155.59it/s, Materializing param=vision_model.encoder.layers.10.layer_norm1.weight] | |
| Loading weights: 91%|#########1| 364/398 [00:00<00:00, 5164.05it/s, Materializing param=vision_model.encoder.layers.10.layer_norm2.bias] | |
| Loading weights: 91%|#########1| 364/398 [00:00<00:00, 5160.98it/s, Materializing param=vision_model.encoder.layers.10.layer_norm2.bias] | |
| Loading weights: 92%|#########1| 365/398 [00:00<00:00, 5169.34it/s, Materializing param=vision_model.encoder.layers.10.layer_norm2.weight] | |
| Loading weights: 92%|#########1| 365/398 [00:00<00:00, 5166.13it/s, Materializing param=vision_model.encoder.layers.10.layer_norm2.weight] | |
| Loading weights: 92%|#########1| 366/398 [00:00<00:00, 5174.40it/s, Materializing param=vision_model.encoder.layers.10.mlp.fc1.bias] | |
| Loading weights: 92%|#########1| 366/398 [00:00<00:00, 5171.21it/s, Materializing param=vision_model.encoder.layers.10.mlp.fc1.bias] | |
| Loading weights: 92%|#########2| 367/398 [00:00<00:00, 5179.25it/s, Materializing param=vision_model.encoder.layers.10.mlp.fc1.weight] | |
| Loading weights: 92%|#########2| 367/398 [00:00<00:00, 5175.98it/s, Materializing param=vision_model.encoder.layers.10.mlp.fc1.weight] | |
| Loading weights: 92%|#########2| 368/398 [00:00<00:00, 5184.28it/s, Materializing param=vision_model.encoder.layers.10.mlp.fc2.bias] | |
| Loading weights: 92%|#########2| 368/398 [00:00<00:00, 5181.13it/s, Materializing param=vision_model.encoder.layers.10.mlp.fc2.bias] | |
| Loading weights: 93%|#########2| 369/398 [00:00<00:00, 5189.39it/s, Materializing param=vision_model.encoder.layers.10.mlp.fc2.weight] | |
| Loading weights: 93%|#########2| 369/398 [00:00<00:00, 5186.15it/s, Materializing param=vision_model.encoder.layers.10.mlp.fc2.weight] | |
| Loading weights: 93%|#########2| 370/398 [00:00<00:00, 5194.48it/s, Materializing param=vision_model.encoder.layers.10.self_attn.k_proj.bias] | |
| Loading weights: 93%|#########2| 370/398 [00:00<00:00, 5191.25it/s, Materializing param=vision_model.encoder.layers.10.self_attn.k_proj.bias] | |
| Loading weights: 93%|#########3| 371/398 [00:00<00:00, 5199.54it/s, Materializing param=vision_model.encoder.layers.10.self_attn.k_proj.weight] | |
| Loading weights: 93%|#########3| 371/398 [00:00<00:00, 5195.42it/s, Materializing param=vision_model.encoder.layers.10.self_attn.k_proj.weight] | |
| Loading weights: 93%|#########3| 372/398 [00:00<00:00, 5203.66it/s, Materializing param=vision_model.encoder.layers.10.self_attn.out_proj.bias] | |
| Loading weights: 93%|#########3| 372/398 [00:00<00:00, 5200.42it/s, Materializing param=vision_model.encoder.layers.10.self_attn.out_proj.bias] | |
| Loading weights: 94%|#########3| 373/398 [00:00<00:00, 5208.67it/s, Materializing param=vision_model.encoder.layers.10.self_attn.out_proj.weight] | |
| Loading weights: 94%|#########3| 373/398 [00:00<00:00, 5205.43it/s, Materializing param=vision_model.encoder.layers.10.self_attn.out_proj.weight] | |
| Loading weights: 94%|#########3| 374/398 [00:00<00:00, 5213.66it/s, Materializing param=vision_model.encoder.layers.10.self_attn.q_proj.bias] | |
| Loading weights: 94%|#########3| 374/398 [00:00<00:00, 5210.63it/s, Materializing param=vision_model.encoder.layers.10.self_attn.q_proj.bias] | |
| Loading weights: 94%|#########4| 375/398 [00:00<00:00, 5218.34it/s, Materializing param=vision_model.encoder.layers.10.self_attn.q_proj.weight] | |
| Loading weights: 94%|#########4| 375/398 [00:00<00:00, 5215.36it/s, Materializing param=vision_model.encoder.layers.10.self_attn.q_proj.weight] | |
| Loading weights: 94%|#########4| 376/398 [00:00<00:00, 5223.62it/s, Materializing param=vision_model.encoder.layers.10.self_attn.v_proj.bias] | |
| Loading weights: 94%|#########4| 376/398 [00:00<00:00, 5220.60it/s, Materializing param=vision_model.encoder.layers.10.self_attn.v_proj.bias] | |
| Loading weights: 95%|#########4| 377/398 [00:00<00:00, 5228.75it/s, Materializing param=vision_model.encoder.layers.10.self_attn.v_proj.weight] | |
| Loading weights: 95%|#########4| 377/398 [00:00<00:00, 5225.80it/s, Materializing param=vision_model.encoder.layers.10.self_attn.v_proj.weight] | |
| Loading weights: 95%|#########4| 378/398 [00:00<00:00, 5233.98it/s, Materializing param=vision_model.encoder.layers.11.layer_norm1.bias] | |
| Loading weights: 95%|#########4| 378/398 [00:00<00:00, 5230.88it/s, Materializing param=vision_model.encoder.layers.11.layer_norm1.bias] | |
| Loading weights: 95%|#########5| 379/398 [00:00<00:00, 5238.94it/s, Materializing param=vision_model.encoder.layers.11.layer_norm1.weight] | |
| Loading weights: 95%|#########5| 379/398 [00:00<00:00, 5235.92it/s, Materializing param=vision_model.encoder.layers.11.layer_norm1.weight] | |
| Loading weights: 95%|#########5| 380/398 [00:00<00:00, 5244.19it/s, Materializing param=vision_model.encoder.layers.11.layer_norm2.bias] | |
| Loading weights: 95%|#########5| 380/398 [00:00<00:00, 5241.12it/s, Materializing param=vision_model.encoder.layers.11.layer_norm2.bias] | |
| Loading weights: 96%|#########5| 381/398 [00:00<00:00, 5249.39it/s, Materializing param=vision_model.encoder.layers.11.layer_norm2.weight] | |
| Loading weights: 96%|#########5| 381/398 [00:00<00:00, 5246.32it/s, Materializing param=vision_model.encoder.layers.11.layer_norm2.weight] | |
| Loading weights: 96%|#########5| 382/398 [00:00<00:00, 5254.68it/s, Materializing param=vision_model.encoder.layers.11.mlp.fc1.bias] | |
| Loading weights: 96%|#########5| 382/398 [00:00<00:00, 5251.54it/s, Materializing param=vision_model.encoder.layers.11.mlp.fc1.bias] | |
| Loading weights: 96%|#########6| 383/398 [00:00<00:00, 5259.50it/s, Materializing param=vision_model.encoder.layers.11.mlp.fc1.weight] | |
| Loading weights: 96%|#########6| 383/398 [00:00<00:00, 5256.28it/s, Materializing param=vision_model.encoder.layers.11.mlp.fc1.weight] | |
| Loading weights: 96%|#########6| 384/398 [00:00<00:00, 5264.35it/s, Materializing param=vision_model.encoder.layers.11.mlp.fc2.bias] | |
| Loading weights: 96%|#########6| 384/398 [00:00<00:00, 5261.33it/s, Materializing param=vision_model.encoder.layers.11.mlp.fc2.bias] | |
| Loading weights: 97%|#########6| 385/398 [00:00<00:00, 5269.54it/s, Materializing param=vision_model.encoder.layers.11.mlp.fc2.weight] | |
| Loading weights: 97%|#########6| 385/398 [00:00<00:00, 5266.53it/s, Materializing param=vision_model.encoder.layers.11.mlp.fc2.weight] | |
| Loading weights: 97%|#########6| 386/398 [00:00<00:00, 5274.67it/s, Materializing param=vision_model.encoder.layers.11.self_attn.k_proj.bias] | |
| Loading weights: 97%|#########6| 386/398 [00:00<00:00, 5271.61it/s, Materializing param=vision_model.encoder.layers.11.self_attn.k_proj.bias] | |
| Loading weights: 97%|#########7| 387/398 [00:00<00:00, 5279.61it/s, Materializing param=vision_model.encoder.layers.11.self_attn.k_proj.weight] | |
| Loading weights: 97%|#########7| 387/398 [00:00<00:00, 5276.39it/s, Materializing param=vision_model.encoder.layers.11.self_attn.k_proj.weight] | |
| Loading weights: 97%|#########7| 388/398 [00:00<00:00, 5284.28it/s, Materializing param=vision_model.encoder.layers.11.self_attn.out_proj.bias] | |
| Loading weights: 97%|#########7| 388/398 [00:00<00:00, 5281.20it/s, Materializing param=vision_model.encoder.layers.11.self_attn.out_proj.bias] | |
| Loading weights: 98%|#########7| 389/398 [00:00<00:00, 5289.26it/s, Materializing param=vision_model.encoder.layers.11.self_attn.out_proj.weight] | |
| Loading weights: 98%|#########7| 389/398 [00:00<00:00, 5286.23it/s, Materializing param=vision_model.encoder.layers.11.self_attn.out_proj.weight] | |
| Loading weights: 98%|#########7| 390/398 [00:00<00:00, 5294.26it/s, Materializing param=vision_model.encoder.layers.11.self_attn.q_proj.bias] | |
| Loading weights: 98%|#########7| 390/398 [00:00<00:00, 5291.25it/s, Materializing param=vision_model.encoder.layers.11.self_attn.q_proj.bias] | |
| Loading weights: 98%|#########8| 391/398 [00:00<00:00, 5299.30it/s, Materializing param=vision_model.encoder.layers.11.self_attn.q_proj.weight] | |
| Loading weights: 98%|#########8| 391/398 [00:00<00:00, 5296.16it/s, Materializing param=vision_model.encoder.layers.11.self_attn.q_proj.weight] | |
| Loading weights: 98%|#########8| 392/398 [00:00<00:00, 5294.59it/s, Materializing param=vision_model.encoder.layers.11.self_attn.v_proj.bias] | |
| Loading weights: 98%|#########8| 392/398 [00:00<00:00, 5287.65it/s, Materializing param=vision_model.encoder.layers.11.self_attn.v_proj.bias] | |
| Loading weights: 99%|#########8| 393/398 [00:00<00:00, 5288.68it/s, Materializing param=vision_model.encoder.layers.11.self_attn.v_proj.weight] | |
| Loading weights: 99%|#########8| 393/398 [00:00<00:00, 5283.38it/s, Materializing param=vision_model.encoder.layers.11.self_attn.v_proj.weight] | |
| Loading weights: 99%|#########8| 394/398 [00:00<00:00, 5288.74it/s, Materializing param=vision_model.post_layernorm.bias] | |
| Loading weights: 99%|#########8| 394/398 [00:00<00:00, 5285.46it/s, Materializing param=vision_model.post_layernorm.bias] | |
| Loading weights: 99%|#########9| 395/398 [00:00<00:00, 5293.18it/s, Materializing param=vision_model.post_layernorm.weight] | |
| Loading weights: 99%|#########9| 395/398 [00:00<00:00, 5290.22it/s, Materializing param=vision_model.post_layernorm.weight] | |
| Loading weights: 99%|#########9| 396/398 [00:00<00:00, 5298.51it/s, Materializing param=vision_model.pre_layrnorm.bias] | |
| Loading weights: 99%|#########9| 396/398 [00:00<00:00, 5295.60it/s, Materializing param=vision_model.pre_layrnorm.bias] | |
| Loading weights: 100%|#########9| 397/398 [00:00<00:00, 5303.80it/s, Materializing param=vision_model.pre_layrnorm.weight] | |
| Loading weights: 100%|#########9| 397/398 [00:00<00:00, 5300.98it/s, Materializing param=vision_model.pre_layrnorm.weight] | |
| Loading weights: 100%|##########| 398/398 [00:00<00:00, 5309.31it/s, Materializing param=visual_projection.weight] | |
| Loading weights: 100%|##########| 398/398 [00:00<00:00, 5306.55it/s, Materializing param=visual_projection.weight] | |
| Loading weights: 100%|##########| 398/398 [00:00<00:00, 5299.89it/s, Materializing param=visual_projection.weight] | |
| CLIPModel LOAD REPORT from: openai/clip-vit-base-patch32 | |
| Key | Status | | | |
| -------------------------------------+------------+--+- | |
| vision_model.embeddings.position_ids | UNEXPECTED | | | |
| text_model.embeddings.position_ids | UNEXPECTED | | | |
| Notes: | |
| - UNEXPECTED :can be ignored when loading from different task/architecture; not ok if you expect identical arch. | |
| Warning: You are sending unauthenticated requests to the HF Hub. Please set a HF_TOKEN to enable higher rate limits and faster downloads. | |
| The image processor of type `CLIPImageProcessor` is now loaded as a fast processor by default, even if the model checkpoint was saved with a slow processor. This is a breaking change and may produce slightly different outputs. To continue using the slow processor, instantiate this class with `use_fast=False`. | |
| Running search_text... | |
| Error caught in test script! | |
| Traceback (most recent call last): | |
| File "E:\GitHub\BOOTH-Lens\backend\app\routers\search.py", line 100, in search_text | |
| results = vector_db.search_similar( | |
| vector, | |
| ...<3 lines>... | |
| colors=query_data.colors | |
| ) | |
| File "E:\GitHub\BOOTH-Lens\backend\app\services\vector_db.py", line 177, in search_similar | |
| raw_results = self.client.query_points( | |
| ~~~~~~~~~~~~~~~~~~~~~~~~^ | |
| collection_name=self.collection_name, | |
| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | |
| ...<3 lines>... | |
| with_payload=True | |
| ^^^^^^^^^^^^^^^^^ | |
| ).points | |
| ^ | |
| File "C:\Users\tyari\AppData\Roaming\Python\Python314\site-packages\qdrant_client\qdrant_client.py", line 423, in query_points | |
| return self._client.query_points( | |
| ~~~~~~~~~~~~~~~~~~~~~~~~~^ | |
| collection_name=collection_name, | |
| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | |
| ...<14 lines>... | |
| **kwargs, | |
| ^^^^^^^^^ | |
| ) | |
| ^ | |
| File "C:\Users\tyari\AppData\Roaming\Python\Python314\site-packages\qdrant_client\qdrant_remote.py", line 538, in query_points | |
| query_result = self.http.search_api.query_points( | |
| collection_name=collection_name, | |
| ...<2 lines>... | |
| query_request=query_request, | |
| ) | |
| File "C:\Users\tyari\AppData\Roaming\Python\Python314\site-packages\qdrant_client\http\api\search_api.py", line 783, in query_points | |
| return self._build_for_query_points( | |
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~^ | |
| collection_name=collection_name, | |
| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | |
| ...<2 lines>... | |
| query_request=query_request, | |
| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | |
| ) | |
| ^ | |
| File "C:\Users\tyari\AppData\Roaming\Python\Python314\site-packages\qdrant_client\http\api\search_api.py", line 181, in _build_for_query_points | |
| return self.api_client.request( | |
| ~~~~~~~~~~~~~~~~~~~~~~~^ | |
| type_=m.InlineResponse20021, | |
| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | |
| ...<5 lines>... | |
| content=body, | |
| ^^^^^^^^^^^^^ | |
| ) | |
| ^ | |
| File "C:\Users\tyari\AppData\Roaming\Python\Python314\site-packages\qdrant_client\http\api_client.py", line 95, in request | |
| return self.send(request, type_) | |
| ~~~~~~~~~^^^^^^^^^^^^^^^^ | |
| File "C:\Users\tyari\AppData\Roaming\Python\Python314\site-packages\qdrant_client\http\api_client.py", line 130, in send | |
| raise UnexpectedResponse.for_response(response) | |
| qdrant_client.http.exceptions.UnexpectedResponse: Unexpected Response: 400 (Bad Request) | |
| Raw response content: | |
| b'{"status":{"error":"Bad request: Index required but not found for \\"shopName\\" of one of the following types: [keyword]. Help: Create an index for this key or use a different filter."},"time":0.000 ...' | |
| During handling of the above exception, another exception occurred: | |
| Traceback (most recent call last): | |
| File "E:\GitHub\BOOTH-Lens\backend\test_search.py", line 17, in main | |
| res = await search_text(q, i, v, user) | |
| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ | |
| File "E:\GitHub\BOOTH-Lens\backend\app\routers\search.py", line 118, in search_text | |
| raise HTTPException(status_code=500, detail=str(e)) | |
| fastapi.exceptions.HTTPException: 500: Unexpected Response: 400 (Bad Request) | |
| Raw response content: | |
| b'{"status":{"error":"Bad request: Index required but not found for \\"shopName\\" of one of the following types: [keyword]. Help: Create an index for this key or use a different filter."},"time":0.000 ...' | |
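The 400 response above says the filter on `shopName` needs a payload index of type `keyword`, and suggests creating one. A minimal sketch of that step follows, assuming a `qdrant_client.QdrantClient` instance is passed in. The helper name `ensure_keyword_index`, the collection name, and the URL in the usage comment are hypothetical and do not come from the log. `create_payload_index` is the qdrant-client method for this; the string `"keyword"` is used for the schema on the assumption that the installed client version accepts it in place of the `PayloadSchemaType.KEYWORD` enum.

```python
from typing import Any


def ensure_keyword_index(client: Any, collection_name: str,
                         field_name: str = "shopName") -> None:
    """Ask Qdrant to build a keyword payload index for `field_name`.

    `client` is assumed to be a qdrant_client.QdrantClient (or any object
    exposing a compatible `create_payload_index` method).
    """
    client.create_payload_index(
        collection_name=collection_name,
        field_name=field_name,
        # Assumption: the string form is accepted; the enum equivalent is
        # qdrant_client.models.PayloadSchemaType.KEYWORD.
        field_schema="keyword",
    )


# Hypothetical usage (URL and collection name are placeholders):
# from qdrant_client import QdrantClient
# client = QdrantClient(url="http://localhost:6333")
# ensure_keyword_index(client, "my_collection")
```

Running something like this once against the collection that `search_similar` queries, before retrying `search_text`, should address the specific "Index required but not found" error. Other filtered fields may need their own indexes if the server reports them the same way.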