| INFO: Started server process [23584] |
| INFO: Waiting for application startup. |
| INFO: Application startup complete. |
| INFO: Uvicorn running on http://127.0.0.1:8001 (Press CTRL+C to quit) |
| Warning: You are sending unauthenticated requests to the HF Hub. Please set a HF_TOKEN to enable higher rate limits and faster downloads. |
| Loading YOLO model from: E:\GitHub\BOOTH-Lens\backend\runs\detect\runs\detect\train_v4_refinement\weights\best.pt |
| Database initialized (SQLite). |
|
|
| Loading weights: 0%| | 0/398 [00:00<?, ?it/s] |
| Loading weights: 0%| | 1/398 [00:00<00:00, 14979.66it/s, Materializing param=logit_scale] |
| Loading weights: 0%| | 1/398 [00:00<00:00, 5940.94it/s, Materializing param=logit_scale] |
| Loading weights: 1%| | 2/398 [00:00<00:00, 4874.26it/s, Materializing param=text_model.embeddings.position_embedding.weight] |
| Loading weights: 1%| | 2/398 [00:00<00:00, 4251.70it/s, Materializing param=text_model.embeddings.position_embedding.weight] |
| Loading weights: 1%| | 3/398 [00:00<00:00, 5043.25it/s, Materializing param=text_model.embeddings.token_embedding.weight] |
| Loading weights: 1%| | 3/398 [00:00<00:00, 4641.43it/s, Materializing param=text_model.embeddings.token_embedding.weight] |
| Loading weights: 1%|1 | 4/398 [00:00<00:00, 5443.61it/s, Materializing param=text_model.encoder.layers.0.layer_norm1.bias] |
| Loading weights: 1%|1 | 4/398 [00:00<00:00, 5133.79it/s, Materializing param=text_model.encoder.layers.0.layer_norm1.bias] |
| Loading weights: 1%|1 | 5/398 [00:00<00:00, 5728.36it/s, Materializing param=text_model.encoder.layers.0.layer_norm1.weight] |
| Loading weights: 1%|1 | 5/398 [00:00<00:00, 5452.81it/s, Materializing param=text_model.encoder.layers.0.layer_norm1.weight] |
| Loading weights: 2%|1 | 6/398 [00:00<00:00, 6000.43it/s, Materializing param=text_model.encoder.layers.0.layer_norm2.bias] |
| Loading weights: 2%|1 | 6/398 [00:00<00:00, 5735.15it/s, Materializing param=text_model.encoder.layers.0.layer_norm2.bias] |
| Loading weights: 2%|1 | 7/398 [00:00<00:00, 6191.51it/s, Materializing param=text_model.encoder.layers.0.layer_norm2.weight] |
| Loading weights: 2%|1 | 7/398 [00:00<00:00, 5948.16it/s, Materializing param=text_model.encoder.layers.0.layer_norm2.weight] |
| Loading weights: 2%|2 | 8/398 [00:00<00:00, 6346.59it/s, Materializing param=text_model.encoder.layers.0.mlp.fc1.bias] |
| Loading weights: 2%|2 | 8/398 [00:00<00:00, 6124.19it/s, Materializing param=text_model.encoder.layers.0.mlp.fc1.bias] |
| Loading weights: 2%|2 | 9/398 [00:00<00:00, 6454.98it/s, Materializing param=text_model.encoder.layers.0.mlp.fc1.weight] |
| Loading weights: 2%|2 | 9/398 [00:00<00:00, 6251.86it/s, Materializing param=text_model.encoder.layers.0.mlp.fc1.weight] |
| Loading weights: 3%|2 | 10/398 [00:00<00:00, 6561.80it/s, Materializing param=text_model.encoder.layers.0.mlp.fc2.bias] |
| Loading weights: 3%|2 | 10/398 [00:00<00:00, 6372.39it/s, Materializing param=text_model.encoder.layers.0.mlp.fc2.bias] |
| Loading weights: 3%|2 | 11/398 [00:00<00:00, 6629.88it/s, Materializing param=text_model.encoder.layers.0.mlp.fc2.weight] |
| Loading weights: 3%|2 | 11/398 [00:00<00:00, 6460.00it/s, Materializing param=text_model.encoder.layers.0.mlp.fc2.weight] |
| Loading weights: 3%|3 | 12/398 [00:00<00:00, 6717.16it/s, Materializing param=text_model.encoder.layers.0.self_attn.k_proj.bias] |
| Loading weights: 3%|3 | 12/398 [00:00<00:00, 6545.08it/s, Materializing param=text_model.encoder.layers.0.self_attn.k_proj.bias] |
| Loading weights: 3%|3 | 13/398 [00:00<00:00, 6765.85it/s, Materializing param=text_model.encoder.layers.0.self_attn.k_proj.weight] |
| Loading weights: 3%|3 | 13/398 [00:00<00:00, 6606.80it/s, Materializing param=text_model.encoder.layers.0.self_attn.k_proj.weight] |
| Loading weights: 4%|3 | 14/398 [00:00<00:00, 6816.05it/s, Materializing param=text_model.encoder.layers.0.self_attn.out_proj.bias] |
| Loading weights: 4%|3 | 14/398 [00:00<00:00, 6663.67it/s, Materializing param=text_model.encoder.layers.0.self_attn.out_proj.bias] |
| Loading weights: 4%|3 | 15/398 [00:00<00:00, 6860.16it/s, Materializing param=text_model.encoder.layers.0.self_attn.out_proj.weight] |
| Loading weights: 4%|3 | 15/398 [00:00<00:00, 6718.77it/s, Materializing param=text_model.encoder.layers.0.self_attn.out_proj.weight] |
| Loading weights: 4%|4 | 16/398 [00:00<00:00, 6905.63it/s, Materializing param=text_model.encoder.layers.0.self_attn.q_proj.bias] |
| Loading weights: 4%|4 | 16/398 [00:00<00:00, 6769.10it/s, Materializing param=text_model.encoder.layers.0.self_attn.q_proj.bias] |
| Loading weights: 4%|4 | 17/398 [00:00<00:00, 6944.89it/s, Materializing param=text_model.encoder.layers.0.self_attn.q_proj.weight] |
| Loading weights: 4%|4 | 17/398 [00:00<00:00, 6816.09it/s, Materializing param=text_model.encoder.layers.0.self_attn.q_proj.weight] |
| Loading weights: 5%|4 | 18/398 [00:00<00:00, 6979.52it/s, Materializing param=text_model.encoder.layers.0.self_attn.v_proj.bias] |
| Loading weights: 5%|4 | 18/398 [00:00<00:00, 6857.17it/s, Materializing param=text_model.encoder.layers.0.self_attn.v_proj.bias] |
| Loading weights: 5%|4 | 19/398 [00:00<00:00, 7010.80it/s, Materializing param=text_model.encoder.layers.0.self_attn.v_proj.weight] |
| Loading weights: 5%|4 | 19/398 [00:00<00:00, 6886.01it/s, Materializing param=text_model.encoder.layers.0.self_attn.v_proj.weight] |
| Loading weights: 5%|5 | 20/398 [00:00<00:00, 6970.76it/s, Materializing param=text_model.encoder.layers.1.layer_norm1.bias] |
| Loading weights: 5%|5 | 20/398 [00:00<00:00, 6829.45it/s, Materializing param=text_model.encoder.layers.1.layer_norm1.bias] |
| Loading weights: 5%|5 | 21/398 [00:00<00:00, 6927.28it/s, Materializing param=text_model.encoder.layers.1.layer_norm1.weight] |
| Loading weights: 5%|5 | 21/398 [00:00<00:00, 6801.57it/s, Materializing param=text_model.encoder.layers.1.layer_norm1.weight] |
| Loading weights: 6%|5 | 22/398 [00:00<00:00, 6876.93it/s, Materializing param=text_model.encoder.layers.1.layer_norm2.bias] |
| Loading weights: 6%|5 | 22/398 [00:00<00:00, 6760.55it/s, Materializing param=text_model.encoder.layers.1.layer_norm2.bias] |
| Loading weights: 6%|5 | 23/398 [00:00<00:00, 6852.46it/s, Materializing param=text_model.encoder.layers.1.layer_norm2.weight] |
| Loading weights: 6%|5 | 23/398 [00:00<00:00, 6734.31it/s, Materializing param=text_model.encoder.layers.1.layer_norm2.weight] |
| Loading weights: 6%|6 | 24/398 [00:00<00:00, 6807.55it/s, Materializing param=text_model.encoder.layers.1.mlp.fc1.bias] |
| Loading weights: 6%|6 | 24/398 [00:00<00:00, 6682.37it/s, Materializing param=text_model.encoder.layers.1.mlp.fc1.bias] |
| Loading weights: 6%|6 | 25/398 [00:00<00:00, 6748.46it/s, Materializing param=text_model.encoder.layers.1.mlp.fc1.weight] |
| Loading weights: 6%|6 | 25/398 [00:00<00:00, 6651.29it/s, Materializing param=text_model.encoder.layers.1.mlp.fc1.weight] |
| Loading weights: 7%|6 | 26/398 [00:00<00:00, 6725.37it/s, Materializing param=text_model.encoder.layers.1.mlp.fc2.bias] |
| Loading weights: 7%|6 | 26/398 [00:00<00:00, 6638.17it/s, Materializing param=text_model.encoder.layers.1.mlp.fc2.bias] |
| Loading weights: 7%|6 | 27/398 [00:00<00:00, 6712.48it/s, Materializing param=text_model.encoder.layers.1.mlp.fc2.weight] |
| Loading weights: 7%|6 | 27/398 [00:00<00:00, 6613.30it/s, Materializing param=text_model.encoder.layers.1.mlp.fc2.weight] |
| Loading weights: 7%|7 | 28/398 [00:00<00:00, 6673.51it/s, Materializing param=text_model.encoder.layers.1.self_attn.k_proj.bias] |
| Loading weights: 7%|7 | 28/398 [00:00<00:00, 6576.72it/s, Materializing param=text_model.encoder.layers.1.self_attn.k_proj.bias] |
| Loading weights: 7%|7 | 29/398 [00:00<00:00, 6636.19it/s, Materializing param=text_model.encoder.layers.1.self_attn.k_proj.weight] |
| Loading weights: 7%|7 | 29/398 [00:00<00:00, 6540.56it/s, Materializing param=text_model.encoder.layers.1.self_attn.k_proj.weight] |
| Loading weights: 8%|7 | 30/398 [00:00<00:00, 6598.62it/s, Materializing param=text_model.encoder.layers.1.self_attn.out_proj.bias] |
| Loading weights: 8%|7 | 30/398 [00:00<00:00, 6508.18it/s, Materializing param=text_model.encoder.layers.1.self_attn.out_proj.bias] |
| Loading weights: 8%|7 | 31/398 [00:00<00:00, 6568.83it/s, Materializing param=text_model.encoder.layers.1.self_attn.out_proj.weight] |
| Loading weights: 8%|7 | 31/398 [00:00<00:00, 6474.63it/s, Materializing param=text_model.encoder.layers.1.self_attn.out_proj.weight] |
| Loading weights: 8%|8 | 32/398 [00:00<00:00, 6523.66it/s, Materializing param=text_model.encoder.layers.1.self_attn.q_proj.bias] |
| Loading weights: 8%|8 | 32/398 [00:00<00:00, 6440.70it/s, Materializing param=text_model.encoder.layers.1.self_attn.q_proj.bias] |
| Loading weights: 8%|8 | 33/398 [00:00<00:00, 6490.60it/s, Materializing param=text_model.encoder.layers.1.self_attn.q_proj.weight] |
| Loading weights: 8%|8 | 33/398 [00:00<00:00, 6410.34it/s, Materializing param=text_model.encoder.layers.1.self_attn.q_proj.weight] |
| Loading weights: 9%|8 | 34/398 [00:00<00:00, 6466.23it/s, Materializing param=text_model.encoder.layers.1.self_attn.v_proj.bias] |
| Loading weights: 9%|8 | 34/398 [00:00<00:00, 6393.18it/s, Materializing param=text_model.encoder.layers.1.self_attn.v_proj.bias] |
| Loading weights: 9%|8 | 35/398 [00:00<00:00, 6443.43it/s, Materializing param=text_model.encoder.layers.1.self_attn.v_proj.weight] |
| Loading weights: 9%|8 | 35/398 [00:00<00:00, 6367.96it/s, Materializing param=text_model.encoder.layers.1.self_attn.v_proj.weight] |
| Loading weights: 9%|9 | 36/398 [00:00<00:00, 6413.58it/s, Materializing param=text_model.encoder.layers.2.layer_norm1.bias] |
| Loading weights: 9%|9 | 36/398 [00:00<00:00, 6350.20it/s, Materializing param=text_model.encoder.layers.2.layer_norm1.bias] |
| Loading weights: 9%|9 | 37/398 [00:00<00:00, 6401.40it/s, Materializing param=text_model.encoder.layers.2.layer_norm1.weight] |
| Loading weights: 9%|9 | 37/398 [00:00<00:00, 6336.32it/s, Materializing param=text_model.encoder.layers.2.layer_norm1.weight] |
| Loading weights: 10%|9 | 38/398 [00:00<00:00, 6385.05it/s, Materializing param=text_model.encoder.layers.2.layer_norm2.bias] |
| Loading weights: 10%|9 | 38/398 [00:00<00:00, 6318.73it/s, Materializing param=text_model.encoder.layers.2.layer_norm2.bias] |
| Loading weights: 10%|9 | 39/398 [00:00<00:00, 6368.12it/s, Materializing param=text_model.encoder.layers.2.layer_norm2.weight] |
| Loading weights: 10%|9 | 39/398 [00:00<00:00, 6308.20it/s, Materializing param=text_model.encoder.layers.2.layer_norm2.weight] |
| Loading weights: 10%|# | 40/398 [00:00<00:00, 6365.37it/s, Materializing param=text_model.encoder.layers.2.mlp.fc1.bias] |
| Loading weights: 10%|# | 40/398 [00:00<00:00, 6312.68it/s, Materializing param=text_model.encoder.layers.2.mlp.fc1.bias] |
| Loading weights: 10%|# | 41/398 [00:00<00:00, 6372.67it/s, Materializing param=text_model.encoder.layers.2.mlp.fc1.weight] |
| Loading weights: 10%|# | 41/398 [00:00<00:00, 6315.56it/s, Materializing param=text_model.encoder.layers.2.mlp.fc1.weight] |
| Loading weights: 11%|# | 42/398 [00:00<00:00, 6367.87it/s, Materializing param=text_model.encoder.layers.2.mlp.fc2.bias] |
| Loading weights: 11%|# | 42/398 [00:00<00:00, 6313.33it/s, Materializing param=text_model.encoder.layers.2.mlp.fc2.bias] |
| Loading weights: 11%|# | 43/398 [00:00<00:00, 6360.83it/s, Materializing param=text_model.encoder.layers.2.mlp.fc2.weight] |
| Loading weights: 11%|# | 43/398 [00:00<00:00, 6305.24it/s, Materializing param=text_model.encoder.layers.2.mlp.fc2.weight] |
| Loading weights: 11%|#1 | 44/398 [00:00<00:00, 6353.69it/s, Materializing param=text_model.encoder.layers.2.self_attn.k_proj.bias] |
| Loading weights: 11%|#1 | 44/398 [00:00<00:00, 6301.19it/s, Materializing param=text_model.encoder.layers.2.self_attn.k_proj.bias] |
| Loading weights: 11%|#1 | 45/398 [00:00<00:00, 6350.94it/s, Materializing param=text_model.encoder.layers.2.self_attn.k_proj.weight] |
| Loading weights: 11%|#1 | 45/398 [00:00<00:00, 6297.96it/s, Materializing param=text_model.encoder.layers.2.self_attn.k_proj.weight] |
| Loading weights: 12%|#1 | 46/398 [00:00<00:00, 6344.56it/s, Materializing param=text_model.encoder.layers.2.self_attn.out_proj.bias] |
| Loading weights: 12%|#1 | 46/398 [00:00<00:00, 6293.85it/s, Materializing param=text_model.encoder.layers.2.self_attn.out_proj.bias] |
| Loading weights: 12%|#1 | 47/398 [00:00<00:00, 6339.27it/s, Materializing param=text_model.encoder.layers.2.self_attn.out_proj.weight] |
| Loading weights: 12%|#1 | 47/398 [00:00<00:00, 6288.71it/s, Materializing param=text_model.encoder.layers.2.self_attn.out_proj.weight] |
| Loading weights: 12%|#2 | 48/398 [00:00<00:00, 6334.81it/s, Materializing param=text_model.encoder.layers.2.self_attn.q_proj.bias] |
| Loading weights: 12%|#2 | 48/398 [00:00<00:00, 6280.07it/s, Materializing param=text_model.encoder.layers.2.self_attn.q_proj.bias] |
| Loading weights: 12%|#2 | 49/398 [00:00<00:00, 6324.69it/s, Materializing param=text_model.encoder.layers.2.self_attn.q_proj.weight] |
| Loading weights: 12%|#2 | 49/398 [00:00<00:00, 6277.56it/s, Materializing param=text_model.encoder.layers.2.self_attn.q_proj.weight] |
| Loading weights: 13%|#2 | 50/398 [00:00<00:00, 6321.67it/s, Materializing param=text_model.encoder.layers.2.self_attn.v_proj.bias] |
| Loading weights: 13%|#2 | 50/398 [00:00<00:00, 6275.52it/s, Materializing param=text_model.encoder.layers.2.self_attn.v_proj.bias] |
| Loading weights: 13%|#2 | 51/398 [00:00<00:00, 6317.10it/s, Materializing param=text_model.encoder.layers.2.self_attn.v_proj.weight] |
| Loading weights: 13%|#2 | 51/398 [00:00<00:00, 6270.62it/s, Materializing param=text_model.encoder.layers.2.self_attn.v_proj.weight] |
| Loading weights: 13%|#3 | 52/398 [00:00<00:00, 6313.07it/s, Materializing param=text_model.encoder.layers.3.layer_norm1.bias] |
| Loading weights: 13%|#3 | 52/398 [00:00<00:00, 6266.99it/s, Materializing param=text_model.encoder.layers.3.layer_norm1.bias] |
| Loading weights: 13%|#3 | 53/398 [00:00<00:00, 6306.87it/s, Materializing param=text_model.encoder.layers.3.layer_norm1.weight] |
| Loading weights: 13%|#3 | 53/398 [00:00<00:00, 6261.92it/s, Materializing param=text_model.encoder.layers.3.layer_norm1.weight] |
| Loading weights: 14%|#3 | 54/398 [00:00<00:00, 6303.01it/s, Materializing param=text_model.encoder.layers.3.layer_norm2.bias] |
| Loading weights: 14%|#3 | 54/398 [00:00<00:00, 6258.77it/s, Materializing param=text_model.encoder.layers.3.layer_norm2.bias] |
| Loading weights: 14%|#3 | 55/398 [00:00<00:00, 6295.69it/s, Materializing param=text_model.encoder.layers.3.layer_norm2.weight] |
| Loading weights: 14%|#3 | 55/398 [00:00<00:00, 6253.54it/s, Materializing param=text_model.encoder.layers.3.layer_norm2.weight] |
| Loading weights: 14%|#4 | 56/398 [00:00<00:00, 6292.52it/s, Materializing param=text_model.encoder.layers.3.mlp.fc1.bias] |
| Loading weights: 14%|#4 | 56/398 [00:00<00:00, 6253.32it/s, Materializing param=text_model.encoder.layers.3.mlp.fc1.bias] |
| Loading weights: 14%|#4 | 57/398 [00:00<00:00, 6293.61it/s, Materializing param=text_model.encoder.layers.3.mlp.fc1.weight] |
| Loading weights: 14%|#4 | 57/398 [00:00<00:00, 6256.22it/s, Materializing param=text_model.encoder.layers.3.mlp.fc1.weight] |
| Loading weights: 15%|#4 | 58/398 [00:00<00:00, 6283.28it/s, Materializing param=text_model.encoder.layers.3.mlp.fc2.bias] |
| Loading weights: 15%|#4 | 58/398 [00:00<00:00, 6242.16it/s, Materializing param=text_model.encoder.layers.3.mlp.fc2.bias] |
| Loading weights: 15%|#4 | 59/398 [00:00<00:00, 6273.80it/s, Materializing param=text_model.encoder.layers.3.mlp.fc2.weight] |
| Loading weights: 15%|#4 | 59/398 [00:00<00:00, 6234.60it/s, Materializing param=text_model.encoder.layers.3.mlp.fc2.weight] |
| Loading weights: 15%|#5 | 60/398 [00:00<00:00, 6269.67it/s, Materializing param=text_model.encoder.layers.3.self_attn.k_proj.bias] |
| Loading weights: 15%|#5 | 60/398 [00:00<00:00, 6223.93it/s, Materializing param=text_model.encoder.layers.3.self_attn.k_proj.bias] |
| Loading weights: 15%|#5 | 61/398 [00:00<00:00, 6252.96it/s, Materializing param=text_model.encoder.layers.3.self_attn.k_proj.weight] |
| Loading weights: 15%|#5 | 61/398 [00:00<00:00, 6214.24it/s, Materializing param=text_model.encoder.layers.3.self_attn.k_proj.weight] |
| Loading weights: 16%|#5 | 62/398 [00:00<00:00, 6236.58it/s, Materializing param=text_model.encoder.layers.3.self_attn.out_proj.bias] |
| Loading weights: 16%|#5 | 62/398 [00:00<00:00, 6199.12it/s, Materializing param=text_model.encoder.layers.3.self_attn.out_proj.bias] |
| Loading weights: 16%|#5 | 63/398 [00:00<00:00, 6229.16it/s, Materializing param=text_model.encoder.layers.3.self_attn.out_proj.weight] |
| Loading weights: 16%|#5 | 63/398 [00:00<00:00, 6192.23it/s, Materializing param=text_model.encoder.layers.3.self_attn.out_proj.weight] |
| Loading weights: 16%|#6 | 64/398 [00:00<00:00, 6225.31it/s, Materializing param=text_model.encoder.layers.3.self_attn.q_proj.bias] |
| Loading weights: 16%|#6 | 64/398 [00:00<00:00, 6186.86it/s, Materializing param=text_model.encoder.layers.3.self_attn.q_proj.bias] |
| Loading weights: 16%|#6 | 65/398 [00:00<00:00, 6222.58it/s, Materializing param=text_model.encoder.layers.3.self_attn.q_proj.weight] |
| Loading weights: 16%|#6 | 65/398 [00:00<00:00, 6188.68it/s, Materializing param=text_model.encoder.layers.3.self_attn.q_proj.weight] |
| Loading weights: 17%|#6 | 66/398 [00:00<00:00, 6218.95it/s, Materializing param=text_model.encoder.layers.3.self_attn.v_proj.bias] |
| Loading weights: 17%|#6 | 66/398 [00:00<00:00, 6181.32it/s, Materializing param=text_model.encoder.layers.3.self_attn.v_proj.bias] |
| Loading weights: 17%|#6 | 67/398 [00:00<00:00, 6207.20it/s, Materializing param=text_model.encoder.layers.3.self_attn.v_proj.weight] |
| Loading weights: 17%|#6 | 67/398 [00:00<00:00, 6173.51it/s, Materializing param=text_model.encoder.layers.3.self_attn.v_proj.weight] |
| Loading weights: 17%|#7 | 68/398 [00:00<00:00, 6202.84it/s, Materializing param=text_model.encoder.layers.4.layer_norm1.bias] |
| Loading weights: 17%|#7 | 68/398 [00:00<00:00, 6170.50it/s, Materializing param=text_model.encoder.layers.4.layer_norm1.bias] |
| Loading weights: 17%|#7 | 69/398 [00:00<00:00, 6201.27it/s, Materializing param=text_model.encoder.layers.4.layer_norm1.weight] |
| Loading weights: 17%|#7 | 69/398 [00:00<00:00, 6168.36it/s, Materializing param=text_model.encoder.layers.4.layer_norm1.weight] |
| Loading weights: 18%|#7 | 70/398 [00:00<00:00, 6197.78it/s, Materializing param=text_model.encoder.layers.4.layer_norm2.bias] |
| Loading weights: 18%|#7 | 70/398 [00:00<00:00, 6166.02it/s, Materializing param=text_model.encoder.layers.4.layer_norm2.bias] |
| Loading weights: 18%|#7 | 71/398 [00:00<00:00, 6195.43it/s, Materializing param=text_model.encoder.layers.4.layer_norm2.weight] |
| Loading weights: 18%|#7 | 71/398 [00:00<00:00, 6164.26it/s, Materializing param=text_model.encoder.layers.4.layer_norm2.weight] |
| Loading weights: 18%|#8 | 72/398 [00:00<00:00, 6191.36it/s, Materializing param=text_model.encoder.layers.4.mlp.fc1.bias] |
| Loading weights: 18%|#8 | 72/398 [00:00<00:00, 6163.69it/s, Materializing param=text_model.encoder.layers.4.mlp.fc1.bias] |
| Loading weights: 18%|#8 | 73/398 [00:00<00:00, 6193.42it/s, Materializing param=text_model.encoder.layers.4.mlp.fc1.weight] |
| Loading weights: 18%|#8 | 73/398 [00:00<00:00, 6163.00it/s, Materializing param=text_model.encoder.layers.4.mlp.fc1.weight] |
| Loading weights: 19%|#8 | 74/398 [00:00<00:00, 6189.25it/s, Materializing param=text_model.encoder.layers.4.mlp.fc2.bias] |
| Loading weights: 19%|#8 | 74/398 [00:00<00:00, 6158.30it/s, Materializing param=text_model.encoder.layers.4.mlp.fc2.bias] |
| Loading weights: 19%|#8 | 75/398 [00:00<00:00, 6149.77it/s, Materializing param=text_model.encoder.layers.4.mlp.fc2.weight] |
| Loading weights: 19%|#8 | 75/398 [00:00<00:00, 6122.83it/s, Materializing param=text_model.encoder.layers.4.mlp.fc2.weight] |
| Loading weights: 19%|#9 | 76/398 [00:00<00:00, 6146.80it/s, Materializing param=text_model.encoder.layers.4.self_attn.k_proj.bias] |
| Loading weights: 19%|#9 | 76/398 [00:00<00:00, 6119.78it/s, Materializing param=text_model.encoder.layers.4.self_attn.k_proj.bias] |
| Loading weights: 19%|#9 | 77/398 [00:00<00:00, 6145.21it/s, Materializing param=text_model.encoder.layers.4.self_attn.k_proj.weight] |
| Loading weights: 19%|#9 | 77/398 [00:00<00:00, 6116.81it/s, Materializing param=text_model.encoder.layers.4.self_attn.k_proj.weight] |
| Loading weights: 20%|#9 | 78/398 [00:00<00:00, 6141.35it/s, Materializing param=text_model.encoder.layers.4.self_attn.out_proj.bias] |
| Loading weights: 20%|#9 | 78/398 [00:00<00:00, 6109.69it/s, Materializing param=text_model.encoder.layers.4.self_attn.out_proj.bias] |
| Loading weights: 20%|#9 | 79/398 [00:00<00:00, 6134.75it/s, Materializing param=text_model.encoder.layers.4.self_attn.out_proj.weight] |
| Loading weights: 20%|#9 | 79/398 [00:00<00:00, 6110.88it/s, Materializing param=text_model.encoder.layers.4.self_attn.out_proj.weight] |
| Loading weights: 20%|## | 80/398 [00:00<00:00, 6144.60it/s, Materializing param=text_model.encoder.layers.4.self_attn.q_proj.bias] |
| Loading weights: 20%|## | 80/398 [00:00<00:00, 6121.17it/s, Materializing param=text_model.encoder.layers.4.self_attn.q_proj.bias] |
| Loading weights: 20%|## | 81/398 [00:00<00:00, 6152.12it/s, Materializing param=text_model.encoder.layers.4.self_attn.q_proj.weight] |
| Loading weights: 20%|## | 81/398 [00:00<00:00, 6123.18it/s, Materializing param=text_model.encoder.layers.4.self_attn.q_proj.weight] |
| Loading weights: 21%|## | 82/398 [00:00<00:00, 6145.83it/s, Materializing param=text_model.encoder.layers.4.self_attn.v_proj.bias] |
| Loading weights: 21%|## | 82/398 [00:00<00:00, 6119.91it/s, Materializing param=text_model.encoder.layers.4.self_attn.v_proj.bias] |
| Loading weights: 21%|## | 83/398 [00:00<00:00, 6146.10it/s, Materializing param=text_model.encoder.layers.4.self_attn.v_proj.weight] |
| Loading weights: 21%|## | 83/398 [00:00<00:00, 6116.94it/s, Materializing param=text_model.encoder.layers.4.self_attn.v_proj.weight] |
| Loading weights: 21%|##1 | 84/398 [00:00<00:00, 6138.11it/s, Materializing param=text_model.encoder.layers.5.layer_norm1.bias] |
| Loading weights: 21%|##1 | 84/398 [00:00<00:00, 6113.30it/s, Materializing param=text_model.encoder.layers.5.layer_norm1.bias] |
| Loading weights: 21%|##1 | 85/398 [00:00<00:00, 6136.46it/s, Materializing param=text_model.encoder.layers.5.layer_norm1.weight] |
| Loading weights: 21%|##1 | 85/398 [00:00<00:00, 6108.80it/s, Materializing param=text_model.encoder.layers.5.layer_norm1.weight] |
| Loading weights: 22%|##1 | 86/398 [00:00<00:00, 6128.69it/s, Materializing param=text_model.encoder.layers.5.layer_norm2.bias] |
| Loading weights: 22%|##1 | 86/398 [00:00<00:00, 6102.46it/s, Materializing param=text_model.encoder.layers.5.layer_norm2.bias] |
| Loading weights: 22%|##1 | 87/398 [00:00<00:00, 6124.30it/s, Materializing param=text_model.encoder.layers.5.layer_norm2.weight] |
| Loading weights: 22%|##1 | 87/398 [00:00<00:00, 6098.82it/s, Materializing param=text_model.encoder.layers.5.layer_norm2.weight] |
| Loading weights: 22%|##2 | 88/398 [00:00<00:00, 6126.02it/s, Materializing param=text_model.encoder.layers.5.mlp.fc1.bias] |
| Loading weights: 22%|##2 | 88/398 [00:00<00:00, 6106.36it/s, Materializing param=text_model.encoder.layers.5.mlp.fc1.bias] |
| Loading weights: 22%|##2 | 89/398 [00:00<00:00, 6136.86it/s, Materializing param=text_model.encoder.layers.5.mlp.fc1.weight] |
| Loading weights: 22%|##2 | 89/398 [00:00<00:00, 6114.65it/s, Materializing param=text_model.encoder.layers.5.mlp.fc1.weight] |
| Loading weights: 23%|##2 | 90/398 [00:00<00:00, 6137.61it/s, Materializing param=text_model.encoder.layers.5.mlp.fc2.bias] |
| Loading weights: 23%|##2 | 90/398 [00:00<00:00, 6115.33it/s, Materializing param=text_model.encoder.layers.5.mlp.fc2.bias] |
| Loading weights: 23%|##2 | 91/398 [00:00<00:00, 6126.81it/s, Materializing param=text_model.encoder.layers.5.mlp.fc2.weight] |
| Loading weights: 23%|##2 | 91/398 [00:00<00:00, 6101.63it/s, Materializing param=text_model.encoder.layers.5.mlp.fc2.weight] |
| Loading weights: 23%|##3 | 92/398 [00:00<00:00, 6124.14it/s, Materializing param=text_model.encoder.layers.5.self_attn.k_proj.bias] |
| Loading weights: 23%|##3 | 92/398 [00:00<00:00, 6100.13it/s, Materializing param=text_model.encoder.layers.5.self_attn.k_proj.bias] |
| Loading weights: 23%|##3 | 93/398 [00:00<00:00, 6120.00it/s, Materializing param=text_model.encoder.layers.5.self_attn.k_proj.weight] |
| Loading weights: 23%|##3 | 93/398 [00:00<00:00, 6094.66it/s, Materializing param=text_model.encoder.layers.5.self_attn.k_proj.weight] |
| Loading weights: 24%|##3 | 94/398 [00:00<00:00, 6113.29it/s, Materializing param=text_model.encoder.layers.5.self_attn.out_proj.bias] |
| Loading weights: 24%|##3 | 94/398 [00:00<00:00, 6070.75it/s, Materializing param=text_model.encoder.layers.5.self_attn.out_proj.bias] |
| Loading weights: 24%|##3 | 95/398 [00:00<00:00, 6065.01it/s, Materializing param=text_model.encoder.layers.5.self_attn.out_proj.weight] |
| Loading weights: 24%|##3 | 95/398 [00:00<00:00, 6038.45it/s, Materializing param=text_model.encoder.layers.5.self_attn.out_proj.weight] |
| Loading weights: 24%|##4 | 96/398 [00:00<00:00, 6061.22it/s, Materializing param=text_model.encoder.layers.5.self_attn.q_proj.bias] |
| Loading weights: 24%|##4 | 96/398 [00:00<00:00, 6035.24it/s, Materializing param=text_model.encoder.layers.5.self_attn.q_proj.bias] |
| Loading weights: 24%|##4 | 97/398 [00:00<00:00, 6058.79it/s, Materializing param=text_model.encoder.layers.5.self_attn.q_proj.weight] |
| Loading weights: 24%|##4 | 97/398 [00:00<00:00, 6041.06it/s, Materializing param=text_model.encoder.layers.5.self_attn.q_proj.weight] |
| Loading weights: 25%|##4 | 98/398 [00:00<00:00, 6071.88it/s, Materializing param=text_model.encoder.layers.5.self_attn.v_proj.bias] |
| Loading weights: 25%|##4 | 98/398 [00:00<00:00, 6055.15it/s, Materializing param=text_model.encoder.layers.5.self_attn.v_proj.bias] |
| Loading weights: 25%|##4 | 99/398 [00:00<00:00, 6086.63it/s, Materializing param=text_model.encoder.layers.5.self_attn.v_proj.weight] |
| Loading weights: 25%|##4 | 99/398 [00:00<00:00, 6048.24it/s, Materializing param=text_model.encoder.layers.5.self_attn.v_proj.weight] |
| Loading weights: 25%|##5 | 100/398 [00:00<00:00, 6042.62it/s, Materializing param=text_model.encoder.layers.6.layer_norm1.bias] |
| Loading weights: 25%|##5 | 100/398 [00:00<00:00, 6012.39it/s, Materializing param=text_model.encoder.layers.6.layer_norm1.bias] |
| Loading weights: 25%|##5 | 101/398 [00:00<00:00, 6026.21it/s, Materializing param=text_model.encoder.layers.6.layer_norm1.weight] |
| Loading weights: 25%|##5 | 101/398 [00:00<00:00, 6004.43it/s, Materializing param=text_model.encoder.layers.6.layer_norm1.weight] |
| Loading weights: 26%|##5 | 102/398 [00:00<00:00, 6022.65it/s, Materializing param=text_model.encoder.layers.6.layer_norm2.bias] |
| Loading weights: 26%|##5 | 102/398 [00:00<00:00, 6002.20it/s, Materializing param=text_model.encoder.layers.6.layer_norm2.bias] |
| Loading weights: 26%|##5 | 103/398 [00:00<00:00, 6021.93it/s, Materializing param=text_model.encoder.layers.6.layer_norm2.weight] |
| Loading weights: 26%|##5 | 103/398 [00:00<00:00, 5999.52it/s, Materializing param=text_model.encoder.layers.6.layer_norm2.weight] |
| Loading weights: 26%|##6 | 104/398 [00:00<00:00, 6019.81it/s, Materializing param=text_model.encoder.layers.6.mlp.fc1.bias] |
| Loading weights: 26%|##6 | 104/398 [00:00<00:00, 6000.02it/s, Materializing param=text_model.encoder.layers.6.mlp.fc1.bias] |
| Loading weights: 26%|##6 | 105/398 [00:00<00:00, 6020.20it/s, Materializing param=text_model.encoder.layers.6.mlp.fc1.weight] |
| Loading weights: 26%|##6 | 105/398 [00:00<00:00, 6000.93it/s, Materializing param=text_model.encoder.layers.6.mlp.fc1.weight] |
| Loading weights: 27%|##6 | 106/398 [00:00<00:00, 6022.05it/s, Materializing param=text_model.encoder.layers.6.mlp.fc2.bias] |
| Loading weights: 27%|##6 | 106/398 [00:00<00:00, 6003.84it/s, Materializing param=text_model.encoder.layers.6.mlp.fc2.bias] |
| Loading weights: 27%|##6 | 107/398 [00:00<00:00, 6022.34it/s, Materializing param=text_model.encoder.layers.6.mlp.fc2.weight] |
| Loading weights: 27%|##6 | 107/398 [00:00<00:00, 6003.65it/s, Materializing param=text_model.encoder.layers.6.mlp.fc2.weight] |
| Loading weights: 27%|##7 | 108/398 [00:00<00:00, 6025.42it/s, Materializing param=text_model.encoder.layers.6.self_attn.k_proj.bias] |
| Loading weights: 27%|##7 | 108/398 [00:00<00:00, 6008.87it/s, Materializing param=text_model.encoder.layers.6.self_attn.k_proj.bias] |
| Loading weights: 27%|##7 | 109/398 [00:00<00:00, 6029.16it/s, Materializing param=text_model.encoder.layers.6.self_attn.k_proj.weight] |
| Loading weights: 27%|##7 | 109/398 [00:00<00:00, 6010.14it/s, Materializing param=text_model.encoder.layers.6.self_attn.k_proj.weight] |
| Loading weights: 28%|##7 | 110/398 [00:00<00:00, 6028.27it/s, Materializing param=text_model.encoder.layers.6.self_attn.out_proj.bias] |
| Loading weights: 28%|##7 | 110/398 [00:00<00:00, 6009.97it/s, Materializing param=text_model.encoder.layers.6.self_attn.out_proj.bias] |
| Loading weights: 28%|##7 | 111/398 [00:00<00:00, 6028.48it/s, Materializing param=text_model.encoder.layers.6.self_attn.out_proj.weight] |
| Loading weights: 28%|##7 | 111/398 [00:00<00:00, 6008.95it/s, Materializing param=text_model.encoder.layers.6.self_attn.out_proj.weight] |
| Loading weights: 28%|##8 | 112/398 [00:00<00:00, 6024.14it/s, Materializing param=text_model.encoder.layers.6.self_attn.q_proj.bias] |
| Loading weights: 28%|##8 | 112/398 [00:00<00:00, 6003.96it/s, Materializing param=text_model.encoder.layers.6.self_attn.q_proj.bias] |
| Loading weights: 28%|##8 | 113/398 [00:00<00:00, 5990.73it/s, Materializing param=text_model.encoder.layers.6.self_attn.q_proj.weight] |
| Loading weights: 28%|##8 | 113/398 [00:00<00:00, 5970.88it/s, Materializing param=text_model.encoder.layers.6.self_attn.q_proj.weight] |
| Loading weights: 29%|##8 | 114/398 [00:00<00:00, 5995.62it/s, Materializing param=text_model.encoder.layers.6.self_attn.v_proj.bias] |
| Loading weights: 29%|##8 | 114/398 [00:00<00:00, 5981.89it/s, Materializing param=text_model.encoder.layers.6.self_attn.v_proj.bias] |
| Loading weights: 29%|##8 | 115/398 [00:00<00:00, 6009.56it/s, Materializing param=text_model.encoder.layers.6.self_attn.v_proj.weight] |
| Loading weights: 29%|##8 | 115/398 [00:00<00:00, 5995.74it/s, Materializing param=text_model.encoder.layers.6.self_attn.v_proj.weight] |
| Loading weights: 29%|##9 | 116/398 [00:00<00:00, 6023.54it/s, Materializing param=text_model.encoder.layers.7.layer_norm1.bias] |
| Loading weights: 29%|##9 | 116/398 [00:00<00:00, 6009.70it/s, Materializing param=text_model.encoder.layers.7.layer_norm1.bias] |
| Loading weights: 29%|##9 | 117/398 [00:00<00:00, 6036.01it/s, Materializing param=text_model.encoder.layers.7.layer_norm1.weight] |
| Loading weights: 29%|##9 | 117/398 [00:00<00:00, 6020.46it/s, Materializing param=text_model.encoder.layers.7.layer_norm1.weight] |
| Loading weights: 30%|##9 | 118/398 [00:00<00:00, 6046.32it/s, Materializing param=text_model.encoder.layers.7.layer_norm2.bias] |
| Loading weights: 30%|##9 | 118/398 [00:00<00:00, 6032.03it/s, Materializing param=text_model.encoder.layers.7.layer_norm2.bias] |
| Loading weights: 30%|##9 | 119/398 [00:00<00:00, 6058.78it/s, Materializing param=text_model.encoder.layers.7.layer_norm2.weight] |
| Loading weights: 30%|##9 | 119/398 [00:00<00:00, 6045.35it/s, Materializing param=text_model.encoder.layers.7.layer_norm2.weight] |
| Loading weights: 30%|### | 120/398 [00:00<00:00, 6070.78it/s, Materializing param=text_model.encoder.layers.7.mlp.fc1.bias] |
| Loading weights: 30%|### | 120/398 [00:00<00:00, 6054.72it/s, Materializing param=text_model.encoder.layers.7.mlp.fc1.bias] |
| Loading weights: 30%|### | 121/398 [00:00<00:00, 6075.94it/s, Materializing param=text_model.encoder.layers.7.mlp.fc1.weight] |
| Loading weights: 30%|### | 121/398 [00:00<00:00, 6061.49it/s, Materializing param=text_model.encoder.layers.7.mlp.fc1.weight] |
| Loading weights: 31%|### | 122/398 [00:00<00:00, 6083.04it/s, Materializing param=text_model.encoder.layers.7.mlp.fc2.bias] |
| Loading weights: 31%|### | 122/398 [00:00<00:00, 6065.95it/s, Materializing param=text_model.encoder.layers.7.mlp.fc2.bias] |
| Loading weights: 31%|### | 123/398 [00:00<00:00, 6073.33it/s, Materializing param=text_model.encoder.layers.7.mlp.fc2.weight] |
| Loading weights: 31%|### | 123/398 [00:00<00:00, 6054.23it/s, Materializing param=text_model.encoder.layers.7.mlp.fc2.weight] |
| Loading weights: 31%|###1 | 124/398 [00:00<00:00, 6059.37it/s, Materializing param=text_model.encoder.layers.7.self_attn.k_proj.bias] |
| Loading weights: 31%|###1 | 124/398 [00:00<00:00, 6038.40it/s, Materializing param=text_model.encoder.layers.7.self_attn.k_proj.bias] |
| Loading weights: 31%|###1 | 125/398 [00:00<00:00, 6056.58it/s, Materializing param=text_model.encoder.layers.7.self_attn.k_proj.weight] |
| Loading weights: 31%|###1 | 125/398 [00:00<00:00, 6042.41it/s, Materializing param=text_model.encoder.layers.7.self_attn.k_proj.weight] |
| Loading weights: 32%|###1 | 126/398 [00:00<00:00, 6066.70it/s, Materializing param=text_model.encoder.layers.7.self_attn.out_proj.bias] |
| Loading weights: 32%|###1 | 126/398 [00:00<00:00, 6052.25it/s, Materializing param=text_model.encoder.layers.7.self_attn.out_proj.bias] |
| Loading weights: 32%|###1 | 127/398 [00:00<00:00, 6076.76it/s, Materializing param=text_model.encoder.layers.7.self_attn.out_proj.weight] |
| Loading weights: 32%|###1 | 127/398 [00:00<00:00, 6064.17it/s, Materializing param=text_model.encoder.layers.7.self_attn.out_proj.weight] |
| Loading weights: 32%|###2 | 128/398 [00:00<00:00, 6087.94it/s, Materializing param=text_model.encoder.layers.7.self_attn.q_proj.bias] |
| Loading weights: 32%|###2 | 128/398 [00:00<00:00, 6075.26it/s, Materializing param=text_model.encoder.layers.7.self_attn.q_proj.bias] |
| Loading weights: 32%|###2 | 129/398 [00:00<00:00, 6099.81it/s, Materializing param=text_model.encoder.layers.7.self_attn.q_proj.weight] |
| Loading weights: 32%|###2 | 129/398 [00:00<00:00, 6087.73it/s, Materializing param=text_model.encoder.layers.7.self_attn.q_proj.weight] |
| Loading weights: 33%|###2 | 130/398 [00:00<00:00, 6113.19it/s, Materializing param=text_model.encoder.layers.7.self_attn.v_proj.bias] |
| Loading weights: 33%|###2 | 130/398 [00:00<00:00, 6101.01it/s, Materializing param=text_model.encoder.layers.7.self_attn.v_proj.bias] |
| Loading weights: 33%|###2 | 131/398 [00:00<00:00, 6125.67it/s, Materializing param=text_model.encoder.layers.7.self_attn.v_proj.weight] |
| Loading weights: 33%|###2 | 131/398 [00:00<00:00, 6113.26it/s, Materializing param=text_model.encoder.layers.7.self_attn.v_proj.weight] |
| Loading weights: 33%|###3 | 132/398 [00:00<00:00, 6137.53it/s, Materializing param=text_model.encoder.layers.8.layer_norm1.bias] |
| Loading weights: 33%|###3 | 132/398 [00:00<00:00, 6125.71it/s, Materializing param=text_model.encoder.layers.8.layer_norm1.bias] |
| Loading weights: 33%|###3 | 133/398 [00:00<00:00, 6149.67it/s, Materializing param=text_model.encoder.layers.8.layer_norm1.weight] |
| Loading weights: 33%|###3 | 133/398 [00:00<00:00, 6137.76it/s, Materializing param=text_model.encoder.layers.8.layer_norm1.weight] |
| Loading weights: 34%|###3 | 134/398 [00:00<00:00, 6162.14it/s, Materializing param=text_model.encoder.layers.8.layer_norm2.bias] |
| Loading weights: 34%|###3 | 134/398 [00:00<00:00, 6150.28it/s, Materializing param=text_model.encoder.layers.8.layer_norm2.bias] |
| Loading weights: 34%|###3 | 135/398 [00:00<00:00, 6174.28it/s, Materializing param=text_model.encoder.layers.8.layer_norm2.weight] |
| Loading weights: 34%|###3 | 135/398 [00:00<00:00, 6161.85it/s, Materializing param=text_model.encoder.layers.8.layer_norm2.weight] |
| Loading weights: 34%|###4 | 136/398 [00:00<00:00, 6184.95it/s, Materializing param=text_model.encoder.layers.8.mlp.fc1.bias] |
| Loading weights: 34%|###4 | 136/398 [00:00<00:00, 6172.77it/s, Materializing param=text_model.encoder.layers.8.mlp.fc1.bias] |
| Loading weights: 34%|###4 | 137/398 [00:00<00:00, 6195.56it/s, Materializing param=text_model.encoder.layers.8.mlp.fc1.weight] |
| Loading weights: 34%|###4 | 137/398 [00:00<00:00, 6183.89it/s, Materializing param=text_model.encoder.layers.8.mlp.fc1.weight] |
| Loading weights: 35%|###4 | 138/398 [00:00<00:00, 6207.25it/s, Materializing param=text_model.encoder.layers.8.mlp.fc2.bias] |
| Loading weights: 35%|###4 | 138/398 [00:00<00:00, 6195.49it/s, Materializing param=text_model.encoder.layers.8.mlp.fc2.bias] |
| Loading weights: 35%|###4 | 139/398 [00:00<00:00, 6218.69it/s, Materializing param=text_model.encoder.layers.8.mlp.fc2.weight] |
| Loading weights: 35%|###4 | 139/398 [00:00<00:00, 6206.64it/s, Materializing param=text_model.encoder.layers.8.mlp.fc2.weight] |
| Loading weights: 35%|###5 | 140/398 [00:00<00:00, 6229.74it/s, Materializing param=text_model.encoder.layers.8.self_attn.k_proj.bias] |
| Loading weights: 35%|###5 | 140/398 [00:00<00:00, 6217.47it/s, Materializing param=text_model.encoder.layers.8.self_attn.k_proj.bias] |
| Loading weights: 35%|###5 | 141/398 [00:00<00:00, 6240.47it/s, Materializing param=text_model.encoder.layers.8.self_attn.k_proj.weight] |
| Loading weights: 35%|###5 | 141/398 [00:00<00:00, 6228.90it/s, Materializing param=text_model.encoder.layers.8.self_attn.k_proj.weight] |
| Loading weights: 36%|###5 | 142/398 [00:00<00:00, 6252.01it/s, Materializing param=text_model.encoder.layers.8.self_attn.out_proj.bias] |
| Loading weights: 36%|###5 | 142/398 [00:00<00:00, 6239.89it/s, Materializing param=text_model.encoder.layers.8.self_attn.out_proj.bias] |
| Loading weights: 36%|###5 | 143/398 [00:00<00:00, 6261.07it/s, Materializing param=text_model.encoder.layers.8.self_attn.out_proj.weight] |
| Loading weights: 36%|###5 | 143/398 [00:00<00:00, 6249.33it/s, Materializing param=text_model.encoder.layers.8.self_attn.out_proj.weight] |
| Loading weights: 36%|###6 | 144/398 [00:00<00:00, 6271.34it/s, Materializing param=text_model.encoder.layers.8.self_attn.q_proj.bias] |
| Loading weights: 36%|###6 | 144/398 [00:00<00:00, 6259.83it/s, Materializing param=text_model.encoder.layers.8.self_attn.q_proj.bias] |
| Loading weights: 36%|###6 | 145/398 [00:00<00:00, 6281.82it/s, Materializing param=text_model.encoder.layers.8.self_attn.q_proj.weight] |
| Loading weights: 36%|###6 | 145/398 [00:00<00:00, 6270.42it/s, Materializing param=text_model.encoder.layers.8.self_attn.q_proj.weight] |
| Loading weights: 37%|###6 | 146/398 [00:00<00:00, 6292.71it/s, Materializing param=text_model.encoder.layers.8.self_attn.v_proj.bias] |
| Loading weights: 37%|###6 | 146/398 [00:00<00:00, 6281.22it/s, Materializing param=text_model.encoder.layers.8.self_attn.v_proj.bias] |
| Loading weights: 37%|###6 | 147/398 [00:00<00:00, 6303.03it/s, Materializing param=text_model.encoder.layers.8.self_attn.v_proj.weight] |
| Loading weights: 37%|###6 | 147/398 [00:00<00:00, 6291.52it/s, Materializing param=text_model.encoder.layers.8.self_attn.v_proj.weight] |
| Loading weights: 37%|###7 | 148/398 [00:00<00:00, 6313.45it/s, Materializing param=text_model.encoder.layers.9.layer_norm1.bias] |
| Loading weights: 37%|###7 | 148/398 [00:00<00:00, 6302.81it/s, Materializing param=text_model.encoder.layers.9.layer_norm1.bias] |
| Loading weights: 37%|###7 | 149/398 [00:00<00:00, 6325.61it/s, Materializing param=text_model.encoder.layers.9.layer_norm1.weight] |
| Loading weights: 37%|###7 | 149/398 [00:00<00:00, 6315.19it/s, Materializing param=text_model.encoder.layers.9.layer_norm1.weight] |
| Loading weights: 38%|###7 | 150/398 [00:00<00:00, 6338.42it/s, Materializing param=text_model.encoder.layers.9.layer_norm2.bias] |
| Loading weights: 38%|###7 | 150/398 [00:00<00:00, 6327.97it/s, Materializing param=text_model.encoder.layers.9.layer_norm2.bias] |
| Loading weights: 38%|###7 | 151/398 [00:00<00:00, 6351.95it/s, Materializing param=text_model.encoder.layers.9.layer_norm2.weight] |
| Loading weights: 38%|###7 | 151/398 [00:00<00:00, 6341.52it/s, Materializing param=text_model.encoder.layers.9.layer_norm2.weight] |
| Loading weights: 38%|###8 | 152/398 [00:00<00:00, 6365.54it/s, Materializing param=text_model.encoder.layers.9.mlp.fc1.bias] |
| Loading weights: 38%|###8 | 152/398 [00:00<00:00, 6355.64it/s, Materializing param=text_model.encoder.layers.9.mlp.fc1.bias] |
| Loading weights: 38%|###8 | 153/398 [00:00<00:00, 6378.76it/s, Materializing param=text_model.encoder.layers.9.mlp.fc1.weight] |
| Loading weights: 38%|###8 | 153/398 [00:00<00:00, 6368.63it/s, Materializing param=text_model.encoder.layers.9.mlp.fc1.weight] |
| Loading weights: 39%|###8 | 154/398 [00:00<00:00, 6391.86it/s, Materializing param=text_model.encoder.layers.9.mlp.fc2.bias] |
| Loading weights: 39%|###8 | 154/398 [00:00<00:00, 6382.01it/s, Materializing param=text_model.encoder.layers.9.mlp.fc2.bias] |
| Loading weights: 39%|###8 | 155/398 [00:00<00:00, 6404.91it/s, Materializing param=text_model.encoder.layers.9.mlp.fc2.weight] |
| Loading weights: 39%|###8 | 155/398 [00:00<00:00, 6394.89it/s, Materializing param=text_model.encoder.layers.9.mlp.fc2.weight] |
| Loading weights: 39%|###9 | 156/398 [00:00<00:00, 6418.15it/s, Materializing param=text_model.encoder.layers.9.self_attn.k_proj.bias] |
| Loading weights: 39%|###9 | 156/398 [00:00<00:00, 6408.22it/s, Materializing param=text_model.encoder.layers.9.self_attn.k_proj.bias] |
| Loading weights: 39%|###9 | 157/398 [00:00<00:00, 6430.28it/s, Materializing param=text_model.encoder.layers.9.self_attn.k_proj.weight] |
| Loading weights: 39%|###9 | 157/398 [00:00<00:00, 6419.87it/s, Materializing param=text_model.encoder.layers.9.self_attn.k_proj.weight] |
| Loading weights: 40%|###9 | 158/398 [00:00<00:00, 6441.67it/s, Materializing param=text_model.encoder.layers.9.self_attn.out_proj.bias] |
| Loading weights: 40%|###9 | 158/398 [00:00<00:00, 6431.61it/s, Materializing param=text_model.encoder.layers.9.self_attn.out_proj.bias] |
| Loading weights: 40%|###9 | 159/398 [00:00<00:00, 6453.65it/s, Materializing param=text_model.encoder.layers.9.self_attn.out_proj.weight] |
| Loading weights: 40%|###9 | 159/398 [00:00<00:00, 6443.55it/s, Materializing param=text_model.encoder.layers.9.self_attn.out_proj.weight] |
| Loading weights: 40%|#### | 160/398 [00:00<00:00, 6464.77it/s, Materializing param=text_model.encoder.layers.9.self_attn.q_proj.bias] |
| Loading weights: 40%|#### | 160/398 [00:00<00:00, 6454.76it/s, Materializing param=text_model.encoder.layers.9.self_attn.q_proj.bias] |
| Loading weights: 40%|#### | 161/398 [00:00<00:00, 6473.99it/s, Materializing param=text_model.encoder.layers.9.self_attn.q_proj.weight] |
| Loading weights: 40%|#### | 161/398 [00:00<00:00, 6463.27it/s, Materializing param=text_model.encoder.layers.9.self_attn.q_proj.weight] |
| Loading weights: 41%|#### | 162/398 [00:00<00:00, 6485.36it/s, Materializing param=text_model.encoder.layers.9.self_attn.v_proj.bias] |
| Loading weights: 41%|#### | 162/398 [00:00<00:00, 5632.08it/s, Materializing param=text_model.encoder.layers.9.self_attn.v_proj.bias] |
| Loading weights: 41%|#### | 163/398 [00:00<00:00, 5579.03it/s, Materializing param=text_model.encoder.layers.9.self_attn.v_proj.weight] |
| Loading weights: 41%|#### | 163/398 [00:00<00:00, 5559.62it/s, Materializing param=text_model.encoder.layers.9.self_attn.v_proj.weight] |
| Loading weights: 41%|####1 | 164/398 [00:00<00:00, 5563.86it/s, Materializing param=text_model.encoder.layers.10.layer_norm1.bias] |
| Loading weights: 41%|####1 | 164/398 [00:00<00:00, 5554.16it/s, Materializing param=text_model.encoder.layers.10.layer_norm1.bias] |
| Loading weights: 41%|####1 | 165/398 [00:00<00:00, 5569.59it/s, Materializing param=text_model.encoder.layers.10.layer_norm1.weight] |
| Loading weights: 41%|####1 | 165/398 [00:00<00:00, 5561.13it/s, Materializing param=text_model.encoder.layers.10.layer_norm1.weight] |
| Loading weights: 42%|####1 | 166/398 [00:00<00:00, 5578.60it/s, Materializing param=text_model.encoder.layers.10.layer_norm2.bias] |
| Loading weights: 42%|####1 | 166/398 [00:00<00:00, 5569.86it/s, Materializing param=text_model.encoder.layers.10.layer_norm2.bias] |
| Loading weights: 42%|####1 | 167/398 [00:00<00:00, 5587.05it/s, Materializing param=text_model.encoder.layers.10.layer_norm2.weight] |
| Loading weights: 42%|####1 | 167/398 [00:00<00:00, 5578.60it/s, Materializing param=text_model.encoder.layers.10.layer_norm2.weight] |
| Loading weights: 42%|####2 | 168/398 [00:00<00:00, 5596.00it/s, Materializing param=text_model.encoder.layers.10.mlp.fc1.bias] |
| Loading weights: 42%|####2 | 168/398 [00:00<00:00, 5587.79it/s, Materializing param=text_model.encoder.layers.10.mlp.fc1.bias] |
| Loading weights: 42%|####2 | 169/398 [00:00<00:00, 5603.77it/s, Materializing param=text_model.encoder.layers.10.mlp.fc1.weight] |
| Loading weights: 42%|####2 | 169/398 [00:00<00:00, 5590.82it/s, Materializing param=text_model.encoder.layers.10.mlp.fc1.weight] |
| Loading weights: 43%|####2 | 170/398 [00:00<00:00, 5599.96it/s, Materializing param=text_model.encoder.layers.10.mlp.fc2.bias] |
| Loading weights: 43%|####2 | 170/398 [00:00<00:00, 5590.13it/s, Materializing param=text_model.encoder.layers.10.mlp.fc2.bias] |
| Loading weights: 43%|####2 | 171/398 [00:00<00:00, 5606.66it/s, Materializing param=text_model.encoder.layers.10.mlp.fc2.weight] |
| Loading weights: 43%|####2 | 171/398 [00:00<00:00, 5598.39it/s, Materializing param=text_model.encoder.layers.10.mlp.fc2.weight] |
| Loading weights: 43%|####3 | 172/398 [00:00<00:00, 5615.52it/s, Materializing param=text_model.encoder.layers.10.self_attn.k_proj.bias] |
| Loading weights: 43%|####3 | 172/398 [00:00<00:00, 5607.05it/s, Materializing param=text_model.encoder.layers.10.self_attn.k_proj.bias] |
| Loading weights: 43%|####3 | 173/398 [00:00<00:00, 5620.21it/s, Materializing param=text_model.encoder.layers.10.self_attn.k_proj.weight] |
| Loading weights: 43%|####3 | 173/398 [00:00<00:00, 5609.31it/s, Materializing param=text_model.encoder.layers.10.self_attn.k_proj.weight] |
| Loading weights: 44%|####3 | 174/398 [00:00<00:00, 5617.80it/s, Materializing param=text_model.encoder.layers.10.self_attn.out_proj.bias] |
| Loading weights: 44%|####3 | 174/398 [00:00<00:00, 5606.67it/s, Materializing param=text_model.encoder.layers.10.self_attn.out_proj.bias] |
| Loading weights: 44%|####3 | 175/398 [00:00<00:00, 5618.86it/s, Materializing param=text_model.encoder.layers.10.self_attn.out_proj.weight] |
| Loading weights: 44%|####3 | 175/398 [00:00<00:00, 5607.83it/s, Materializing param=text_model.encoder.layers.10.self_attn.out_proj.weight] |
| Loading weights: 44%|####4 | 176/398 [00:00<00:00, 5620.21it/s, Materializing param=text_model.encoder.layers.10.self_attn.q_proj.bias] |
| Loading weights: 44%|####4 | 176/398 [00:00<00:00, 5609.83it/s, Materializing param=text_model.encoder.layers.10.self_attn.q_proj.bias] |
| Loading weights: 44%|####4 | 177/398 [00:00<00:00, 5620.43it/s, Materializing param=text_model.encoder.layers.10.self_attn.q_proj.weight] |
| Loading weights: 44%|####4 | 177/398 [00:00<00:00, 5609.31it/s, Materializing param=text_model.encoder.layers.10.self_attn.q_proj.weight] |
| Loading weights: 45%|####4 | 178/398 [00:00<00:00, 5620.70it/s, Materializing param=text_model.encoder.layers.10.self_attn.v_proj.bias] |
| Loading weights: 45%|####4 | 178/398 [00:00<00:00, 5609.42it/s, Materializing param=text_model.encoder.layers.10.self_attn.v_proj.bias] |
| Loading weights: 45%|####4 | 179/398 [00:00<00:00, 5620.62it/s, Materializing param=text_model.encoder.layers.10.self_attn.v_proj.weight] |
| Loading weights: 45%|####4 | 179/398 [00:00<00:00, 5609.49it/s, Materializing param=text_model.encoder.layers.10.self_attn.v_proj.weight] |
| Loading weights: 45%|####5 | 180/398 [00:00<00:00, 5621.64it/s, Materializing param=text_model.encoder.layers.11.layer_norm1.bias] |
| Loading weights: 45%|####5 | 180/398 [00:00<00:00, 5610.86it/s, Materializing param=text_model.encoder.layers.11.layer_norm1.bias] |
| Loading weights: 45%|####5 | 181/398 [00:00<00:00, 5622.72it/s, Materializing param=text_model.encoder.layers.11.layer_norm1.weight] |
| Loading weights: 45%|####5 | 181/398 [00:00<00:00, 5612.04it/s, Materializing param=text_model.encoder.layers.11.layer_norm1.weight] |
| Loading weights: 46%|####5 | 182/398 [00:00<00:00, 5623.55it/s, Materializing param=text_model.encoder.layers.11.layer_norm2.bias] |
| Loading weights: 46%|####5 | 182/398 [00:00<00:00, 5613.13it/s, Materializing param=text_model.encoder.layers.11.layer_norm2.bias] |
| Loading weights: 46%|####5 | 183/398 [00:00<00:00, 5624.99it/s, Materializing param=text_model.encoder.layers.11.layer_norm2.weight] |
| Loading weights: 46%|####5 | 183/398 [00:00<00:00, 5614.50it/s, Materializing param=text_model.encoder.layers.11.layer_norm2.weight] |
| Loading weights: 46%|####6 | 184/398 [00:00<00:00, 5626.57it/s, Materializing param=text_model.encoder.layers.11.mlp.fc1.bias] |
| Loading weights: 46%|####6 | 184/398 [00:00<00:00, 5616.50it/s, Materializing param=text_model.encoder.layers.11.mlp.fc1.bias] |
| Loading weights: 46%|####6 | 185/398 [00:00<00:00, 5628.39it/s, Materializing param=text_model.encoder.layers.11.mlp.fc1.weight] |
| Loading weights: 46%|####6 | 185/398 [00:00<00:00, 5618.00it/s, Materializing param=text_model.encoder.layers.11.mlp.fc1.weight] |
| Loading weights: 47%|####6 | 186/398 [00:00<00:00, 5629.41it/s, Materializing param=text_model.encoder.layers.11.mlp.fc2.bias] |
| Loading weights: 47%|####6 | 186/398 [00:00<00:00, 5618.99it/s, Materializing param=text_model.encoder.layers.11.mlp.fc2.bias] |
| Loading weights: 47%|####6 | 187/398 [00:00<00:00, 5630.42it/s, Materializing param=text_model.encoder.layers.11.mlp.fc2.weight] |
| Loading weights: 47%|####6 | 187/398 [00:00<00:00, 5620.22it/s, Materializing param=text_model.encoder.layers.11.mlp.fc2.weight] |
| Loading weights: 47%|####7 | 188/398 [00:00<00:00, 5631.31it/s, Materializing param=text_model.encoder.layers.11.self_attn.k_proj.bias] |
| Loading weights: 47%|####7 | 188/398 [00:00<00:00, 5620.75it/s, Materializing param=text_model.encoder.layers.11.self_attn.k_proj.bias] |
| Loading weights: 47%|####7 | 189/398 [00:00<00:00, 5629.94it/s, Materializing param=text_model.encoder.layers.11.self_attn.k_proj.weight] |
| Loading weights: 47%|####7 | 189/398 [00:00<00:00, 5619.48it/s, Materializing param=text_model.encoder.layers.11.self_attn.k_proj.weight] |
| Loading weights: 48%|####7 | 190/398 [00:00<00:00, 5630.06it/s, Materializing param=text_model.encoder.layers.11.self_attn.out_proj.bias] |
| Loading weights: 48%|####7 | 190/398 [00:00<00:00, 5617.64it/s, Materializing param=text_model.encoder.layers.11.self_attn.out_proj.bias] |
| Loading weights: 48%|####7 | 191/398 [00:00<00:00, 5625.87it/s, Materializing param=text_model.encoder.layers.11.self_attn.out_proj.weight] |
| Loading weights: 48%|####7 | 191/398 [00:00<00:00, 5614.51it/s, Materializing param=text_model.encoder.layers.11.self_attn.out_proj.weight] |
| Loading weights: 48%|####8 | 192/398 [00:00<00:00, 5624.20it/s, Materializing param=text_model.encoder.layers.11.self_attn.q_proj.bias] |
| Loading weights: 48%|####8 | 192/398 [00:00<00:00, 5613.73it/s, Materializing param=text_model.encoder.layers.11.self_attn.q_proj.bias] |
| Loading weights: 48%|####8 | 193/398 [00:00<00:00, 5623.56it/s, Materializing param=text_model.encoder.layers.11.self_attn.q_proj.weight] |
| Loading weights: 48%|####8 | 193/398 [00:00<00:00, 5604.14it/s, Materializing param=text_model.encoder.layers.11.self_attn.q_proj.weight] |
| Loading weights: 49%|####8 | 194/398 [00:00<00:00, 5613.90it/s, Materializing param=text_model.encoder.layers.11.self_attn.v_proj.bias] |
| Loading weights: 49%|####8 | 194/398 [00:00<00:00, 5603.30it/s, Materializing param=text_model.encoder.layers.11.self_attn.v_proj.bias] |
| Loading weights: 49%|####8 | 195/398 [00:00<00:00, 5613.67it/s, Materializing param=text_model.encoder.layers.11.self_attn.v_proj.weight] |
| Loading weights: 49%|####8 | 195/398 [00:00<00:00, 5603.55it/s, Materializing param=text_model.encoder.layers.11.self_attn.v_proj.weight] |
| Loading weights: 49%|####9 | 196/398 [00:00<00:00, 5614.37it/s, Materializing param=text_model.final_layer_norm.bias] |
| Loading weights: 49%|####9 | 196/398 [00:00<00:00, 5604.30it/s, Materializing param=text_model.final_layer_norm.bias] |
| Loading weights: 49%|####9 | 197/398 [00:00<00:00, 5616.35it/s, Materializing param=text_model.final_layer_norm.weight] |
| Loading weights: 49%|####9 | 197/398 [00:00<00:00, 5606.67it/s, Materializing param=text_model.final_layer_norm.weight] |
| Loading weights: 50%|####9 | 198/398 [00:00<00:00, 5619.27it/s, Materializing param=text_projection.weight] |
| Loading weights: 50%|####9 | 198/398 [00:00<00:00, 5609.90it/s, Materializing param=text_projection.weight] |
| Loading weights: 50%|##### | 199/398 [00:00<00:00, 5622.92it/s, Materializing param=vision_model.embeddings.class_embedding] |
| Loading weights: 50%|##### | 199/398 [00:00<00:00, 5611.39it/s, Materializing param=vision_model.embeddings.class_embedding] |
| Loading weights: 50%|##### | 200/398 [00:00<00:00, 5621.86it/s, Materializing param=vision_model.embeddings.patch_embedding.weight] |
| Loading weights: 50%|##### | 200/398 [00:00<00:00, 5612.01it/s, Materializing param=vision_model.embeddings.patch_embedding.weight] |
| Loading weights: 51%|##### | 201/398 [00:00<00:00, 5622.58it/s, Materializing param=vision_model.embeddings.position_embedding.weight] |
| Loading weights: 51%|##### | 201/398 [00:00<00:00, 5612.88it/s, Materializing param=vision_model.embeddings.position_embedding.weight] |
| Loading weights: 51%|##### | 202/398 [00:00<00:00, 5625.64it/s, Materializing param=vision_model.encoder.layers.0.layer_norm1.bias] |
| Loading weights: 51%|##### | 202/398 [00:00<00:00, 5617.95it/s, Materializing param=vision_model.encoder.layers.0.layer_norm1.bias] |
| Loading weights: 51%|#####1 | 203/398 [00:00<00:00, 5629.75it/s, Materializing param=vision_model.encoder.layers.0.layer_norm1.weight] |
| Loading weights: 51%|#####1 | 203/398 [00:00<00:00, 5621.17it/s, Materializing param=vision_model.encoder.layers.0.layer_norm1.weight] |
| Loading weights: 51%|#####1 | 204/398 [00:00<00:00, 5631.57it/s, Materializing param=vision_model.encoder.layers.0.layer_norm2.bias] |
| Loading weights: 51%|#####1 | 204/398 [00:00<00:00, 5622.35it/s, Materializing param=vision_model.encoder.layers.0.layer_norm2.bias] |
| Loading weights: 52%|#####1 | 205/398 [00:00<00:00, 5633.55it/s, Materializing param=vision_model.encoder.layers.0.layer_norm2.weight] |
| Loading weights: 52%|#####1 | 205/398 [00:00<00:00, 5624.86it/s, Materializing param=vision_model.encoder.layers.0.layer_norm2.weight] |
| Loading weights: 52%|#####1 | 206/398 [00:00<00:00, 5636.11it/s, Materializing param=vision_model.encoder.layers.0.mlp.fc1.bias] |
| Loading weights: 52%|#####1 | 206/398 [00:00<00:00, 5626.46it/s, Materializing param=vision_model.encoder.layers.0.mlp.fc1.bias] |
| Loading weights: 52%|#####2 | 207/398 [00:00<00:00, 5635.75it/s, Materializing param=vision_model.encoder.layers.0.mlp.fc1.weight] |
| Loading weights: 52%|#####2 | 207/398 [00:00<00:00, 5626.44it/s, Materializing param=vision_model.encoder.layers.0.mlp.fc1.weight] |
| Loading weights: 52%|#####2 | 208/398 [00:00<00:00, 5636.08it/s, Materializing param=vision_model.encoder.layers.0.mlp.fc2.bias] |
| Loading weights: 52%|#####2 | 208/398 [00:00<00:00, 5627.40it/s, Materializing param=vision_model.encoder.layers.0.mlp.fc2.bias] |
| Loading weights: 53%|#####2 | 209/398 [00:00<00:00, 5637.90it/s, Materializing param=vision_model.encoder.layers.0.mlp.fc2.weight] |
| Loading weights: 53%|#####2 | 209/398 [00:00<00:00, 5629.32it/s, Materializing param=vision_model.encoder.layers.0.mlp.fc2.weight] |
| Loading weights: 53%|#####2 | 210/398 [00:00<00:00, 5639.89it/s, Materializing param=vision_model.encoder.layers.0.self_attn.k_proj.bias] |
| Loading weights: 53%|#####2 | 210/398 [00:00<00:00, 5631.05it/s, Materializing param=vision_model.encoder.layers.0.self_attn.k_proj.bias] |
| Loading weights: 53%|#####3 | 211/398 [00:00<00:00, 5639.98it/s, Materializing param=vision_model.encoder.layers.0.self_attn.k_proj.weight] |
| Loading weights: 53%|#####3 | 211/398 [00:00<00:00, 5628.51it/s, Materializing param=vision_model.encoder.layers.0.self_attn.k_proj.weight] |
| Loading weights: 53%|#####3 | 212/398 [00:00<00:00, 5637.18it/s, Materializing param=vision_model.encoder.layers.0.self_attn.out_proj.bias] |
| Loading weights: 53%|#####3 | 212/398 [00:00<00:00, 5628.12it/s, Materializing param=vision_model.encoder.layers.0.self_attn.out_proj.bias] |
| Loading weights: 54%|#####3 | 213/398 [00:00<00:00, 5638.04it/s, Materializing param=vision_model.encoder.layers.0.self_attn.out_proj.weight] |
| Loading weights: 54%|#####3 | 213/398 [00:00<00:00, 5628.98it/s, Materializing param=vision_model.encoder.layers.0.self_attn.out_proj.weight] |
| Loading weights: 54%|#####3 | 214/398 [00:00<00:00, 5639.10it/s, Materializing param=vision_model.encoder.layers.0.self_attn.q_proj.bias] |
| Loading weights: 54%|#####3 | 214/398 [00:00<00:00, 5629.83it/s, Materializing param=vision_model.encoder.layers.0.self_attn.q_proj.bias] |
| Loading weights: 54%|#####4 | 215/398 [00:00<00:00, 5639.62it/s, Materializing param=vision_model.encoder.layers.0.self_attn.q_proj.weight] |
| Loading weights: 54%|#####4 | 215/398 [00:00<00:00, 5631.17it/s, Materializing param=vision_model.encoder.layers.0.self_attn.q_proj.weight] |
| Loading weights: 54%|#####4 | 216/398 [00:00<00:00, 5642.07it/s, Materializing param=vision_model.encoder.layers.0.self_attn.v_proj.bias] |
| Loading weights: 54%|#####4 | 216/398 [00:00<00:00, 5634.03it/s, Materializing param=vision_model.encoder.layers.0.self_attn.v_proj.bias] |
| Loading weights: 55%|#####4 | 217/398 [00:00<00:00, 5644.74it/s, Materializing param=vision_model.encoder.layers.0.self_attn.v_proj.weight] |
| Loading weights: 55%|#####4 | 217/398 [00:00<00:00, 5636.28it/s, Materializing param=vision_model.encoder.layers.0.self_attn.v_proj.weight] |
| Loading weights: 55%|#####4 | 218/398 [00:00<00:00, 5647.05it/s, Materializing param=vision_model.encoder.layers.1.layer_norm1.bias] |
| Loading weights: 55%|#####4 | 218/398 [00:00<00:00, 5639.59it/s, Materializing param=vision_model.encoder.layers.1.layer_norm1.bias] |
| Loading weights: 55%|#####5 | 219/398 [00:00<00:00, 5649.95it/s, Materializing param=vision_model.encoder.layers.1.layer_norm1.weight] |
| Loading weights: 55%|#####5 | 219/398 [00:00<00:00, 5641.21it/s, Materializing param=vision_model.encoder.layers.1.layer_norm1.weight] |
| Loading weights: 55%|#####5 | 220/398 [00:00<00:00, 5651.07it/s, Materializing param=vision_model.encoder.layers.1.layer_norm2.bias] |
| Loading weights: 55%|#####5 | 220/398 [00:00<00:00, 5642.30it/s, Materializing param=vision_model.encoder.layers.1.layer_norm2.bias] |
| Loading weights: 56%|#####5 | 221/398 [00:00<00:00, 5651.01it/s, Materializing param=vision_model.encoder.layers.1.layer_norm2.weight] |
| Loading weights: 56%|#####5 | 221/398 [00:00<00:00, 5641.79it/s, Materializing param=vision_model.encoder.layers.1.layer_norm2.weight] |
| Loading weights: 56%|#####5 | 222/398 [00:00<00:00, 5651.84it/s, Materializing param=vision_model.encoder.layers.1.mlp.fc1.bias] |
| Loading weights: 56%|#####5 | 222/398 [00:00<00:00, 5643.48it/s, Materializing param=vision_model.encoder.layers.1.mlp.fc1.bias] |
| Loading weights: 56%|#####6 | 223/398 [00:00<00:00, 5653.42it/s, Materializing param=vision_model.encoder.layers.1.mlp.fc1.weight] |
| Loading weights: 56%|#####6 | 223/398 [00:00<00:00, 5643.97it/s, Materializing param=vision_model.encoder.layers.1.mlp.fc1.weight] |
| Loading weights: 56%|#####6 | 224/398 [00:00<00:00, 5653.76it/s, Materializing param=vision_model.encoder.layers.1.mlp.fc2.bias] |
| Loading weights: 56%|#####6 | 224/398 [00:00<00:00, 5645.64it/s, Materializing param=vision_model.encoder.layers.1.mlp.fc2.bias] |
| Loading weights: 57%|#####6 | 225/398 [00:00<00:00, 5656.53it/s, Materializing param=vision_model.encoder.layers.1.mlp.fc2.weight] |
| Loading weights: 57%|#####6 | 225/398 [00:00<00:00, 5648.20it/s, Materializing param=vision_model.encoder.layers.1.mlp.fc2.weight] |
| Loading weights: 57%|#####6 | 226/398 [00:00<00:00, 5652.43it/s, Materializing param=vision_model.encoder.layers.1.self_attn.k_proj.bias] |
| Loading weights: 57%|#####6 | 226/398 [00:00<00:00, 5643.51it/s, Materializing param=vision_model.encoder.layers.1.self_attn.k_proj.bias] |
| Loading weights: 57%|#####7 | 227/398 [00:00<00:00, 5650.42it/s, Materializing param=vision_model.encoder.layers.1.self_attn.k_proj.weight] |
| Loading weights: 57%|#####7 | 227/398 [00:00<00:00, 5639.78it/s, Materializing param=vision_model.encoder.layers.1.self_attn.k_proj.weight] |
| Loading weights: 57%|#####7 | 228/398 [00:00<00:00, 5647.36it/s, Materializing param=vision_model.encoder.layers.1.self_attn.out_proj.bias] |
| Loading weights: 57%|#####7 | 228/398 [00:00<00:00, 5638.64it/s, Materializing param=vision_model.encoder.layers.1.self_attn.out_proj.bias] |
| Loading weights: 58%|#####7 | 229/398 [00:00<00:00, 5646.89it/s, Materializing param=vision_model.encoder.layers.1.self_attn.out_proj.weight] |
| Loading weights: 58%|#####7 | 229/398 [00:00<00:00, 5638.20it/s, Materializing param=vision_model.encoder.layers.1.self_attn.out_proj.weight] |
| Loading weights: 58%|#####7 | 230/398 [00:00<00:00, 5646.22it/s, Materializing param=vision_model.encoder.layers.1.self_attn.q_proj.bias] |
| Loading weights: 58%|#####7 | 230/398 [00:00<00:00, 5637.41it/s, Materializing param=vision_model.encoder.layers.1.self_attn.q_proj.bias] |
| Loading weights: 58%|#####8 | 231/398 [00:00<00:00, 5644.34it/s, Materializing param=vision_model.encoder.layers.1.self_attn.q_proj.weight] |
| Loading weights: 58%|#####8 | 231/398 [00:00<00:00, 5635.28it/s, Materializing param=vision_model.encoder.layers.1.self_attn.q_proj.weight] |
| Loading weights: 58%|#####8 | 232/398 [00:00<00:00, 5643.06it/s, Materializing param=vision_model.encoder.layers.1.self_attn.v_proj.bias] |
| Loading weights: 58%|#####8 | 232/398 [00:00<00:00, 5634.50it/s, Materializing param=vision_model.encoder.layers.1.self_attn.v_proj.bias] |
| Loading weights: 59%|#####8 | 233/398 [00:00<00:00, 5642.91it/s, Materializing param=vision_model.encoder.layers.1.self_attn.v_proj.weight] |
| Loading weights: 59%|#####8 | 233/398 [00:00<00:00, 5634.52it/s, Materializing param=vision_model.encoder.layers.1.self_attn.v_proj.weight] |
| Loading weights: 59%|#####8 | 234/398 [00:00<00:00, 5641.85it/s, Materializing param=vision_model.encoder.layers.2.layer_norm1.bias] |
| Loading weights: 59%|#####8 | 234/398 [00:00<00:00, 5633.10it/s, Materializing param=vision_model.encoder.layers.2.layer_norm1.bias] |
| Loading weights: 59%|#####9 | 235/398 [00:00<00:00, 5640.89it/s, Materializing param=vision_model.encoder.layers.2.layer_norm1.weight] |
| Loading weights: 59%|#####9 | 235/398 [00:00<00:00, 5632.06it/s, Materializing param=vision_model.encoder.layers.2.layer_norm1.weight] |
| Loading weights: 59%|#####9 | 236/398 [00:00<00:00, 5640.53it/s, Materializing param=vision_model.encoder.layers.2.layer_norm2.bias] |
| Loading weights: 59%|#####9 | 236/398 [00:00<00:00, 5631.96it/s, Materializing param=vision_model.encoder.layers.2.layer_norm2.bias] |
| Loading weights: 60%|#####9 | 237/398 [00:00<00:00, 5640.29it/s, Materializing param=vision_model.encoder.layers.2.layer_norm2.weight] |
| Loading weights: 60%|#####9 | 237/398 [00:00<00:00, 5631.79it/s, Materializing param=vision_model.encoder.layers.2.layer_norm2.weight] |
| Loading weights: 60%|#####9 | 238/398 [00:00<00:00, 5640.28it/s, Materializing param=vision_model.encoder.layers.2.mlp.fc1.bias] |
| Loading weights: 60%|#####9 | 238/398 [00:00<00:00, 5632.42it/s, Materializing param=vision_model.encoder.layers.2.mlp.fc1.bias] |
| Loading weights: 60%|###### | 239/398 [00:00<00:00, 5639.98it/s, Materializing param=vision_model.encoder.layers.2.mlp.fc1.weight] |
| Loading weights: 60%|###### | 239/398 [00:00<00:00, 5631.42it/s, Materializing param=vision_model.encoder.layers.2.mlp.fc1.weight] |
| Loading weights: 60%|###### | 240/398 [00:00<00:00, 5639.81it/s, Materializing param=vision_model.encoder.layers.2.mlp.fc2.bias] |
| Loading weights: 60%|###### | 240/398 [00:00<00:00, 5631.89it/s, Materializing param=vision_model.encoder.layers.2.mlp.fc2.bias] |
| Loading weights: 61%|###### | 241/398 [00:00<00:00, 5640.62it/s, Materializing param=vision_model.encoder.layers.2.mlp.fc2.weight] |
| Loading weights: 61%|###### | 241/398 [00:00<00:00, 5632.48it/s, Materializing param=vision_model.encoder.layers.2.mlp.fc2.weight] |
| Loading weights: 61%|###### | 242/398 [00:00<00:00, 5640.86it/s, Materializing param=vision_model.encoder.layers.2.self_attn.k_proj.bias] |
| Loading weights: 61%|###### | 242/398 [00:00<00:00, 5632.75it/s, Materializing param=vision_model.encoder.layers.2.self_attn.k_proj.bias] |
| Loading weights: 61%|######1 | 243/398 [00:00<00:00, 5641.19it/s, Materializing param=vision_model.encoder.layers.2.self_attn.k_proj.weight] |
| Loading weights: 61%|######1 | 243/398 [00:00<00:00, 5633.17it/s, Materializing param=vision_model.encoder.layers.2.self_attn.k_proj.weight] |
| Loading weights: 61%|######1 | 244/398 [00:00<00:00, 5641.73it/s, Materializing param=vision_model.encoder.layers.2.self_attn.out_proj.bias] |
| Loading weights: 61%|######1 | 244/398 [00:00<00:00, 5633.84it/s, Materializing param=vision_model.encoder.layers.2.self_attn.out_proj.bias] |
| Loading weights: 62%|######1 | 245/398 [00:00<00:00, 5641.34it/s, Materializing param=vision_model.encoder.layers.2.self_attn.out_proj.weight] |
| Loading weights: 62%|######1 | 245/398 [00:00<00:00, 5631.67it/s, Materializing param=vision_model.encoder.layers.2.self_attn.out_proj.weight] |
| Loading weights: 62%|######1 | 246/398 [00:00<00:00, 5640.19it/s, Materializing param=vision_model.encoder.layers.2.self_attn.q_proj.bias] |
| Loading weights: 62%|######1 | 246/398 [00:00<00:00, 5632.77it/s, Materializing param=vision_model.encoder.layers.2.self_attn.q_proj.bias] |
| Loading weights: 62%|######2 | 247/398 [00:00<00:00, 5642.05it/s, Materializing param=vision_model.encoder.layers.2.self_attn.q_proj.weight] |
| Loading weights: 62%|######2 | 247/398 [00:00<00:00, 5634.90it/s, Materializing param=vision_model.encoder.layers.2.self_attn.q_proj.weight] |
| Loading weights: 62%|######2 | 248/398 [00:00<00:00, 5644.60it/s, Materializing param=vision_model.encoder.layers.2.self_attn.v_proj.bias] |
| Loading weights: 62%|######2 | 248/398 [00:00<00:00, 5637.51it/s, Materializing param=vision_model.encoder.layers.2.self_attn.v_proj.bias] |
| Loading weights: 63%|######2 | 249/398 [00:00<00:00, 5647.02it/s, Materializing param=vision_model.encoder.layers.2.self_attn.v_proj.weight] |
| Loading weights: 63%|######2 | 249/398 [00:00<00:00, 5639.45it/s, Materializing param=vision_model.encoder.layers.2.self_attn.v_proj.weight] |
| Loading weights: 63%|######2 | 250/398 [00:00<00:00, 5647.07it/s, Materializing param=vision_model.encoder.layers.3.layer_norm1.bias] |
| Loading weights: 63%|######2 | 250/398 [00:00<00:00, 5638.93it/s, Materializing param=vision_model.encoder.layers.3.layer_norm1.bias] |
| Loading weights: 63%|######3 | 251/398 [00:00<00:00, 5647.06it/s, Materializing param=vision_model.encoder.layers.3.layer_norm1.weight] |
| Loading weights: 63%|######3 | 251/398 [00:00<00:00, 5639.44it/s, Materializing param=vision_model.encoder.layers.3.layer_norm1.weight] |
| Loading weights: 63%|######3 | 252/398 [00:00<00:00, 5647.96it/s, Materializing param=vision_model.encoder.layers.3.layer_norm2.bias] |
| Loading weights: 63%|######3 | 252/398 [00:00<00:00, 5640.42it/s, Materializing param=vision_model.encoder.layers.3.layer_norm2.bias] |
| Loading weights: 64%|######3 | 253/398 [00:00<00:00, 5649.51it/s, Materializing param=vision_model.encoder.layers.3.layer_norm2.weight] |
| Loading weights: 64%|######3 | 253/398 [00:00<00:00, 5642.33it/s, Materializing param=vision_model.encoder.layers.3.layer_norm2.weight] |
| Loading weights: 64%|######3 | 254/398 [00:00<00:00, 5650.87it/s, Materializing param=vision_model.encoder.layers.3.mlp.fc1.bias] |
| Loading weights: 64%|######3 | 254/398 [00:00<00:00, 5643.15it/s, Materializing param=vision_model.encoder.layers.3.mlp.fc1.bias] |
| Loading weights: 64%|######4 | 255/398 [00:00<00:00, 5651.39it/s, Materializing param=vision_model.encoder.layers.3.mlp.fc1.weight] |
| Loading weights: 64%|######4 | 255/398 [00:00<00:00, 5643.84it/s, Materializing param=vision_model.encoder.layers.3.mlp.fc1.weight] |
| Loading weights: 64%|######4 | 256/398 [00:00<00:00, 5652.17it/s, Materializing param=vision_model.encoder.layers.3.mlp.fc2.bias] |
| Loading weights: 64%|######4 | 256/398 [00:00<00:00, 5645.69it/s, Materializing param=vision_model.encoder.layers.3.mlp.fc2.bias] |
| Loading weights: 65%|######4 | 257/398 [00:00<00:00, 5614.89it/s, Materializing param=vision_model.encoder.layers.3.mlp.fc2.weight] |
| Loading weights: 65%|######4 | 257/398 [00:00<00:00, 5603.54it/s, Materializing param=vision_model.encoder.layers.3.mlp.fc2.weight] |
| Loading weights: 65%|######4 | 258/398 [00:00<00:00, 5606.86it/s, Materializing param=vision_model.encoder.layers.3.self_attn.k_proj.bias] |
| Loading weights: 65%|######4 | 258/398 [00:00<00:00, 5598.94it/s, Materializing param=vision_model.encoder.layers.3.self_attn.k_proj.bias] |
| Loading weights: 65%|######5 | 259/398 [00:00<00:00, 5606.00it/s, Materializing param=vision_model.encoder.layers.3.self_attn.k_proj.weight] |
| Loading weights: 65%|######5 | 259/398 [00:00<00:00, 5598.40it/s, Materializing param=vision_model.encoder.layers.3.self_attn.k_proj.weight] |
| Loading weights: 65%|######5 | 260/398 [00:00<00:00, 5606.55it/s, Materializing param=vision_model.encoder.layers.3.self_attn.out_proj.bias] |
| Loading weights: 65%|######5 | 260/398 [00:00<00:00, 5599.70it/s, Materializing param=vision_model.encoder.layers.3.self_attn.out_proj.bias] |
| Loading weights: 66%|######5 | 261/398 [00:00<00:00, 5602.57it/s, Materializing param=vision_model.encoder.layers.3.self_attn.out_proj.weight] |
| Loading weights: 66%|######5 | 261/398 [00:00<00:00, 5593.15it/s, Materializing param=vision_model.encoder.layers.3.self_attn.out_proj.weight] |
| Loading weights: 66%|######5 | 262/398 [00:00<00:00, 5601.81it/s, Materializing param=vision_model.encoder.layers.3.self_attn.q_proj.bias] |
| Loading weights: 66%|######5 | 262/398 [00:00<00:00, 5595.74it/s, Materializing param=vision_model.encoder.layers.3.self_attn.q_proj.bias] |
| Loading weights: 66%|######6 | 263/398 [00:00<00:00, 5605.82it/s, Materializing param=vision_model.encoder.layers.3.self_attn.q_proj.weight] |
| Loading weights: 66%|######6 | 263/398 [00:00<00:00, 5596.41it/s, Materializing param=vision_model.encoder.layers.3.self_attn.q_proj.weight] |
| Loading weights: 66%|######6 | 264/398 [00:00<00:00, 5603.56it/s, Materializing param=vision_model.encoder.layers.3.self_attn.v_proj.bias] |
| Loading weights: 66%|######6 | 264/398 [00:00<00:00, 5595.66it/s, Materializing param=vision_model.encoder.layers.3.self_attn.v_proj.bias] |
| Loading weights: 67%|######6 | 265/398 [00:00<00:00, 5602.10it/s, Materializing param=vision_model.encoder.layers.3.self_attn.v_proj.weight] |
| Loading weights: 67%|######6 | 265/398 [00:00<00:00, 5596.04it/s, Materializing param=vision_model.encoder.layers.3.self_attn.v_proj.weight] |
| Loading weights: 67%|######6 | 266/398 [00:00<00:00, 5606.79it/s, Materializing param=vision_model.encoder.layers.4.layer_norm1.bias] |
| Loading weights: 67%|######6 | 266/398 [00:00<00:00, 5600.38it/s, Materializing param=vision_model.encoder.layers.4.layer_norm1.bias] |
| Loading weights: 67%|######7 | 267/398 [00:00<00:00, 5610.81it/s, Materializing param=vision_model.encoder.layers.4.layer_norm1.weight] |
| Loading weights: 67%|######7 | 267/398 [00:00<00:00, 5605.59it/s, Materializing param=vision_model.encoder.layers.4.layer_norm1.weight] |
| Loading weights: 67%|######7 | 268/398 [00:00<00:00, 5616.97it/s, Materializing param=vision_model.encoder.layers.4.layer_norm2.bias] |
| Loading weights: 67%|######7 | 268/398 [00:00<00:00, 5612.01it/s, Materializing param=vision_model.encoder.layers.4.layer_norm2.bias] |
| Loading weights: 68%|######7 | 269/398 [00:00<00:00, 5623.65it/s, Materializing param=vision_model.encoder.layers.4.layer_norm2.weight] |
| Loading weights: 68%|######7 | 269/398 [00:00<00:00, 5618.53it/s, Materializing param=vision_model.encoder.layers.4.layer_norm2.weight] |
| Loading weights: 68%|######7 | 270/398 [00:00<00:00, 5629.97it/s, Materializing param=vision_model.encoder.layers.4.mlp.fc1.bias] |
| Loading weights: 68%|######7 | 270/398 [00:00<00:00, 5624.90it/s, Materializing param=vision_model.encoder.layers.4.mlp.fc1.bias] |
| Loading weights: 68%|######8 | 271/398 [00:00<00:00, 5636.28it/s, Materializing param=vision_model.encoder.layers.4.mlp.fc1.weight] |
| Loading weights: 68%|######8 | 271/398 [00:00<00:00, 5631.28it/s, Materializing param=vision_model.encoder.layers.4.mlp.fc1.weight] |
| Loading weights: 68%|######8 | 272/398 [00:00<00:00, 5642.64it/s, Materializing param=vision_model.encoder.layers.4.mlp.fc2.bias] |
| Loading weights: 68%|######8 | 272/398 [00:00<00:00, 5637.59it/s, Materializing param=vision_model.encoder.layers.4.mlp.fc2.bias] |
| Loading weights: 69%|######8 | 273/398 [00:00<00:00, 5649.30it/s, Materializing param=vision_model.encoder.layers.4.mlp.fc2.weight] |
| Loading weights: 69%|######8 | 273/398 [00:00<00:00, 5644.23it/s, Materializing param=vision_model.encoder.layers.4.mlp.fc2.weight] |
| Loading weights: 69%|######8 | 274/398 [00:00<00:00, 5655.71it/s, Materializing param=vision_model.encoder.layers.4.self_attn.k_proj.bias] |
| Loading weights: 69%|######8 | 274/398 [00:00<00:00, 5650.56it/s, Materializing param=vision_model.encoder.layers.4.self_attn.k_proj.bias] |
| Loading weights: 69%|######9 | 275/398 [00:00<00:00, 5661.61it/s, Materializing param=vision_model.encoder.layers.4.self_attn.k_proj.weight] |
| Loading weights: 69%|######9 | 275/398 [00:00<00:00, 5656.50it/s, Materializing param=vision_model.encoder.layers.4.self_attn.k_proj.weight] |
| Loading weights: 69%|######9 | 276/398 [00:00<00:00, 5668.01it/s, Materializing param=vision_model.encoder.layers.4.self_attn.out_proj.bias] |
| Loading weights: 69%|######9 | 276/398 [00:00<00:00, 5662.96it/s, Materializing param=vision_model.encoder.layers.4.self_attn.out_proj.bias] |
| Loading weights: 70%|######9 | 277/398 [00:00<00:00, 5674.04it/s, Materializing param=vision_model.encoder.layers.4.self_attn.out_proj.weight] |
| Loading weights: 70%|######9 | 277/398 [00:00<00:00, 5668.89it/s, Materializing param=vision_model.encoder.layers.4.self_attn.out_proj.weight] |
| Loading weights: 70%|######9 | 278/398 [00:00<00:00, 5679.74it/s, Materializing param=vision_model.encoder.layers.4.self_attn.q_proj.bias] |
| Loading weights: 70%|######9 | 278/398 [00:00<00:00, 5674.54it/s, Materializing param=vision_model.encoder.layers.4.self_attn.q_proj.bias] |
| Loading weights: 70%|####### | 279/398 [00:00<00:00, 5685.74it/s, Materializing param=vision_model.encoder.layers.4.self_attn.q_proj.weight] |
| Loading weights: 70%|####### | 279/398 [00:00<00:00, 5680.80it/s, Materializing param=vision_model.encoder.layers.4.self_attn.q_proj.weight] |
| Loading weights: 70%|####### | 280/398 [00:00<00:00, 5691.99it/s, Materializing param=vision_model.encoder.layers.4.self_attn.v_proj.bias] |
| Loading weights: 70%|####### | 280/398 [00:00<00:00, 5686.81it/s, Materializing param=vision_model.encoder.layers.4.self_attn.v_proj.bias] |
| Loading weights: 71%|####### | 281/398 [00:00<00:00, 5697.90it/s, Materializing param=vision_model.encoder.layers.4.self_attn.v_proj.weight] |
| Loading weights: 71%|####### | 281/398 [00:00<00:00, 5692.86it/s, Materializing param=vision_model.encoder.layers.4.self_attn.v_proj.weight] |
| Loading weights: 71%|####### | 282/398 [00:00<00:00, 5704.17it/s, Materializing param=vision_model.encoder.layers.5.layer_norm1.bias] |
| Loading weights: 71%|####### | 282/398 [00:00<00:00, 5699.39it/s, Materializing param=vision_model.encoder.layers.5.layer_norm1.bias] |
| Loading weights: 71%|#######1 | 283/398 [00:00<00:00, 5709.78it/s, Materializing param=vision_model.encoder.layers.5.layer_norm1.weight] |
| Loading weights: 71%|#######1 | 283/398 [00:00<00:00, 5704.73it/s, Materializing param=vision_model.encoder.layers.5.layer_norm1.weight] |
| Loading weights: 71%|#######1 | 284/398 [00:00<00:00, 5715.98it/s, Materializing param=vision_model.encoder.layers.5.layer_norm2.bias] |
| Loading weights: 71%|#######1 | 284/398 [00:00<00:00, 5711.05it/s, Materializing param=vision_model.encoder.layers.5.layer_norm2.bias] |
| Loading weights: 72%|#######1 | 285/398 [00:00<00:00, 5722.30it/s, Materializing param=vision_model.encoder.layers.5.layer_norm2.weight] |
| Loading weights: 72%|#######1 | 285/398 [00:00<00:00, 5717.13it/s, Materializing param=vision_model.encoder.layers.5.layer_norm2.weight] |
| Loading weights: 72%|#######1 | 286/398 [00:00<00:00, 5728.23it/s, Materializing param=vision_model.encoder.layers.5.mlp.fc1.bias] |
| Loading weights: 72%|#######1 | 286/398 [00:00<00:00, 5723.58it/s, Materializing param=vision_model.encoder.layers.5.mlp.fc1.bias] |
| Loading weights: 72%|#######2 | 287/398 [00:00<00:00, 5734.10it/s, Materializing param=vision_model.encoder.layers.5.mlp.fc1.weight] |
| Loading weights: 72%|#######2 | 287/398 [00:00<00:00, 5729.16it/s, Materializing param=vision_model.encoder.layers.5.mlp.fc1.weight] |
| Loading weights: 72%|#######2 | 288/398 [00:00<00:00, 5739.97it/s, Materializing param=vision_model.encoder.layers.5.mlp.fc2.bias] |
| Loading weights: 72%|#######2 | 288/398 [00:00<00:00, 5734.93it/s, Materializing param=vision_model.encoder.layers.5.mlp.fc2.bias] |
| Loading weights: 73%|#######2 | 289/398 [00:00<00:00, 5745.79it/s, Materializing param=vision_model.encoder.layers.5.mlp.fc2.weight] |
| Loading weights: 73%|#######2 | 289/398 [00:00<00:00, 5740.64it/s, Materializing param=vision_model.encoder.layers.5.mlp.fc2.weight] |
| Loading weights: 73%|#######2 | 290/398 [00:00<00:00, 5751.22it/s, Materializing param=vision_model.encoder.layers.5.self_attn.k_proj.bias] |
| Loading weights: 73%|#######2 | 290/398 [00:00<00:00, 5746.00it/s, Materializing param=vision_model.encoder.layers.5.self_attn.k_proj.bias] |
| Loading weights: 73%|#######3 | 291/398 [00:00<00:00, 5756.33it/s, Materializing param=vision_model.encoder.layers.5.self_attn.k_proj.weight] |
| Loading weights: 73%|#######3 | 291/398 [00:00<00:00, 5751.33it/s, Materializing param=vision_model.encoder.layers.5.self_attn.k_proj.weight] |
| Loading weights: 73%|#######3 | 292/398 [00:00<00:00, 5761.92it/s, Materializing param=vision_model.encoder.layers.5.self_attn.out_proj.bias] |
| Loading weights: 73%|#######3 | 292/398 [00:00<00:00, 5756.78it/s, Materializing param=vision_model.encoder.layers.5.self_attn.out_proj.bias] |
| Loading weights: 74%|#######3 | 293/398 [00:00<00:00, 5767.17it/s, Materializing param=vision_model.encoder.layers.5.self_attn.out_proj.weight] |
| Loading weights: 74%|#######3 | 293/398 [00:00<00:00, 5761.92it/s, Materializing param=vision_model.encoder.layers.5.self_attn.out_proj.weight] |
| Loading weights: 74%|#######3 | 294/398 [00:00<00:00, 5771.90it/s, Materializing param=vision_model.encoder.layers.5.self_attn.q_proj.bias] |
| Loading weights: 74%|#######3 | 294/398 [00:00<00:00, 5766.98it/s, Materializing param=vision_model.encoder.layers.5.self_attn.q_proj.bias] |
| Loading weights: 74%|#######4 | 295/398 [00:00<00:00, 5777.49it/s, Materializing param=vision_model.encoder.layers.5.self_attn.q_proj.weight] |
| Loading weights: 74%|#######4 | 295/398 [00:00<00:00, 5772.62it/s, Materializing param=vision_model.encoder.layers.5.self_attn.q_proj.weight] |
| Loading weights: 74%|#######4 | 296/398 [00:00<00:00, 5783.33it/s, Materializing param=vision_model.encoder.layers.5.self_attn.v_proj.bias] |
| Loading weights: 74%|#######4 | 296/398 [00:00<00:00, 5778.49it/s, Materializing param=vision_model.encoder.layers.5.self_attn.v_proj.bias] |
| Loading weights: 75%|#######4 | 297/398 [00:00<00:00, 5789.39it/s, Materializing param=vision_model.encoder.layers.5.self_attn.v_proj.weight] |
| Loading weights: 75%|#######4 | 297/398 [00:00<00:00, 5784.49it/s, Materializing param=vision_model.encoder.layers.5.self_attn.v_proj.weight] |
| Loading weights: 75%|#######4 | 298/398 [00:00<00:00, 5795.23it/s, Materializing param=vision_model.encoder.layers.6.layer_norm1.bias] |
| Loading weights: 75%|#######4 | 298/398 [00:00<00:00, 5790.45it/s, Materializing param=vision_model.encoder.layers.6.layer_norm1.bias] |
| Loading weights: 75%|#######5 | 299/398 [00:00<00:00, 5800.23it/s, Materializing param=vision_model.encoder.layers.6.layer_norm1.weight] |
| Loading weights: 75%|#######5 | 299/398 [00:00<00:00, 5795.33it/s, Materializing param=vision_model.encoder.layers.6.layer_norm1.weight] |
| Loading weights: 75%|#######5 | 300/398 [00:00<00:00, 5805.93it/s, Materializing param=vision_model.encoder.layers.6.layer_norm2.bias] |
| Loading weights: 75%|#######5 | 300/398 [00:00<00:00, 5801.14it/s, Materializing param=vision_model.encoder.layers.6.layer_norm2.bias] |
| Loading weights: 76%|#######5 | 301/398 [00:00<00:00, 5811.80it/s, Materializing param=vision_model.encoder.layers.6.layer_norm2.weight] |
| Loading weights: 76%|#######5 | 301/398 [00:00<00:00, 5806.83it/s, Materializing param=vision_model.encoder.layers.6.layer_norm2.weight] |
| Loading weights: 76%|#######5 | 302/398 [00:00<00:00, 5817.40it/s, Materializing param=vision_model.encoder.layers.6.mlp.fc1.bias] |
| Loading weights: 76%|#######5 | 302/398 [00:00<00:00, 5812.56it/s, Materializing param=vision_model.encoder.layers.6.mlp.fc1.bias] |
| Loading weights: 76%|#######6 | 303/398 [00:00<00:00, 5822.91it/s, Materializing param=vision_model.encoder.layers.6.mlp.fc1.weight] |
| Loading weights: 76%|#######6 | 303/398 [00:00<00:00, 5818.14it/s, Materializing param=vision_model.encoder.layers.6.mlp.fc1.weight] |
| Loading weights: 76%|#######6 | 304/398 [00:00<00:00, 5828.72it/s, Materializing param=vision_model.encoder.layers.6.mlp.fc2.bias] |
| Loading weights: 76%|#######6 | 304/398 [00:00<00:00, 5824.07it/s, Materializing param=vision_model.encoder.layers.6.mlp.fc2.bias] |
| Loading weights: 77%|#######6 | 305/398 [00:00<00:00, 5834.67it/s, Materializing param=vision_model.encoder.layers.6.mlp.fc2.weight] |
| Loading weights: 77%|#######6 | 305/398 [00:00<00:00, 5829.86it/s, Materializing param=vision_model.encoder.layers.6.mlp.fc2.weight] |
| Loading weights: 77%|#######6 | 306/398 [00:00<00:00, 5840.35it/s, Materializing param=vision_model.encoder.layers.6.self_attn.k_proj.bias] |
| Loading weights: 77%|#######6 | 306/398 [00:00<00:00, 5835.54it/s, Materializing param=vision_model.encoder.layers.6.self_attn.k_proj.bias] |
| Loading weights: 77%|#######7 | 307/398 [00:00<00:00, 5845.89it/s, Materializing param=vision_model.encoder.layers.6.self_attn.k_proj.weight] |
| Loading weights: 77%|#######7 | 307/398 [00:00<00:00, 5841.25it/s, Materializing param=vision_model.encoder.layers.6.self_attn.k_proj.weight] |
| Loading weights: 77%|#######7 | 308/398 [00:00<00:00, 5851.31it/s, Materializing param=vision_model.encoder.layers.6.self_attn.out_proj.bias] |
| Loading weights: 77%|#######7 | 308/398 [00:00<00:00, 5846.57it/s, Materializing param=vision_model.encoder.layers.6.self_attn.out_proj.bias] |
| Loading weights: 78%|#######7 | 309/398 [00:00<00:00, 5856.91it/s, Materializing param=vision_model.encoder.layers.6.self_attn.out_proj.weight] |
| Loading weights: 78%|#######7 | 309/398 [00:00<00:00, 5852.04it/s, Materializing param=vision_model.encoder.layers.6.self_attn.out_proj.weight] |
| Loading weights: 78%|#######7 | 310/398 [00:00<00:00, 5861.85it/s, Materializing param=vision_model.encoder.layers.6.self_attn.q_proj.bias] |
| Loading weights: 78%|#######7 | 310/398 [00:00<00:00, 5856.83it/s, Materializing param=vision_model.encoder.layers.6.self_attn.q_proj.bias] |
| Loading weights: 78%|#######8 | 311/398 [00:00<00:00, 5864.29it/s, Materializing param=vision_model.encoder.layers.6.self_attn.q_proj.weight] |
| Loading weights: 78%|#######8 | 311/398 [00:00<00:00, 5859.26it/s, Materializing param=vision_model.encoder.layers.6.self_attn.q_proj.weight] |
| Loading weights: 78%|#######8 | 312/398 [00:00<00:00, 5869.29it/s, Materializing param=vision_model.encoder.layers.6.self_attn.v_proj.bias] |
| Loading weights: 78%|#######8 | 312/398 [00:00<00:00, 5864.45it/s, Materializing param=vision_model.encoder.layers.6.self_attn.v_proj.bias] |
| Loading weights: 79%|#######8 | 313/398 [00:00<00:00, 5874.27it/s, Materializing param=vision_model.encoder.layers.6.self_attn.v_proj.weight] |
| Loading weights: 79%|#######8 | 313/398 [00:00<00:00, 5869.36it/s, Materializing param=vision_model.encoder.layers.6.self_attn.v_proj.weight] |
| Loading weights: 79%|#######8 | 314/398 [00:00<00:00, 5879.10it/s, Materializing param=vision_model.encoder.layers.7.layer_norm1.bias] |
| Loading weights: 79%|#######8 | 314/398 [00:00<00:00, 5874.06it/s, Materializing param=vision_model.encoder.layers.7.layer_norm1.bias] |
| Loading weights: 79%|#######9 | 315/398 [00:00<00:00, 5882.98it/s, Materializing param=vision_model.encoder.layers.7.layer_norm1.weight] |
| Loading weights: 79%|#######9 | 315/398 [00:00<00:00, 5878.06it/s, Materializing param=vision_model.encoder.layers.7.layer_norm1.weight] |
| Loading weights: 79%|#######9 | 316/398 [00:00<00:00, 5888.13it/s, Materializing param=vision_model.encoder.layers.7.layer_norm2.bias] |
| Loading weights: 79%|#######9 | 316/398 [00:00<00:00, 5883.37it/s, Materializing param=vision_model.encoder.layers.7.layer_norm2.bias] |
| Loading weights: 80%|#######9 | 317/398 [00:00<00:00, 5893.07it/s, Materializing param=vision_model.encoder.layers.7.layer_norm2.weight] |
| Loading weights: 80%|#######9 | 317/398 [00:00<00:00, 5888.35it/s, Materializing param=vision_model.encoder.layers.7.layer_norm2.weight] |
| Loading weights: 80%|#######9 | 318/398 [00:00<00:00, 5898.25it/s, Materializing param=vision_model.encoder.layers.7.mlp.fc1.bias] |
| Loading weights: 80%|#######9 | 318/398 [00:00<00:00, 5893.40it/s, Materializing param=vision_model.encoder.layers.7.mlp.fc1.bias] |
| Loading weights: 80%|######## | 319/398 [00:00<00:00, 5903.01it/s, Materializing param=vision_model.encoder.layers.7.mlp.fc1.weight] |
| Loading weights: 80%|######## | 319/398 [00:00<00:00, 5898.28it/s, Materializing param=vision_model.encoder.layers.7.mlp.fc1.weight] |
| Loading weights: 80%|######## | 320/398 [00:00<00:00, 5908.33it/s, Materializing param=vision_model.encoder.layers.7.mlp.fc2.bias] |
| Loading weights: 80%|######## | 320/398 [00:00<00:00, 5903.68it/s, Materializing param=vision_model.encoder.layers.7.mlp.fc2.bias] |
| Loading weights: 81%|######## | 321/398 [00:00<00:00, 5913.78it/s, Materializing param=vision_model.encoder.layers.7.mlp.fc2.weight] |
| Loading weights: 81%|######## | 321/398 [00:00<00:00, 5909.26it/s, Materializing param=vision_model.encoder.layers.7.mlp.fc2.weight] |
| Loading weights: 81%|######## | 322/398 [00:00<00:00, 5919.30it/s, Materializing param=vision_model.encoder.layers.7.self_attn.k_proj.bias] |
| Loading weights: 81%|######## | 322/398 [00:00<00:00, 5914.69it/s, Materializing param=vision_model.encoder.layers.7.self_attn.k_proj.bias] |
| Loading weights: 81%|########1 | 323/398 [00:00<00:00, 5924.37it/s, Materializing param=vision_model.encoder.layers.7.self_attn.k_proj.weight] |
| Loading weights: 81%|########1 | 323/398 [00:00<00:00, 5919.71it/s, Materializing param=vision_model.encoder.layers.7.self_attn.k_proj.weight] |
| Loading weights: 81%|########1 | 324/398 [00:00<00:00, 5929.43it/s, Materializing param=vision_model.encoder.layers.7.self_attn.out_proj.bias] |
| Loading weights: 81%|########1 | 324/398 [00:00<00:00, 5924.70it/s, Materializing param=vision_model.encoder.layers.7.self_attn.out_proj.bias] |
| Loading weights: 82%|########1 | 325/398 [00:00<00:00, 5934.42it/s, Materializing param=vision_model.encoder.layers.7.self_attn.out_proj.weight] |
| Loading weights: 82%|########1 | 325/398 [00:00<00:00, 5929.65it/s, Materializing param=vision_model.encoder.layers.7.self_attn.out_proj.weight] |
| Loading weights: 82%|########1 | 326/398 [00:00<00:00, 5939.24it/s, Materializing param=vision_model.encoder.layers.7.self_attn.q_proj.bias] |
| Loading weights: 82%|########1 | 326/398 [00:00<00:00, 5934.52it/s, Materializing param=vision_model.encoder.layers.7.self_attn.q_proj.bias] |
| Loading weights: 82%|########2 | 327/398 [00:00<00:00, 5943.98it/s, Materializing param=vision_model.encoder.layers.7.self_attn.q_proj.weight] |
| Loading weights: 82%|########2 | 327/398 [00:00<00:00, 5939.40it/s, Materializing param=vision_model.encoder.layers.7.self_attn.q_proj.weight] |
| Loading weights: 82%|########2 | 328/398 [00:00<00:00, 5949.32it/s, Materializing param=vision_model.encoder.layers.7.self_attn.v_proj.bias] |
| Loading weights: 82%|########2 | 328/398 [00:00<00:00, 5944.77it/s, Materializing param=vision_model.encoder.layers.7.self_attn.v_proj.bias] |
| Loading weights: 83%|########2 | 329/398 [00:00<00:00, 5954.55it/s, Materializing param=vision_model.encoder.layers.7.self_attn.v_proj.weight] |
| Loading weights: 83%|########2 | 329/398 [00:00<00:00, 5949.85it/s, Materializing param=vision_model.encoder.layers.7.self_attn.v_proj.weight] |
| Loading weights: 83%|########2 | 330/398 [00:00<00:00, 5959.69it/s, Materializing param=vision_model.encoder.layers.8.layer_norm1.bias] |
| Loading weights: 83%|########2 | 330/398 [00:00<00:00, 5955.15it/s, Materializing param=vision_model.encoder.layers.8.layer_norm1.bias] |
| Loading weights: 83%|########3 | 331/398 [00:00<00:00, 5961.99it/s, Materializing param=vision_model.encoder.layers.8.layer_norm1.weight] |
| Loading weights: 83%|########3 | 331/398 [00:00<00:00, 5957.13it/s, Materializing param=vision_model.encoder.layers.8.layer_norm1.weight] |
| Loading weights: 83%|########3 | 332/398 [00:00<00:00, 5966.83it/s, Materializing param=vision_model.encoder.layers.8.layer_norm2.bias] |
| Loading weights: 83%|########3 | 332/398 [00:00<00:00, 5962.18it/s, Materializing param=vision_model.encoder.layers.8.layer_norm2.bias] |
| Loading weights: 84%|########3 | 333/398 [00:00<00:00, 5972.03it/s, Materializing param=vision_model.encoder.layers.8.layer_norm2.weight] |
| Loading weights: 84%|########3 | 333/398 [00:00<00:00, 5967.49it/s, Materializing param=vision_model.encoder.layers.8.layer_norm2.weight] |
| Loading weights: 84%|########3 | 334/398 [00:00<00:00, 5977.06it/s, Materializing param=vision_model.encoder.layers.8.mlp.fc1.bias] |
| Loading weights: 84%|########3 | 334/398 [00:00<00:00, 5972.45it/s, Materializing param=vision_model.encoder.layers.8.mlp.fc1.bias] |
| Loading weights: 84%|########4 | 335/398 [00:00<00:00, 5981.91it/s, Materializing param=vision_model.encoder.layers.8.mlp.fc1.weight] |
| Loading weights: 84%|########4 | 335/398 [00:00<00:00, 5977.13it/s, Materializing param=vision_model.encoder.layers.8.mlp.fc1.weight] |
| Loading weights: 84%|########4 | 336/398 [00:00<00:00, 5986.57it/s, Materializing param=vision_model.encoder.layers.8.mlp.fc2.bias] |
| Loading weights: 84%|########4 | 336/398 [00:00<00:00, 5981.97it/s, Materializing param=vision_model.encoder.layers.8.mlp.fc2.bias] |
| Loading weights: 85%|########4 | 337/398 [00:00<00:00, 5991.38it/s, Materializing param=vision_model.encoder.layers.8.mlp.fc2.weight] |
| Loading weights: 85%|########4 | 337/398 [00:00<00:00, 5986.89it/s, Materializing param=vision_model.encoder.layers.8.mlp.fc2.weight] |
| Loading weights: 85%|########4 | 338/398 [00:00<00:00, 5996.50it/s, Materializing param=vision_model.encoder.layers.8.self_attn.k_proj.bias] |
| Loading weights: 85%|########4 | 338/398 [00:00<00:00, 5991.74it/s, Materializing param=vision_model.encoder.layers.8.self_attn.k_proj.bias] |
| Loading weights: 85%|########5 | 339/398 [00:00<00:00, 6000.69it/s, Materializing param=vision_model.encoder.layers.8.self_attn.k_proj.weight] |
| Loading weights: 85%|########5 | 339/398 [00:00<00:00, 5996.11it/s, Materializing param=vision_model.encoder.layers.8.self_attn.k_proj.weight] |
| Loading weights: 85%|########5 | 340/398 [00:00<00:00, 6005.31it/s, Materializing param=vision_model.encoder.layers.8.self_attn.out_proj.bias] |
| Loading weights: 85%|########5 | 340/398 [00:00<00:00, 6000.79it/s, Materializing param=vision_model.encoder.layers.8.self_attn.out_proj.bias] |
| Loading weights: 86%|########5 | 341/398 [00:00<00:00, 6010.14it/s, Materializing param=vision_model.encoder.layers.8.self_attn.out_proj.weight] |
| Loading weights: 86%|########5 | 341/398 [00:00<00:00, 6005.42it/s, Materializing param=vision_model.encoder.layers.8.self_attn.out_proj.weight] |
| Loading weights: 86%|########5 | 342/398 [00:00<00:00, 6014.70it/s, Materializing param=vision_model.encoder.layers.8.self_attn.q_proj.bias] |
| Loading weights: 86%|########5 | 342/398 [00:00<00:00, 6009.86it/s, Materializing param=vision_model.encoder.layers.8.self_attn.q_proj.bias] |
| Loading weights: 86%|########6 | 343/398 [00:00<00:00, 6018.94it/s, Materializing param=vision_model.encoder.layers.8.self_attn.q_proj.weight] |
| Loading weights: 86%|########6 | 343/398 [00:00<00:00, 6014.23it/s, Materializing param=vision_model.encoder.layers.8.self_attn.q_proj.weight] |
| Loading weights: 86%|########6 | 344/398 [00:00<00:00, 6023.38it/s, Materializing param=vision_model.encoder.layers.8.self_attn.v_proj.bias] |
| Loading weights: 86%|########6 | 344/398 [00:00<00:00, 6018.73it/s, Materializing param=vision_model.encoder.layers.8.self_attn.v_proj.bias] |
| Loading weights: 87%|########6 | 345/398 [00:00<00:00, 6027.68it/s, Materializing param=vision_model.encoder.layers.8.self_attn.v_proj.weight] |
| Loading weights: 87%|########6 | 345/398 [00:00<00:00, 6023.09it/s, Materializing param=vision_model.encoder.layers.8.self_attn.v_proj.weight] |
| Loading weights: 87%|########6 | 346/398 [00:00<00:00, 6031.98it/s, Materializing param=vision_model.encoder.layers.9.layer_norm1.bias] |
| Loading weights: 87%|########6 | 346/398 [00:00<00:00, 6027.30it/s, Materializing param=vision_model.encoder.layers.9.layer_norm1.bias] |
| Loading weights: 87%|########7 | 347/398 [00:00<00:00, 6036.17it/s, Materializing param=vision_model.encoder.layers.9.layer_norm1.weight] |
| Loading weights: 87%|########7 | 347/398 [00:00<00:00, 6030.94it/s, Materializing param=vision_model.encoder.layers.9.layer_norm1.weight] |
| Loading weights: 87%|########7 | 348/398 [00:00<00:00, 6040.19it/s, Materializing param=vision_model.encoder.layers.9.layer_norm2.bias] |
| Loading weights: 87%|########7 | 348/398 [00:00<00:00, 6035.64it/s, Materializing param=vision_model.encoder.layers.9.layer_norm2.bias] |
| Loading weights: 88%|########7 | 349/398 [00:00<00:00, 6044.66it/s, Materializing param=vision_model.encoder.layers.9.layer_norm2.weight] |
| Loading weights: 88%|########7 | 349/398 [00:00<00:00, 6040.02it/s, Materializing param=vision_model.encoder.layers.9.layer_norm2.weight] |
| Loading weights: 88%|########7 | 350/398 [00:00<00:00, 6049.22it/s, Materializing param=vision_model.encoder.layers.9.mlp.fc1.bias] |
| Loading weights: 88%|########7 | 350/398 [00:00<00:00, 6044.66it/s, Materializing param=vision_model.encoder.layers.9.mlp.fc1.bias] |
| Loading weights: 88%|########8 | 351/398 [00:00<00:00, 6053.61it/s, Materializing param=vision_model.encoder.layers.9.mlp.fc1.weight] |
| Loading weights: 88%|########8 | 351/398 [00:00<00:00, 6049.15it/s, Materializing param=vision_model.encoder.layers.9.mlp.fc1.weight] |
| Loading weights: 88%|########8 | 352/398 [00:00<00:00, 6057.92it/s, Materializing param=vision_model.encoder.layers.9.mlp.fc2.bias] |
| Loading weights: 88%|########8 | 352/398 [00:00<00:00, 6053.50it/s, Materializing param=vision_model.encoder.layers.9.mlp.fc2.bias] |
| Loading weights: 89%|########8 | 353/398 [00:00<00:00, 6062.60it/s, Materializing param=vision_model.encoder.layers.9.mlp.fc2.weight] |
| Loading weights: 89%|########8 | 353/398 [00:00<00:00, 6058.11it/s, Materializing param=vision_model.encoder.layers.9.mlp.fc2.weight] |
| Loading weights: 89%|########8 | 354/398 [00:00<00:00, 6067.32it/s, Materializing param=vision_model.encoder.layers.9.self_attn.k_proj.bias] |
| Loading weights: 89%|########8 | 354/398 [00:00<00:00, 6062.87it/s, Materializing param=vision_model.encoder.layers.9.self_attn.k_proj.bias] |
| Loading weights: 89%|########9 | 355/398 [00:00<00:00, 6071.79it/s, Materializing param=vision_model.encoder.layers.9.self_attn.k_proj.weight] |
| Loading weights: 89%|########9 | 355/398 [00:00<00:00, 6067.13it/s, Materializing param=vision_model.encoder.layers.9.self_attn.k_proj.weight] |
| Loading weights: 89%|########9 | 356/398 [00:00<00:00, 6076.03it/s, Materializing param=vision_model.encoder.layers.9.self_attn.out_proj.bias] |
| Loading weights: 89%|########9 | 356/398 [00:00<00:00, 6071.46it/s, Materializing param=vision_model.encoder.layers.9.self_attn.out_proj.bias] |
| Loading weights: 90%|########9 | 357/398 [00:00<00:00, 6080.13it/s, Materializing param=vision_model.encoder.layers.9.self_attn.out_proj.weight] |
| Loading weights: 90%|########9 | 357/398 [00:00<00:00, 6075.35it/s, Materializing param=vision_model.encoder.layers.9.self_attn.out_proj.weight] |
| Loading weights: 90%|########9 | 358/398 [00:00<00:00, 6083.06it/s, Materializing param=vision_model.encoder.layers.9.self_attn.q_proj.bias] |
| Loading weights: 90%|########9 | 358/398 [00:00<00:00, 6077.91it/s, Materializing param=vision_model.encoder.layers.9.self_attn.q_proj.bias] |
| Loading weights: 90%|######### | 359/398 [00:00<00:00, 6085.36it/s, Materializing param=vision_model.encoder.layers.9.self_attn.q_proj.weight] |
| Loading weights: 90%|######### | 359/398 [00:00<00:00, 6029.41it/s, Materializing param=vision_model.encoder.layers.9.self_attn.q_proj.weight] |
| Loading weights: 90%|######### | 360/398 [00:00<00:00, 6020.58it/s, Materializing param=vision_model.encoder.layers.9.self_attn.v_proj.bias] |
| Loading weights: 90%|######### | 360/398 [00:00<00:00, 6014.18it/s, Materializing param=vision_model.encoder.layers.9.self_attn.v_proj.bias] |
| Loading weights: 91%|######### | 361/398 [00:00<00:00, 6020.91it/s, Materializing param=vision_model.encoder.layers.9.self_attn.v_proj.weight] |
| Loading weights: 91%|######### | 361/398 [00:00<00:00, 6016.24it/s, Materializing param=vision_model.encoder.layers.9.self_attn.v_proj.weight] |
| Loading weights: 91%|######### | 362/398 [00:00<00:00, 6022.93it/s, Materializing param=vision_model.encoder.layers.10.layer_norm1.bias] |
| Loading weights: 91%|######### | 362/398 [00:00<00:00, 6017.56it/s, Materializing param=vision_model.encoder.layers.10.layer_norm1.bias] |
| Loading weights: 91%|#########1| 363/398 [00:00<00:00, 6020.94it/s, Materializing param=vision_model.encoder.layers.10.layer_norm1.weight] |
| Loading weights: 91%|#########1| 363/398 [00:00<00:00, 6012.19it/s, Materializing param=vision_model.encoder.layers.10.layer_norm1.weight] |
| Loading weights: 91%|#########1| 364/398 [00:00<00:00, 6017.77it/s, Materializing param=vision_model.encoder.layers.10.layer_norm2.bias] |
| Loading weights: 91%|#########1| 364/398 [00:00<00:00, 6012.82it/s, Materializing param=vision_model.encoder.layers.10.layer_norm2.bias] |
| Loading weights: 92%|#########1| 365/398 [00:00<00:00, 6018.69it/s, Materializing param=vision_model.encoder.layers.10.layer_norm2.weight] |
| Loading weights: 92%|#########1| 365/398 [00:00<00:00, 6013.47it/s, Materializing param=vision_model.encoder.layers.10.layer_norm2.weight] |
| Loading weights: 92%|#########1| 366/398 [00:00<00:00, 6020.53it/s, Materializing param=vision_model.encoder.layers.10.mlp.fc1.bias] |
| Loading weights: 92%|#########1| 366/398 [00:00<00:00, 6015.98it/s, Materializing param=vision_model.encoder.layers.10.mlp.fc1.bias] |
| Loading weights: 92%|#########2| 367/398 [00:00<00:00, 6024.29it/s, Materializing param=vision_model.encoder.layers.10.mlp.fc1.weight] |
| Loading weights: 92%|#########2| 367/398 [00:00<00:00, 6020.50it/s, Materializing param=vision_model.encoder.layers.10.mlp.fc1.weight] |
| Loading weights: 92%|#########2| 368/398 [00:00<00:00, 6029.92it/s, Materializing param=vision_model.encoder.layers.10.mlp.fc2.bias] |
| Loading weights: 92%|#########2| 368/398 [00:00<00:00, 6026.42it/s, Materializing param=vision_model.encoder.layers.10.mlp.fc2.bias] |
| Loading weights: 93%|#########2| 369/398 [00:00<00:00, 6035.89it/s, Materializing param=vision_model.encoder.layers.10.mlp.fc2.weight] |
| Loading weights: 93%|#########2| 369/398 [00:00<00:00, 6031.82it/s, Materializing param=vision_model.encoder.layers.10.mlp.fc2.weight] |
| Loading weights: 93%|#########2| 370/398 [00:00<00:00, 6041.24it/s, Materializing param=vision_model.encoder.layers.10.self_attn.k_proj.bias] |
| Loading weights: 93%|#########2| 370/398 [00:00<00:00, 6037.48it/s, Materializing param=vision_model.encoder.layers.10.self_attn.k_proj.bias] |
| Loading weights: 93%|#########3| 371/398 [00:00<00:00, 6046.81it/s, Materializing param=vision_model.encoder.layers.10.self_attn.k_proj.weight] |
| Loading weights: 93%|#########3| 371/398 [00:00<00:00, 6042.94it/s, Materializing param=vision_model.encoder.layers.10.self_attn.k_proj.weight] |
| Loading weights: 93%|#########3| 372/398 [00:00<00:00, 6052.55it/s, Materializing param=vision_model.encoder.layers.10.self_attn.out_proj.bias] |
| Loading weights: 93%|#########3| 372/398 [00:00<00:00, 6048.89it/s, Materializing param=vision_model.encoder.layers.10.self_attn.out_proj.bias] |
| Loading weights: 94%|#########3| 373/398 [00:00<00:00, 6057.26it/s, Materializing param=vision_model.encoder.layers.10.self_attn.out_proj.weight] |
| Loading weights: 94%|#########3| 373/398 [00:00<00:00, 6053.53it/s, Materializing param=vision_model.encoder.layers.10.self_attn.out_proj.weight] |
| Loading weights: 94%|#########3| 374/398 [00:00<00:00, 6062.47it/s, Materializing param=vision_model.encoder.layers.10.self_attn.q_proj.bias] |
| Loading weights: 94%|#########3| 374/398 [00:00<00:00, 6058.67it/s, Materializing param=vision_model.encoder.layers.10.self_attn.q_proj.bias] |
| Loading weights: 94%|#########4| 375/398 [00:00<00:00, 6067.47it/s, Materializing param=vision_model.encoder.layers.10.self_attn.q_proj.weight] |
| Loading weights: 94%|#########4| 375/398 [00:00<00:00, 6063.14it/s, Materializing param=vision_model.encoder.layers.10.self_attn.q_proj.weight] |
| Loading weights: 94%|#########4| 376/398 [00:00<00:00, 6070.91it/s, Materializing param=vision_model.encoder.layers.10.self_attn.v_proj.bias] |
| Loading weights: 94%|#########4| 376/398 [00:00<00:00, 6066.40it/s, Materializing param=vision_model.encoder.layers.10.self_attn.v_proj.bias] |
| Loading weights: 95%|#########4| 377/398 [00:00<00:00, 6059.53it/s, Materializing param=vision_model.encoder.layers.10.self_attn.v_proj.weight] |
| Loading weights: 95%|#########4| 377/398 [00:00<00:00, 6047.97it/s, Materializing param=vision_model.encoder.layers.10.self_attn.v_proj.weight] |
| Loading weights: 95%|#########4| 378/398 [00:00<00:00, 6032.17it/s, Materializing param=vision_model.encoder.layers.11.layer_norm1.bias] |
| Loading weights: 95%|#########4| 378/398 [00:00<00:00, 6019.73it/s, Materializing param=vision_model.encoder.layers.11.layer_norm1.bias] |
| Loading weights: 95%|#########5| 379/398 [00:00<00:00, 6021.28it/s, Materializing param=vision_model.encoder.layers.11.layer_norm1.weight] |
| Loading weights: 95%|#########5| 379/398 [00:00<00:00, 6015.49it/s, Materializing param=vision_model.encoder.layers.11.layer_norm1.weight] |
| Loading weights: 95%|#########5| 380/398 [00:00<00:00, 6021.88it/s, Materializing param=vision_model.encoder.layers.11.layer_norm2.bias] |
| Loading weights: 95%|#########5| 380/398 [00:00<00:00, 6016.77it/s, Materializing param=vision_model.encoder.layers.11.layer_norm2.bias] |
| Loading weights: 96%|#########5| 381/398 [00:00<00:00, 5854.08it/s, Materializing param=vision_model.encoder.layers.11.layer_norm2.weight] |
| Loading weights: 96%|#########5| 381/398 [00:00<00:00, 5844.98it/s, Materializing param=vision_model.encoder.layers.11.layer_norm2.weight] |
| Loading weights: 96%|#########5| 382/398 [00:00<00:00, 5839.52it/s, Materializing param=vision_model.encoder.layers.11.mlp.fc1.bias] |
| Loading weights: 96%|#########5| 382/398 [00:00<00:00, 5832.36it/s, Materializing param=vision_model.encoder.layers.11.mlp.fc1.bias] |
| Loading weights: 96%|#########6| 383/398 [00:00<00:00, 5834.03it/s, Materializing param=vision_model.encoder.layers.11.mlp.fc1.weight] |
| Loading weights: 96%|#########6| 383/398 [00:00<00:00, 5826.82it/s, Materializing param=vision_model.encoder.layers.11.mlp.fc1.weight] |
| Loading weights: 96%|#########6| 384/398 [00:00<00:00, 5829.83it/s, Materializing param=vision_model.encoder.layers.11.mlp.fc2.bias] |
| Loading weights: 96%|#########6| 384/398 [00:00<00:00, 5823.27it/s, Materializing param=vision_model.encoder.layers.11.mlp.fc2.bias] |
| Loading weights: 97%|#########6| 385/398 [00:00<00:00, 5826.58it/s, Materializing param=vision_model.encoder.layers.11.mlp.fc2.weight] |
| Loading weights: 97%|#########6| 385/398 [00:00<00:00, 5820.49it/s, Materializing param=vision_model.encoder.layers.11.mlp.fc2.weight] |
| Loading weights: 97%|#########6| 386/398 [00:00<00:00, 5824.14it/s, Materializing param=vision_model.encoder.layers.11.self_attn.k_proj.bias] |
| Loading weights: 97%|#########6| 386/398 [00:00<00:00, 5817.76it/s, Materializing param=vision_model.encoder.layers.11.self_attn.k_proj.bias] |
| Loading weights: 97%|#########7| 387/398 [00:00<00:00, 5820.14it/s, Materializing param=vision_model.encoder.layers.11.self_attn.k_proj.weight] |
| Loading weights: 97%|#########7| 387/398 [00:00<00:00, 5813.70it/s, Materializing param=vision_model.encoder.layers.11.self_attn.k_proj.weight] |
| Loading weights: 97%|#########7| 388/398 [00:00<00:00, 5817.32it/s, Materializing param=vision_model.encoder.layers.11.self_attn.out_proj.bias] |
| Loading weights: 97%|#########7| 388/398 [00:00<00:00, 5810.99it/s, Materializing param=vision_model.encoder.layers.11.self_attn.out_proj.bias] |
| Loading weights: 98%|#########7| 389/398 [00:00<00:00, 5814.36it/s, Materializing param=vision_model.encoder.layers.11.self_attn.out_proj.weight] |
| Loading weights: 98%|#########7| 389/398 [00:00<00:00, 5807.92it/s, Materializing param=vision_model.encoder.layers.11.self_attn.out_proj.weight] |
| Loading weights: 98%|#########7| 390/398 [00:00<00:00, 5811.08it/s, Materializing param=vision_model.encoder.layers.11.self_attn.q_proj.bias] |
| Loading weights: 98%|#########7| 390/398 [00:00<00:00, 5804.85it/s, Materializing param=vision_model.encoder.layers.11.self_attn.q_proj.bias] |
| Loading weights: 98%|#########8| 391/398 [00:00<00:00, 5808.36it/s, Materializing param=vision_model.encoder.layers.11.self_attn.q_proj.weight] |
| Loading weights: 98%|#########8| 391/398 [00:00<00:00, 5802.13it/s, Materializing param=vision_model.encoder.layers.11.self_attn.q_proj.weight] |
| Loading weights: 98%|#########8| 392/398 [00:00<00:00, 5805.70it/s, Materializing param=vision_model.encoder.layers.11.self_attn.v_proj.bias] |
| Loading weights: 98%|#########8| 392/398 [00:00<00:00, 5799.61it/s, Materializing param=vision_model.encoder.layers.11.self_attn.v_proj.bias] |
| Loading weights: 99%|#########8| 393/398 [00:00<00:00, 5803.33it/s, Materializing param=vision_model.encoder.layers.11.self_attn.v_proj.weight] |
| Loading weights: 99%|#########8| 393/398 [00:00<00:00, 5797.66it/s, Materializing param=vision_model.encoder.layers.11.self_attn.v_proj.weight] |
| Loading weights: 99%|#########8| 394/398 [00:00<00:00, 5801.74it/s, Materializing param=vision_model.post_layernorm.bias] |
| Loading weights: 99%|#########8| 394/398 [00:00<00:00, 5795.86it/s, Materializing param=vision_model.post_layernorm.bias] |
| Loading weights: 99%|#########9| 395/398 [00:00<00:00, 5800.07it/s, Materializing param=vision_model.post_layernorm.weight] |
| Loading weights: 99%|#########9| 395/398 [00:00<00:00, 5794.35it/s, Materializing param=vision_model.post_layernorm.weight] |
| Loading weights: 99%|#########9| 396/398 [00:00<00:00, 5798.80it/s, Materializing param=vision_model.pre_layrnorm.bias] |
| Loading weights: 99%|#########9| 396/398 [00:00<00:00, 5793.00it/s, Materializing param=vision_model.pre_layrnorm.bias] |
| Loading weights: 100%|#########9| 397/398 [00:00<00:00, 5796.67it/s, Materializing param=vision_model.pre_layrnorm.weight] |
| Loading weights: 100%|#########9| 397/398 [00:00<00:00, 5790.52it/s, Materializing param=vision_model.pre_layrnorm.weight] |
| Loading weights: 100%|##########| 398/398 [00:00<00:00, 5795.01it/s, Materializing param=visual_projection.weight] |
| Loading weights: 100%|##########| 398/398 [00:00<00:00, 5789.40it/s, Materializing param=visual_projection.weight] |
| Loading weights: 100%|##########| 398/398 [00:00<00:00, 5777.90it/s, Materializing param=visual_projection.weight] |
| CLIPModel LOAD REPORT from: openai/clip-vit-base-patch32 |
| Key | Status | | |
| -------------------------------------+------------+--+- |
| vision_model.embeddings.position_ids | UNEXPECTED | | |
| text_model.embeddings.position_ids | UNEXPECTED | | |
|
|
| Notes: |
| - UNEXPECTED :can be ignored when loading from different task/architecture; not ok if you expect identical arch. |
| The image processor of type `CLIPImageProcessor` is now loaded as a fast processor by default, even if the model checkpoint was saved with a slow processor. This is a breaking change and may produce slightly different outputs. To continue using the slow processor, instantiate this class with `use_fast=False`. |
| INFO: 127.0.0.1:61992 - "POST /api/text-search HTTP/1.1" 200 OK |
|
|