BakoAI / analysis_optimized.log
Okidi Norbert
Deployment fix: clean backend only
c6abe34
nohup: ignoring input
Retriggering analysis for video 6faca277-99bc-4a28-9a3d-15ca6b871730...
WARNING ⚠️ imgsz=[1080] must be multiple of max stride 32, updating to [1088]
0: 640x1088 1 basketball, 1 hoop, 6 players, 1 referee, 3 shot-clocks, 646.4ms
1: 640x1088 1 basketball, 1 hoop, 6 players, 1 referee, 3 shot-clocks, 646.4ms
2: 640x1088 1 basketball, 1 hoop, 5 players, 1 referee, 3 shot-clocks, 646.4ms
3: 640x1088 1 basketball, 1 hoop, 8 players, 1 referee, 3 shot-clocks, 646.4ms
4: 640x1088 1 basketball, 1 hoop, 6 players, 1 shot-clock, 646.4ms
5: 640x1088 1 basketball, 1 hoop, 6 players, 646.4ms
6: 640x1088 1 basketball, 1 hoop, 5 players, 646.4ms
7: 640x1088 1 hoop, 5 players, 646.4ms
8: 640x1088 1 basketball, 1 hoop, 4 players, 646.4ms
9: 640x1088 1 basketball, 1 hoop, 5 players, 646.4ms
Speed: 49.2ms preprocess, 646.4ms inference, 12.4ms postprocess per image at shape (1, 3, 640, 1088)
WARNING ⚠️ imgsz=[1080] must be multiple of max stride 32, updating to [1088]
0: 640x1088 1 basketball, 1 hoop, 7 players, 217.9ms
1: 640x1088 1 basketball, 1 hoop, 6 players, 217.9ms
2: 640x1088 1 basketball, 1 hoop, 7 players, 217.9ms
3: 640x1088 1 basketball, 1 hoop, 5 players, 217.9ms
4: 640x1088 1 basketball, 1 hoop, 6 players, 217.9ms
5: 640x1088 1 basketball, 1 hoop, 8 players, 217.9ms
6: 640x1088 1 basketball, 1 hoop, 8 players, 1 referee, 217.9ms
7: 640x1088 2 basketballs, 1 hoop, 5 players, 217.9ms
8: 640x1088 2 basketballs, 1 hoop, 5 players, 1 referee, 217.9ms
9: 640x1088 2 basketballs, 1 hoop, 4 players, 2 referees, 217.9ms
Speed: 34.5ms preprocess, 217.9ms inference, 0.8ms postprocess per image at shape (1, 3, 640, 1088)
WARNING ⚠️ imgsz=[1080] must be multiple of max stride 32, updating to [1088]
0: 640x1088 1 hoop, 6 players, 178.4ms
1: 640x1088 1 hoop, 6 players, 178.4ms
2: 640x1088 1 hoop, 7 players, 178.4ms
3: 640x1088 2 basketballs, 1 hoop, 8 players, 178.4ms
4: 640x1088 1 hoop, 7 players, 178.4ms
5: 640x1088 1 basketball, 1 hoop, 7 players, 178.4ms
6: 640x1088 1 basketball, 1 hoop, 7 players, 178.4ms
7: 640x1088 1 basketball, 1 hoop, 6 players, 178.4ms
8: 640x1088 2 basketballs, 1 hoop, 8 players, 178.4ms
9: 640x1088 1 basketball, 1 hoop, 5 players, 178.4ms
Speed: 41.2ms preprocess, 178.4ms inference, 0.6ms postprocess per image at shape (1, 3, 640, 1088)
WARNING ⚠️ imgsz=[1080] must be multiple of max stride 32, updating to [1088]
0: 640x1088 1 basketball, 1 hoop, 6 players, 340.7ms
1: 640x1088 1 basketball, 1 hoop, 12 players, 340.7ms
2: 640x1088 1 basketball, 1 hoop, 8 players, 340.7ms
3: 640x1088 1 basketball, 1 hoop, 10 players, 340.7ms
4: 640x1088 1 basketball, 1 hoop, 9 players, 340.7ms
5: 640x1088 1 basketball, 1 hoop, 6 players, 340.7ms
6: 640x1088 2 basketballs, 1 hoop, 6 players, 340.7ms
7: 640x1088 1 basketball, 1 hoop, 6 players, 340.7ms
8: 640x1088 1 basketball, 2 hoops, 6 players, 340.7ms
9: 640x1088 2 basketballs, 1 hoop, 7 players, 340.7ms
Speed: 65.1ms preprocess, 340.7ms inference, 0.6ms postprocess per image at shape (1, 3, 640, 1088)
WARNING ⚠️ imgsz=[1080] must be multiple of max stride 32, updating to [1088]
0: 640x1088 1 basketball, 1 hoop, 11 players, 227.3ms
1: 640x1088 2 basketballs, 1 hoop, 9 players, 227.3ms
2: 640x1088 2 basketballs, 1 hoop, 8 players, 227.3ms
3: 640x1088 1 basketball, 1 hoop, 9 players, 227.3ms
4: 640x1088 1 basketball, 1 hoop, 10 players, 227.3ms
5: 640x1088 1 basketball, 1 hoop, 10 players, 227.3ms
6: 640x1088 1 basketball, 1 hoop, 10 players, 1 referee, 227.3ms
7: 640x1088 1 basketball, 1 hoop, 6 players, 227.3ms
8: 640x1088 1 basketball, 1 hoop, 7 players, 227.3ms
9: 640x1088 1 basketball, 1 hoop, 4 players, 227.3ms
Speed: 52.8ms preprocess, 227.3ms inference, 1.7ms postprocess per image at shape (1, 3, 640, 1088)
WARNING ⚠️ imgsz=[1080] must be multiple of max stride 32, updating to [1088]
0: 640x1088 1 basketball, 1 hoop, 5 players, 194.5ms
1: 640x1088 1 hoop, 7 players, 194.5ms
2: 640x1088 1 hoop, 7 players, 194.5ms
3: 640x1088 1 basketball, 1 hoop, 8 players, 194.5ms
4: 640x1088 1 basketball, 1 hoop, 6 players, 194.5ms
5: 640x1088 1 basketball, 1 hoop, 9 players, 194.5ms
6: 640x1088 1 basketball, 1 hoop, 7 players, 194.5ms
7: 640x1088 1 basketball, 1 hoop, 6 players, 194.5ms
8: 640x1088 1 basketball, 1 hoop, 6 players, 194.5ms
9: 640x1088 1 basketball, 1 hoop, 5 players, 194.5ms
Speed: 68.5ms preprocess, 194.5ms inference, 1.2ms postprocess per image at shape (1, 3, 640, 1088)
WARNING ⚠️ imgsz=[1080] must be multiple of max stride 32, updating to [1088]
0: 640x1088 1 basketball, 1 hoop, 8 players, 279.7ms
1: 640x1088 2 basketballs, 1 hoop, 8 players, 279.7ms
2: 640x1088 1 basketball, 1 hoop, 5 players, 1 referee, 279.7ms
3: 640x1088 2 basketballs, 1 hoop, 5 players, 1 referee, 279.7ms
4: 640x1088 2 basketballs, 1 hoop, 5 players, 279.7ms
5: 640x1088 1 basketball, 1 hoop, 6 players, 279.7ms
6: 640x1088 1 basketball, 1 hoop, 6 players, 279.7ms
7: 640x1088 1 basketball, 1 hoop, 5 players, 1 referee, 279.7ms
8: 640x1088 1 basketball, 5 players, 1 referee, 279.7ms
9: 640x1088 1 basketball, 1 hoop, 4 players, 1 referee, 279.7ms
Speed: 65.6ms preprocess, 279.7ms inference, 1.4ms postprocess per image at shape (1, 3, 640, 1088)
WARNING ⚠️ imgsz=[1080] must be multiple of max stride 32, updating to [1088]
0: 640x1088 1 basketball, 6 players, 1 referee, 364.0ms
1: 640x1088 1 basketball, 6 players, 1 referee, 364.0ms
2: 640x1088 1 basketball, 4 players, 364.0ms
3: 640x1088 1 basketball, 5 players, 364.0ms
4: 640x1088 2 basketballs, 6 players, 364.0ms
5: 640x1088 1 basketball, 8 players, 364.0ms
6: 640x1088 1 basketball, 7 players, 364.0ms
7: 640x1088 2 basketballs, 8 players, 364.0ms
8: 640x1088 1 basketball, 6 players, 364.0ms
9: 640x1088 1 basketball, 1 hoop, 6 players, 364.0ms
Speed: 44.6ms preprocess, 364.0ms inference, 6.4ms postprocess per image at shape (1, 3, 640, 1088)
WARNING ⚠️ imgsz=[1080] must be multiple of max stride 32, updating to [1088]
0: 640x1088 1 basketball, 1 hoop, 5 players, 179.4ms
1: 640x1088 1 basketball, 6 players, 179.4ms
2: 640x1088 1 basketball, 1 hoop, 6 players, 179.4ms
3: 640x1088 1 basketball, 1 hoop, 8 players, 179.4ms
4: 640x1088 1 basketball, 9 players, 179.4ms
5: 640x1088 2 basketballs, 7 players, 179.4ms
6: 640x1088 1 basketball, 7 players, 179.4ms
7: 640x1088 8 players, 179.4ms
8: 640x1088 1 basketball, 1 hoop, 5 players, 179.4ms
9: 640x1088 1 basketball, 6 players, 179.4ms
Speed: 56.5ms preprocess, 179.4ms inference, 1.3ms postprocess per image at shape (1, 3, 640, 1088)
WARNING ⚠️ imgsz=[1080] must be multiple of max stride 32, updating to [1088]
0: 640x1088 1 basketball, 8 players, 191.1ms
1: 640x1088 2 basketballs, 8 players, 191.1ms
2: 640x1088 1 basketball, 7 players, 191.1ms
3: 640x1088 2 basketballs, 1 hoop, 11 players, 191.1ms
4: 640x1088 1 basketball, 1 hoop, 12 players, 191.1ms
5: 640x1088 1 basketball, 1 hoop, 11 players, 191.1ms
6: 640x1088 1 hoop, 8 players, 191.1ms
7: 640x1088 3 basketballs, 1 hoop, 8 players, 191.1ms
8: 640x1088 1 basketball, 1 hoop, 8 players, 191.1ms
9: 640x1088 1 basketball, 1 hoop, 7 players, 191.1ms
Speed: 74.7ms preprocess, 191.1ms inference, 1.0ms postprocess per image at shape (1, 3, 640, 1088)
WARNING ⚠️ imgsz=[1080] must be multiple of max stride 32, updating to [1088]
0: 640x1088 2 basketballs, 9 players, 193.4ms
1: 640x1088 1 basketball, 6 players, 193.4ms
2: 640x1088 1 basketball, 1 hoop, 5 players, 193.4ms
3: 640x1088 1 basketball, 2 hoops, 5 players, 193.4ms
4: 640x1088 3 basketballs, 2 hoops, 4 players, 193.4ms
5: 640x1088 1 basketball, 1 hoop, 4 players, 193.4ms
6: 640x1088 3 basketballs, 3 hoops, 10 players, 193.4ms
7: 640x1088 1 basketball, 2 hoops, 11 players, 193.4ms
8: 640x1088 1 basketball, 1 hoop, 10 players, 193.4ms
9: 640x1088 1 basketball, 2 hoops, 10 players, 193.4ms
Speed: 114.1ms preprocess, 193.4ms inference, 0.7ms postprocess per image at shape (1, 3, 640, 1088)
WARNING ⚠️ imgsz=[1080] must be multiple of max stride 32, updating to [1088]
0: 640x1088 1 basketball, 8 players, 224.4ms
1: 640x1088 1 basketball, 5 players, 224.4ms
2: 640x1088 1 basketball, 2 hoops, 5 players, 224.4ms
3: 640x1088 2 hoops, 10 players, 224.4ms
4: 640x1088 1 basketball, 1 hoop, 7 players, 224.4ms
5: 640x1088 1 basketball, 6 players, 224.4ms
6: 640x1088 1 basketball, 1 hoop, 7 players, 224.4ms
7: 640x1088 1 basketball, 1 hoop, 5 players, 224.4ms
8: 640x1088 1 basketball, 2 hoops, 8 players, 224.4ms
9: 640x1088 1 basketball, 1 hoop, 5 players, 224.4ms
Speed: 78.7ms preprocess, 224.4ms inference, 1.3ms postprocess per image at shape (1, 3, 640, 1088)
WARNING ⚠️ imgsz=[1080] must be multiple of max stride 32, updating to [1088]
0: 640x1088 2 basketballs, 2 hoops, 8 players, 201.3ms
1: 640x1088 2 basketballs, 2 hoops, 8 players, 201.3ms
2: 640x1088 2 basketballs, 1 hoop, 6 players, 201.3ms
3: 640x1088 3 basketballs, 2 hoops, 5 players, 201.3ms
4: 640x1088 2 basketballs, 1 hoop, 5 players, 201.3ms
5: 640x1088 1 basketball, 2 hoops, 5 players, 201.3ms
6: 640x1088 2 basketballs, 1 hoop, 4 players, 201.3ms
7: 640x1088 1 basketball, 1 hoop, 5 players, 201.3ms
8: 640x1088 1 basketball, 1 hoop, 5 players, 201.3ms
9: 640x1088 2 basketballs, 1 hoop, 5 players, 201.3ms
Speed: 83.5ms preprocess, 201.3ms inference, 0.6ms postprocess per image at shape (1, 3, 640, 1088)
WARNING ⚠️ imgsz=[1080] must be multiple of max stride 32, updating to [1088]
0: 640x1088 2 basketballs, 3 hoops, 3 players, 219.2ms
1: 640x1088 1 basketball, 1 hoop, 3 players, 219.2ms
2: 640x1088 1 basketball, 7 players, 219.2ms
3: 640x1088 1 basketball, 1 hoop, 4 players, 219.2ms
4: 640x1088 1 basketball, 4 hoops, 3 players, 219.2ms
5: 640x1088 1 basketball, 1 hoop, 6 players, 219.2ms
6: 640x1088 1 basketball, 1 hoop, 6 players, 219.2ms
7: 640x1088 1 hoop, 6 players, 219.2ms
8: 640x1088 2 basketballs, 2 hoops, 6 players, 219.2ms
9: 640x1088 1 basketball, 5 players, 219.2ms
Speed: 62.0ms preprocess, 219.2ms inference, 1.9ms postprocess per image at shape (1, 3, 640, 1088)
WARNING ⚠️ imgsz=[1080] must be multiple of max stride 32, updating to [1088]
0: 640x1088 1 basketball, 1 hoop, 6 players, 356.3ms
1: 640x1088 1 basketball, 5 players, 356.3ms
2: 640x1088 1 basketball, 5 players, 356.3ms
3: 640x1088 1 basketball, 4 players, 356.3ms
4: 640x1088 2 basketballs, 5 players, 356.3ms
5: 640x1088 1 basketball, 3 players, 356.3ms
6: 640x1088 1 basketball, 3 players, 356.3ms
7: 640x1088 1 basketball, 5 players, 356.3ms
8: 640x1088 1 basketball, 6 players, 356.3ms
9: 640x1088 1 basketball, 7 players, 356.3ms
Speed: 62.1ms preprocess, 356.3ms inference, 3.0ms postprocess per image at shape (1, 3, 640, 1088)
WARNING ⚠️ imgsz=[1080] must be multiple of max stride 32, updating to [1088]
0: 640x1088 1 basketball, 9 players, 227.4ms
1: 640x1088 1 basketball, 9 players, 227.4ms
2: 640x1088 1 basketball, 1 hoop, 8 players, 227.4ms
3: 640x1088 2 basketballs, 4 players, 227.4ms
4: 640x1088 1 basketball, 1 hoop, 5 players, 227.4ms
5: 640x1088 1 basketball, 2 hoops, 5 players, 227.4ms
6: 640x1088 6 players, 227.4ms
7: 640x1088 1 basketball, 6 players, 227.4ms
8: 640x1088 1 basketball, 8 players, 227.4ms
9: 640x1088 1 basketball, 9 players, 227.4ms
Speed: 77.5ms preprocess, 227.4ms inference, 0.8ms postprocess per image at shape (1, 3, 640, 1088)
WARNING ⚠️ imgsz=[1080] must be multiple of max stride 32, updating to [1088]
0: 640x1088 8 players, 243.5ms
1: 640x1088 8 players, 243.5ms
2: 640x1088 10 players, 243.5ms
3: 640x1088 8 players, 243.5ms
4: 640x1088 10 players, 243.5ms
5: 640x1088 1 basketball, 9 players, 243.5ms
6: 640x1088 5 players, 1 referee, 243.5ms
7: 640x1088 1 basketball, 8 players, 243.5ms
8: 640x1088 9 players, 243.5ms
9: 640x1088 7 players, 243.5ms
Speed: 60.9ms preprocess, 243.5ms inference, 1.5ms postprocess per image at shape (1, 3, 640, 1088)
WARNING ⚠️ imgsz=[1080] must be multiple of max stride 32, updating to [1088]
0: 640x1088 7 players, 294.1ms
1: 640x1088 1 hoop, 7 players, 294.1ms
2: 640x1088 6 players, 294.1ms
3: 640x1088 2 basketballs, 6 players, 294.1ms
4: 640x1088 1 basketball, 7 players, 294.1ms
5: 640x1088 8 players, 294.1ms
6: 640x1088 7 players, 294.1ms
7: 640x1088 9 players, 294.1ms
8: 640x1088 10 players, 294.1ms
9: 640x1088 10 players, 294.1ms
Speed: 64.1ms preprocess, 294.1ms inference, 3.6ms postprocess per image at shape (1, 3, 640, 1088)
WARNING ⚠️ imgsz=[1080] must be multiple of max stride 32, updating to [1088]
0: 640x1088 7 players, 155.6ms
1: 640x1088 1 basketball, 8 players, 155.6ms
2: 640x1088 2 basketballs, 7 players, 155.6ms
3: 640x1088 4 basketballs, 1 hoop, 6 players, 155.6ms
4: 640x1088 1 basketball, 5 players, 155.6ms
5: 640x1088 2 basketballs, 5 players, 155.6ms
6: 640x1088 4 basketballs, 8 players, 155.6ms
7: 640x1088 2 basketballs, 6 players, 155.6ms
8: 640x1088 2 basketballs, 6 players, 1 referee, 155.6ms
9: 640x1088 3 basketballs, 5 players, 1 referee, 155.6ms
Speed: 58.3ms preprocess, 155.6ms inference, 0.6ms postprocess per image at shape (1, 3, 640, 1088)
WARNING ⚠️ imgsz=[1080] must be multiple of max stride 32, updating to [1088]
0: 640x1088 2 basketballs, 5 players, 1 referee, 175.5ms
1: 640x1088 4 basketballs, 6 players, 1 referee, 175.5ms
2: 640x1088 1 basketball, 7 players, 175.5ms
3: 640x1088 1 basketball, 7 players, 1 referee, 175.5ms
4: 640x1088 3 basketballs, 8 players, 1 referee, 175.5ms
5: 640x1088 1 basketball, 6 players, 1 referee, 175.5ms
6: 640x1088 1 basketball, 2 hoops, 6 players, 1 referee, 175.5ms
7: 640x1088 5 basketballs, 1 hoop, 5 players, 175.5ms
8: 640x1088 1 basketball, 6 players, 175.5ms
9: 640x1088 3 basketballs, 1 hoop, 9 players, 1 referee, 175.5ms
Speed: 88.6ms preprocess, 175.5ms inference, 0.6ms postprocess per image at shape (1, 3, 640, 1088)
WARNING ⚠️ imgsz=[1080] must be multiple of max stride 32, updating to [1088]
0: 640x1088 3 basketballs, 1 hoop, 8 players, 1 referee, 174.0ms
1: 640x1088 5 basketballs, 2 hoops, 7 players, 2 referees, 174.0ms
2: 640x1088 5 basketballs, 1 hoop, 12 players, 1 referee, 174.0ms
3: 640x1088 1 basketball, 1 hoop, 12 players, 1 referee, 174.0ms
4: 640x1088 1 basketball, 1 hoop, 11 players, 2 referees, 174.0ms
5: 640x1088 1 basketball, 1 hoop, 8 players, 2 referees, 174.0ms
6: 640x1088 1 basketball, 1 hoop, 10 players, 3 referees, 174.0ms
7: 640x1088 1 basketball, 1 hoop, 9 players, 2 referees, 174.0ms
8: 640x1088 3 basketballs, 1 hoop, 8 players, 2 referees, 1 shot-clock, 174.0ms
9: 640x1088 1 basketball, 1 hoop, 10 players, 1 referee, 2 shot-clocks, 174.0ms
Speed: 62.4ms preprocess, 174.0ms inference, 1.5ms postprocess per image at shape (1, 3, 640, 1088)
WARNING ⚠️ imgsz=[1080] must be multiple of max stride 32, updating to [1088]
0: 640x1088 1 basketball, 1 hoop, 8 players, 1 referee, 1 shot-clock, 161.3ms
1: 640x1088 1 basketball, 1 hoop, 7 players, 1 referee, 1 shot-clock, 161.3ms
2: 640x1088 1 basketball, 1 hoop, 10 players, 1 referee, 1 shot-clock, 161.3ms
3: 640x1088 1 basketball, 1 hoop, 7 players, 1 referee, 1 shot-clock, 161.3ms
4: 640x1088 1 basketball, 1 hoop, 7 players, 1 referee, 1 shot-clock, 161.3ms
5: 640x1088 1 basketball, 1 hoop, 9 players, 1 referee, 1 shot-clock, 161.3ms
6: 640x1088 1 basketball, 1 hoop, 8 players, 2 referees, 1 shot-clock, 161.3ms
7: 640x1088 2 basketballs, 1 hoop, 8 players, 1 referee, 1 shot-clock, 161.3ms
8: 640x1088 1 basketball, 1 hoop, 8 players, 2 referees, 1 shot-clock, 161.3ms
9: 640x1088 1 basketball, 1 hoop, 8 players, 1 referee, 1 shot-clock, 161.3ms
Speed: 66.1ms preprocess, 161.3ms inference, 0.6ms postprocess per image at shape (1, 3, 640, 1088)
WARNING ⚠️ imgsz=[1080] must be multiple of max stride 32, updating to [1088]
0: 640x1088 1 basketball, 1 hoop, 7 players, 1 referee, 1 shot-clock, 233.6ms
1: 640x1088 1 basketball, 2 hoops, 8 players, 1 referee, 1 shot-clock, 233.6ms
2: 640x1088 1 basketball, 1 hoop, 9 players, 1 referee, 1 shot-clock, 233.6ms
3: 640x1088 1 basketball, 1 hoop, 8 players, 1 referee, 1 shot-clock, 233.6ms
4: 640x1088 1 basketball, 1 hoop, 9 players, 1 referee, 1 shot-clock, 233.6ms
5: 640x1088 1 basketball, 1 hoop, 10 players, 1 referee, 1 shot-clock, 233.6ms
6: 640x1088 1 basketball, 1 hoop, 8 players, 1 referee, 1 shot-clock, 233.6ms
7: 640x1088 1 basketball, 1 hoop, 11 players, 1 referee, 1 shot-clock, 233.6ms
8: 640x1088 1 basketball, 1 hoop, 7 players, 1 referee, 1 shot-clock, 233.6ms
9: 640x1088 1 basketball, 1 hoop, 8 players, 1 referee, 1 shot-clock, 233.6ms
Speed: 68.6ms preprocess, 233.6ms inference, 0.7ms postprocess per image at shape (1, 3, 640, 1088)
WARNING ⚠️ imgsz=[1080] must be multiple of max stride 32, updating to [1088]
0: 640x1088 1 basketball, 1 hoop, 5 players, 1 shot-clock, 152.8ms
1: 640x1088 1 basketball, 1 hoop, 7 players, 1 shot-clock, 152.8ms
2: 640x1088 1 basketball, 1 hoop, 6 players, 1 shot-clock, 152.8ms
3: 640x1088 1 basketball, 1 hoop, 6 players, 1 shot-clock, 152.8ms
4: 640x1088 1 basketball, 1 hoop, 7 players, 1 shot-clock, 152.8ms
5: 640x1088 1 basketball, 1 hoop, 5 players, 1 shot-clock, 152.8ms
6: 640x1088 1 basketball, 1 hoop, 5 players, 1 shot-clock, 152.8ms
7: 640x1088 1 basketball, 1 hoop, 5 players, 1 shot-clock, 152.8ms
8: 640x1088 2 basketballs, 1 hoop, 7 players, 1 shot-clock, 152.8ms
9: 640x1088 2 basketballs, 1 hoop, 8 players, 1 referee, 1 shot-clock, 152.8ms
Speed: 77.2ms preprocess, 152.8ms inference, 0.6ms postprocess per image at shape (1, 3, 640, 1088)
WARNING ⚠️ imgsz=[1080] must be multiple of max stride 32, updating to [1088]
0: 640x1088 1 basketball, 1 hoop, 10 players, 1 referee, 1 shot-clock, 179.5ms
1: 640x1088 1 basketball, 1 hoop, 12 players, 1 referee, 1 shot-clock, 179.5ms
2: 640x1088 1 basketball, 1 hoop, 9 players, 1 referee, 1 shot-clock, 179.5ms
3: 640x1088 1 basketball, 1 hoop, 14 players, 1 referee, 1 shot-clock, 179.5ms
4: 640x1088 2 basketballs, 1 hoop, 11 players, 1 referee, 1 shot-clock, 179.5ms
5: 640x1088 1 basketball, 1 hoop, 10 players, 1 referee, 1 shot-clock, 179.5ms
6: 640x1088 1 basketball, 1 hoop, 10 players, 1 referee, 1 shot-clock, 179.5ms
7: 640x1088 1 basketball, 1 hoop, 10 players, 1 referee, 1 shot-clock, 179.5ms
8: 640x1088 1 basketball, 1 hoop, 8 players, 1 referee, 1 shot-clock, 179.5ms
9: 640x1088 1 basketball, 1 hoop, 9 players, 1 referee, 1 shot-clock, 179.5ms
Speed: 64.7ms preprocess, 179.5ms inference, 0.6ms postprocess per image at shape (1, 3, 640, 1088)
WARNING ⚠️ imgsz=[1080] must be multiple of max stride 32, updating to [1088]
0: 640x1088 1 basketball, 1 hoop, 8 players, 1 referee, 1 shot-clock, 166.5ms
1: 640x1088 1 basketball, 1 hoop, 9 players, 2 referees, 1 shot-clock, 166.5ms
2: 640x1088 1 basketball, 1 hoop, 10 players, 2 referees, 1 shot-clock, 166.5ms
3: 640x1088 1 basketball, 1 hoop, 10 players, 2 referees, 1 shot-clock, 166.5ms
4: 640x1088 1 basketball, 1 hoop, 9 players, 1 referee, 1 shot-clock, 166.5ms
5: 640x1088 1 basketball, 1 hoop, 7 players, 1 referee, 1 shot-clock, 166.5ms
6: 640x1088 1 basketball, 1 hoop, 7 players, 1 referee, 1 shot-clock, 166.5ms
7: 640x1088 1 basketball, 1 hoop, 4 players, 1 referee, 1 shot-clock, 166.5ms
8: 640x1088 1 basketball, 1 hoop, 5 players, 1 referee, 1 shot-clock, 166.5ms
9: 640x1088 1 basketball, 1 hoop, 6 players, 1 referee, 1 shot-clock, 166.5ms
Speed: 55.3ms preprocess, 166.5ms inference, 1.4ms postprocess per image at shape (1, 3, 640, 1088)
WARNING ⚠️ imgsz=[1080] must be multiple of max stride 32, updating to [1088]
0: 640x1088 2 basketballs, 1 hoop, 8 players, 1 referee, 1 shot-clock, 220.8ms
1: 640x1088 1 basketball, 1 hoop, 6 players, 1 referee, 1 shot-clock, 220.8ms
2: 640x1088 1 basketball, 1 hoop, 7 players, 1 shot-clock, 220.8ms
3: 640x1088 1 basketball, 1 hoop, 7 players, 1 shot-clock, 220.8ms
4: 640x1088 1 basketball, 1 hoop, 6 players, 1 shot-clock, 220.8ms
5: 640x1088 1 basketball, 1 hoop, 8 players, 1 referee, 1 shot-clock, 220.8ms
6: 640x1088 1 hoop, 6 players, 1 referee, 1 shot-clock, 220.8ms
7: 640x1088 1 hoop, 8 players, 1 referee, 1 shot-clock, 220.8ms
8: 640x1088 1 basketball, 1 hoop, 8 players, 1 referee, 1 shot-clock, 220.8ms
9: 640x1088 1 hoop, 7 players, 1 referee, 1 shot-clock, 220.8ms
Speed: 78.1ms preprocess, 220.8ms inference, 3.6ms postprocess per image at shape (1, 3, 640, 1088)
WARNING ⚠️ imgsz=[1080] must be multiple of max stride 32, updating to [1088]
0: 640x1088 2 basketballs, 1 hoop, 8 players, 1 referee, 1 shot-clock, 171.5ms
1: 640x1088 2 basketballs, 1 hoop, 7 players, 1 referee, 1 shot-clock, 171.5ms
2: 640x1088 2 basketballs, 1 hoop, 5 players, 1 referee, 1 shot-clock, 171.5ms
3: 640x1088 1 basketball, 1 hoop, 5 players, 1 referee, 1 shot-clock, 171.5ms
4: 640x1088 2 basketballs, 1 hoop, 5 players, 1 referee, 1 shot-clock, 171.5ms
5: 640x1088 1 hoop, 6 players, 1 referee, 1 shot-clock, 171.5ms
6: 640x1088 1 hoop, 6 players, 1 referee, 1 shot-clock, 171.5ms
7: 640x1088 1 hoop, 4 players, 1 referee, 1 shot-clock, 171.5ms
8: 640x1088 1 hoop, 5 players, 1 referee, 1 shot-clock, 171.5ms
9: 640x1088 1 hoop, 6 players, 2 referees, 1 shot-clock, 171.5ms
Speed: 81.2ms preprocess, 171.5ms inference, 0.6ms postprocess per image at shape (1, 3, 640, 1088)
WARNING ⚠️ imgsz=[1080] must be multiple of max stride 32, updating to [1088]
0: 640x1088 1 hoop, 6 players, 1 referee, 1 shot-clock, 175.8ms
1: 640x1088 2 basketballs, 1 hoop, 6 players, 1 referee, 1 shot-clock, 175.8ms
2: 640x1088 1 basketball, 1 hoop, 7 players, 2 referees, 1 shot-clock, 175.8ms
3: 640x1088 2 basketballs, 1 hoop, 11 players, 2 referees, 1 shot-clock, 175.8ms
4: 640x1088 1 basketball, 1 hoop, 9 players, 1 referee, 1 shot-clock, 175.8ms
5: 640x1088 1 basketball, 1 hoop, 9 players, 2 referees, 1 shot-clock, 175.8ms
6: 640x1088 1 basketball, 1 hoop, 7 players, 1 referee, 1 shot-clock, 175.8ms
7: 640x1088 1 basketball, 1 hoop, 8 players, 1 referee, 1 shot-clock, 175.8ms
8: 640x1088 1 basketball, 1 hoop, 8 players, 1 referee, 1 shot-clock, 175.8ms
9: 640x1088 1 basketball, 1 hoop, 8 players, 1 referee, 1 shot-clock, 175.8ms
Speed: 74.0ms preprocess, 175.8ms inference, 0.5ms postprocess per image at shape (1, 3, 640, 1088)
WARNING ⚠️ imgsz=[1080] must be multiple of max stride 32, updating to [1088]
0: 640x1088 1 basketball, 1 hoop, 8 players, 1 referee, 1 shot-clock, 163.7ms
1: 640x1088 2 basketballs, 1 hoop, 7 players, 1 referee, 1 shot-clock, 163.7ms
2: 640x1088 1 basketball, 1 hoop, 6 players, 1 referee, 1 shot-clock, 163.7ms
3: 640x1088 1 basketball, 1 hoop, 6 players, 1 referee, 1 shot-clock, 163.7ms
4: 640x1088 1 basketball, 1 hoop, 7 players, 1 referee, 1 shot-clock, 163.7ms
5: 640x1088 1 basketball, 1 hoop, 8 players, 1 referee, 1 shot-clock, 163.7ms
6: 640x1088 1 basketball, 1 hoop, 8 players, 1 referee, 1 shot-clock, 163.7ms
7: 640x1088 1 basketball, 1 hoop, 8 players, 1 referee, 1 shot-clock, 163.7ms
8: 640x1088 1 basketball, 1 hoop, 7 players, 1 referee, 1 shot-clock, 163.7ms
9: 640x1088 1 basketball, 1 hoop, 9 players, 1 referee, 1 shot-clock, 163.7ms
Speed: 58.0ms preprocess, 163.7ms inference, 0.7ms postprocess per image at shape (1, 3, 640, 1088)
WARNING ⚠️ imgsz=[1080] must be multiple of max stride 32, updating to [1088]
0: 640x1088 1 basketball, 1 hoop, 8 players, 1 referee, 1 shot-clock, 250.7ms
1: 640x1088 1 basketball, 1 hoop, 7 players, 1 referee, 1 shot-clock, 250.7ms
Speed: 43.9ms preprocess, 250.7ms inference, 1.2ms postprocess per image at shape (1, 3, 640, 1088)
0: 384x640 1 basketball, 1519.6ms
1: 384x640 1 basketball, 1519.6ms
2: 384x640 (no detections), 1519.6ms
3: 384x640 (no detections), 1519.6ms
4: 384x640 (no detections), 1519.6ms
5: 384x640 (no detections), 1519.6ms
6: 384x640 (no detections), 1519.6ms
7: 384x640 (no detections), 1519.6ms
8: 384x640 (no detections), 1519.6ms
9: 384x640 1 basketball, 1519.6ms
10: 384x640 1 basketball, 1519.6ms
11: 384x640 1 basketball, 1519.6ms
12: 384x640 1 basketball, 1519.6ms
13: 384x640 1 basketball, 1519.6ms
14: 384x640 (no detections), 1519.6ms
15: 384x640 1 basketball, 1519.6ms
16: 384x640 1 basketball, 1519.6ms
17: 384x640 1 basketball, 1519.6ms
18: 384x640 1 basketball, 1519.6ms
19: 384x640 (no detections), 1519.6ms
Speed: 9.0ms preprocess, 1519.6ms inference, 1.8ms postprocess per image at shape (1, 3, 384, 640)
0: 384x640 (no detections), 1299.9ms
1: 384x640 2 basketballs, 1299.9ms
2: 384x640 (no detections), 1299.9ms
3: 384x640 (no detections), 1299.9ms
4: 384x640 (no detections), 1299.9ms
5: 384x640 (no detections), 1299.9ms
6: 384x640 1 basketball, 1299.9ms
7: 384x640 (no detections), 1299.9ms
8: 384x640 (no detections), 1299.9ms
9: 384x640 (no detections), 1299.9ms
10: 384x640 (no detections), 1299.9ms
Speed: 9.3ms preprocess, 1299.9ms inference, 0.3ms postprocess per image at shape (1, 3, 384, 640)
Loading Team Assignment Model (Fashion-CLIP)...
Loading weights: 0%| | 0/398 [00:00<?, ?it/s] Loading weights: 0%| | 1/398 [00:00<00:05, 74.31it/s, Materializing param=logit_scale] Loading weights: 0%| | 1/398 [00:00<00:05, 73.08it/s, Materializing param=logit_scale] Loading weights: 1%| | 2/398 [00:00<00:02, 139.99it/s, Materializing param=text_model.embeddings.position_embedding.weight] Loading weights: 1%| | 2/398 [00:00<00:02, 139.31it/s, Materializing param=text_model.embeddings.position_embedding.weight] Loading weights: 1%| | 3/398 [00:00<00:02, 184.44it/s, Materializing param=text_model.embeddings.token_embedding.weight] Loading weights: 1%| | 3/398 [00:00<00:02, 183.30it/s, Materializing param=text_model.embeddings.token_embedding.weight] Loading weights: 1%| | 4/398 [00:00<00:01, 238.55it/s, Materializing param=text_model.encoder.layers.0.layer_norm1.bias] Loading weights: 1%| | 4/398 [00:00<00:01, 236.50it/s, Materializing param=text_model.encoder.layers.0.layer_norm1.bias] Loading weights: 1%|▏ | 5/398 [00:00<00:01, 251.41it/s, Materializing param=text_model.encoder.layers.0.layer_norm1.weight] Loading weights: 1%|▏ | 5/398 [00:00<00:01, 249.82it/s, Materializing param=text_model.encoder.layers.0.layer_norm1.weight] Loading weights: 2%|▏ | 6/398 [00:00<00:01, 297.23it/s, Materializing param=text_model.encoder.layers.0.layer_norm2.bias] Loading weights: 2%|▏ | 6/398 [00:00<00:01, 295.70it/s, Materializing param=text_model.encoder.layers.0.layer_norm2.bias] Loading weights: 2%|▏ | 7/398 [00:00<00:01, 336.87it/s, Materializing param=text_model.encoder.layers.0.layer_norm2.weight] Loading weights: 2%|▏ | 7/398 [00:00<00:01, 335.00it/s, Materializing param=text_model.encoder.layers.0.layer_norm2.weight] Loading weights: 2%|▏ | 8/398 [00:00<00:01, 353.54it/s, Materializing param=text_model.encoder.layers.0.mlp.fc1.bias] Loading weights: 2%|▏ | 8/398 [00:00<00:01, 311.73it/s, Materializing param=text_model.encoder.layers.0.mlp.fc1.bias] Loading weights: 2%|▏ | 9/398 [00:00<00:01, 346.30it/s, Materializing param=text_model.encoder.layers.0.mlp.fc1.weight] Loading weights: 2%|▏ | 9/398 [00:00<00:01, 327.53it/s, Materializing param=text_model.encoder.layers.0.mlp.fc1.weight] Loading weights: 3%|β–Ž | 10/398 [00:00<00:01, 326.45it/s, Materializing param=text_model.encoder.layers.0.mlp.fc2.bias] Loading weights: 3%|β–Ž | 10/398 [00:00<00:01, 324.48it/s, Materializing param=text_model.encoder.layers.0.mlp.fc2.bias] Loading weights: 3%|β–Ž | 11/398 [00:00<00:01, 353.84it/s, Materializing param=text_model.encoder.layers.0.mlp.fc2.weight] Loading weights: 3%|β–Ž | 11/398 [00:00<00:01, 338.71it/s, Materializing param=text_model.encoder.layers.0.mlp.fc2.weight] Loading weights: 3%|β–Ž | 12/398 [00:00<00:01, 350.36it/s, Materializing param=text_model.encoder.layers.0.self_attn.k_proj.bias] Loading weights: 3%|β–Ž | 12/398 [00:00<00:01, 336.43it/s, Materializing param=text_model.encoder.layers.0.self_attn.k_proj.bias] Loading weights: 3%|β–Ž | 13/398 [00:00<00:01, 362.78it/s, Materializing param=text_model.encoder.layers.0.self_attn.k_proj.weight] Loading weights: 3%|β–Ž | 13/398 [00:00<00:01, 362.08it/s, Materializing param=text_model.encoder.layers.0.self_attn.k_proj.weight] Loading weights: 4%|β–Ž | 14/398 [00:00<00:01, 374.34it/s, Materializing param=text_model.encoder.layers.0.self_attn.out_proj.bias] Loading weights: 4%|β–Ž | 14/398 [00:00<00:01, 373.53it/s, Materializing param=text_model.encoder.layers.0.self_attn.out_proj.bias] Loading weights: 4%|▍ | 15/398 [00:00<00:00, 398.42it/s, Materializing param=text_model.encoder.layers.0.self_attn.out_proj.weight] Loading weights: 4%|▍ | 15/398 [00:00<00:00, 384.70it/s, Materializing param=text_model.encoder.layers.0.self_attn.out_proj.weight] Loading weights: 4%|▍ | 16/398 [00:00<00:00, 391.35it/s, Materializing param=text_model.encoder.layers.0.self_attn.q_proj.bias] Loading weights: 4%|▍ | 16/398 [00:00<00:00, 390.21it/s, Materializing param=text_model.encoder.layers.0.self_attn.q_proj.bias] Loading weights: 4%|▍ | 17/398 [00:00<00:00, 397.89it/s, Materializing param=text_model.encoder.layers.0.self_attn.q_proj.weight] Loading weights: 4%|▍ | 17/398 [00:00<00:00, 385.57it/s, Materializing param=text_model.encoder.layers.0.self_attn.q_proj.weight] Loading weights: 5%|▍ | 18/398 [00:00<00:00, 404.94it/s, Materializing param=text_model.encoder.layers.0.self_attn.v_proj.bias] Loading weights: 5%|▍ | 18/398 [00:00<00:00, 393.48it/s, Materializing param=text_model.encoder.layers.0.self_attn.v_proj.bias] Loading weights: 5%|▍ | 19/398 [00:00<00:00, 413.06it/s, Materializing param=text_model.encoder.layers.0.self_attn.v_proj.weight] Loading weights: 5%|▍ | 19/398 [00:00<00:00, 412.08it/s, Materializing param=text_model.encoder.layers.0.self_attn.v_proj.weight] Loading weights: 5%|β–Œ | 20/398 [00:00<00:00, 418.21it/s, Materializing param=text_model.encoder.layers.1.layer_norm1.bias] Loading weights: 5%|β–Œ | 20/398 [00:00<00:00, 387.83it/s, Materializing param=text_model.encoder.layers.1.layer_norm1.bias] Loading weights: 5%|β–Œ | 21/398 [00:00<00:00, 383.62it/s, Materializing param=text_model.encoder.layers.1.layer_norm1.weight] Loading weights: 5%|β–Œ | 21/398 [00:00<00:00, 382.59it/s, Materializing param=text_model.encoder.layers.1.layer_norm1.weight] Loading weights: 6%|β–Œ | 22/398 [00:00<00:00, 387.81it/s, Materializing param=text_model.encoder.layers.1.layer_norm2.bias] Loading weights: 6%|β–Œ | 22/398 [00:00<00:00, 386.83it/s, Materializing param=text_model.encoder.layers.1.layer_norm2.bias] Loading weights: 6%|β–Œ | 23/398 [00:00<00:00, 376.08it/s, Materializing param=text_model.encoder.layers.1.layer_norm2.weight] Loading weights: 6%|β–Œ | 23/398 [00:00<00:01, 374.94it/s, Materializing param=text_model.encoder.layers.1.layer_norm2.weight] Loading weights: 6%|β–Œ | 24/398 [00:00<00:00, 390.17it/s, Materializing param=text_model.encoder.layers.1.mlp.fc1.bias] Loading weights: 6%|β–Œ | 24/398 [00:00<00:00, 389.49it/s, Materializing param=text_model.encoder.layers.1.mlp.fc1.bias] Loading weights: 6%|β–‹ | 25/398 [00:00<00:00, 385.18it/s, Materializing param=text_model.encoder.layers.1.mlp.fc1.weight] Loading weights: 6%|β–‹ | 25/398 [00:00<00:01, 353.26it/s, Materializing param=text_model.encoder.layers.1.mlp.fc1.weight] Loading weights: 7%|β–‹ | 26/398 [00:00<00:01, 363.59it/s, Materializing param=text_model.encoder.layers.1.mlp.fc2.bias] Loading weights: 7%|β–‹ | 26/398 [00:00<00:01, 356.27it/s, Materializing param=text_model.encoder.layers.1.mlp.fc2.bias] Loading weights: 7%|β–‹ | 27/398 [00:00<00:01, 354.07it/s, Materializing param=text_model.encoder.layers.1.mlp.fc2.weight] Loading weights: 7%|β–‹ | 27/398 [00:00<00:01, 353.55it/s, Materializing param=text_model.encoder.layers.1.mlp.fc2.weight] Loading weights: 7%|β–‹ | 28/398 [00:00<00:01, 365.89it/s, Materializing param=text_model.encoder.layers.1.self_attn.k_proj.bias] Loading weights: 7%|β–‹ | 28/398 [00:00<00:01, 365.54it/s, Materializing param=text_model.encoder.layers.1.self_attn.k_proj.bias] Loading weights: 7%|β–‹ | 29/398 [00:00<00:01, 356.49it/s, Materializing param=text_model.encoder.layers.1.self_attn.k_proj.weight] Loading weights: 7%|β–‹ | 29/398 [00:00<00:01, 356.07it/s, Materializing param=text_model.encoder.layers.1.self_attn.k_proj.weight] Loading weights: 8%|β–Š | 30/398 [00:00<00:01, 366.66it/s, Materializing param=text_model.encoder.layers.1.self_attn.out_proj.bias] Loading weights: 8%|β–Š | 30/398 [00:00<00:01, 365.98it/s, Materializing param=text_model.encoder.layers.1.self_attn.out_proj.bias] Loading weights: 8%|β–Š | 31/398 [00:00<00:00, 370.75it/s, Materializing param=text_model.encoder.layers.1.self_attn.out_proj.weight] Loading weights: 8%|β–Š | 31/398 [00:00<00:00, 370.16it/s, Materializing param=text_model.encoder.layers.1.self_attn.out_proj.weight] Loading weights: 8%|β–Š | 32/398 [00:00<00:00, 380.91it/s, Materializing param=text_model.encoder.layers.1.self_attn.q_proj.bias] Loading weights: 8%|β–Š | 32/398 [00:00<00:00, 369.90it/s, Materializing param=text_model.encoder.layers.1.self_attn.q_proj.bias] Loading weights: 8%|β–Š | 33/398 [00:00<00:00, 378.49it/s, Materializing param=text_model.encoder.layers.1.self_attn.q_proj.weight] Loading weights: 8%|β–Š | 33/398 [00:00<00:00, 377.53it/s, Materializing param=text_model.encoder.layers.1.self_attn.q_proj.weight] Loading weights: 9%|β–Š | 34/398 [00:00<00:00, 376.57it/s, Materializing param=text_model.encoder.layers.1.self_attn.v_proj.bias] Loading weights: 9%|β–Š | 34/398 [00:00<00:01, 355.31it/s, Materializing param=text_model.encoder.layers.1.self_attn.v_proj.bias] Loading weights: 9%|β–‰ | 35/398 [00:00<00:00, 364.80it/s, Materializing param=text_model.encoder.layers.1.self_attn.v_proj.weight] Loading weights: 9%|β–‰ | 35/398 [00:00<00:00, 364.50it/s, Materializing param=text_model.encoder.layers.1.self_attn.v_proj.weight] Loading weights: 9%|β–‰ | 36/398 [00:00<00:00, 372.54it/s, Materializing param=text_model.encoder.layers.2.layer_norm1.bias] Loading weights: 9%|β–‰ | 36/398 [00:00<00:01, 360.79it/s, Materializing param=text_model.encoder.layers.2.layer_norm1.bias] Loading weights: 9%|β–‰ | 37/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.2.layer_norm1.bias] Loading weights: 9%|β–‰ | 37/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.2.layer_norm1.weight] Loading weights: 9%|β–‰ | 37/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.2.layer_norm1.weight] Loading weights: 10%|β–‰ | 38/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.2.layer_norm2.bias] Loading weights: 10%|β–‰ | 38/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.2.layer_norm2.bias] Loading weights: 10%|β–‰ | 39/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.2.layer_norm2.weight] Loading weights: 10%|β–‰ | 39/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.2.layer_norm2.weight] Loading weights: 10%|β–ˆ | 40/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.2.mlp.fc1.bias] Loading weights: 10%|β–ˆ | 40/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.2.mlp.fc1.bias] Loading weights: 10%|β–ˆ | 41/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.2.mlp.fc1.weight] Loading weights: 10%|β–ˆ | 41/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.2.mlp.fc1.weight] Loading weights: 11%|β–ˆ | 42/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.2.mlp.fc2.bias] Loading weights: 11%|β–ˆ | 42/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.2.mlp.fc2.bias] Loading weights: 11%|β–ˆ | 43/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.2.mlp.fc2.weight] Loading weights: 11%|β–ˆ | 43/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.2.mlp.fc2.weight] Loading weights: 11%|β–ˆ | 44/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.2.self_attn.k_proj.bias] Loading weights: 11%|β–ˆ | 44/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.2.self_attn.k_proj.bias] Loading weights: 11%|β–ˆβ– | 45/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.2.self_attn.k_proj.weight] Loading weights: 11%|β–ˆβ– | 45/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.2.self_attn.k_proj.weight] Loading weights: 12%|β–ˆβ– | 46/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.2.self_attn.out_proj.bias] Loading weights: 12%|β–ˆβ– | 46/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.2.self_attn.out_proj.bias] Loading weights: 12%|β–ˆβ– | 47/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.2.self_attn.out_proj.weight] Loading weights: 12%|β–ˆβ– | 47/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.2.self_attn.out_proj.weight] Loading weights: 12%|β–ˆβ– | 48/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.2.self_attn.q_proj.bias] Loading weights: 12%|β–ˆβ– | 48/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.2.self_attn.q_proj.bias] Loading weights: 12%|β–ˆβ– | 49/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.2.self_attn.q_proj.weight] Loading weights: 12%|β–ˆβ– | 49/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.2.self_attn.q_proj.weight] Loading weights: 13%|β–ˆβ–Ž | 50/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.2.self_attn.v_proj.bias] Loading weights: 13%|β–ˆβ–Ž | 50/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.2.self_attn.v_proj.bias] Loading weights: 13%|β–ˆβ–Ž | 51/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.2.self_attn.v_proj.weight] Loading weights: 13%|β–ˆβ–Ž | 51/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.2.self_attn.v_proj.weight] Loading weights: 13%|β–ˆβ–Ž | 52/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.3.layer_norm1.bias] Loading weights: 13%|β–ˆβ–Ž | 52/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.3.layer_norm1.bias] Loading weights: 13%|β–ˆβ–Ž | 53/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.3.layer_norm1.weight] Loading weights: 13%|β–ˆβ–Ž | 53/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.3.layer_norm1.weight] Loading weights: 14%|β–ˆβ–Ž | 54/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.3.layer_norm2.bias] Loading weights: 14%|β–ˆβ–Ž | 54/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.3.layer_norm2.bias] Loading weights: 14%|β–ˆβ– | 55/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.3.layer_norm2.weight] Loading weights: 14%|β–ˆβ– | 55/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.3.layer_norm2.weight] Loading weights: 14%|β–ˆβ– | 56/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.3.mlp.fc1.bias] Loading weights: 14%|β–ˆβ– | 56/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.3.mlp.fc1.bias] Loading weights: 14%|β–ˆβ– | 57/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.3.mlp.fc1.weight] Loading weights: 14%|β–ˆβ– | 57/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.3.mlp.fc1.weight] Loading weights: 15%|β–ˆβ– | 58/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.3.mlp.fc2.bias] Loading weights: 15%|β–ˆβ– | 58/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.3.mlp.fc2.bias] Loading weights: 15%|β–ˆβ– | 59/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.3.mlp.fc2.weight] Loading weights: 15%|β–ˆβ– | 59/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.3.mlp.fc2.weight] Loading weights: 15%|β–ˆβ–Œ | 60/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.3.self_attn.k_proj.bias] Loading weights: 15%|β–ˆβ–Œ | 60/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.3.self_attn.k_proj.bias] Loading weights: 15%|β–ˆβ–Œ | 61/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.3.self_attn.k_proj.weight] Loading weights: 15%|β–ˆβ–Œ | 61/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.3.self_attn.k_proj.weight] Loading weights: 16%|β–ˆβ–Œ | 62/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.3.self_attn.out_proj.bias] Loading weights: 16%|β–ˆβ–Œ | 62/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.3.self_attn.out_proj.bias] Loading weights: 16%|β–ˆβ–Œ | 63/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.3.self_attn.out_proj.weight] Loading weights: 16%|β–ˆβ–Œ | 63/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.3.self_attn.out_proj.weight] Loading weights: 16%|β–ˆβ–Œ | 64/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.3.self_attn.q_proj.bias] Loading weights: 16%|β–ˆβ–Œ | 64/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.3.self_attn.q_proj.bias] Loading weights: 16%|β–ˆβ–‹ | 65/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.3.self_attn.q_proj.weight] Loading weights: 16%|β–ˆβ–‹ | 65/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.3.self_attn.q_proj.weight] Loading weights: 17%|β–ˆβ–‹ | 66/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.3.self_attn.v_proj.bias] Loading weights: 17%|β–ˆβ–‹ | 66/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.3.self_attn.v_proj.bias] Loading weights: 17%|β–ˆβ–‹ | 67/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.3.self_attn.v_proj.weight] Loading weights: 17%|β–ˆβ–‹ | 67/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.3.self_attn.v_proj.weight] Loading weights: 17%|β–ˆβ–‹ | 68/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.4.layer_norm1.bias] Loading weights: 17%|β–ˆβ–‹ | 68/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.4.layer_norm1.bias] Loading weights: 17%|β–ˆβ–‹ | 69/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.4.layer_norm1.weight] Loading weights: 17%|β–ˆβ–‹ | 69/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.4.layer_norm1.weight] Loading weights: 18%|β–ˆβ–Š | 70/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.4.layer_norm2.bias] Loading weights: 18%|β–ˆβ–Š | 70/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.4.layer_norm2.bias] Loading weights: 18%|β–ˆβ–Š | 71/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.4.layer_norm2.weight] Loading weights: 18%|β–ˆβ–Š | 71/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.4.layer_norm2.weight] Loading weights: 18%|β–ˆβ–Š | 72/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.4.mlp.fc1.bias] Loading weights: 18%|β–ˆβ–Š | 72/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.4.mlp.fc1.bias] Loading weights: 18%|β–ˆβ–Š | 73/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.4.mlp.fc1.weight] Loading weights: 18%|β–ˆβ–Š | 73/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.4.mlp.fc1.weight] Loading weights: 19%|β–ˆβ–Š | 74/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.4.mlp.fc2.bias] Loading weights: 19%|β–ˆβ–Š | 74/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.4.mlp.fc2.bias] Loading weights: 19%|β–ˆβ–‰ | 75/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.4.mlp.fc2.weight] Loading weights: 19%|β–ˆβ–‰ | 75/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.4.mlp.fc2.weight] Loading weights: 19%|β–ˆβ–‰ | 76/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.4.self_attn.k_proj.bias] Loading weights: 19%|β–ˆβ–‰ | 76/398 [00:00<00:00, 370.00it/s, Materializing param=text_model.encoder.layers.4.self_attn.k_proj.bias] Loading weights: 19%|β–ˆβ–‰ | 77/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.4.self_attn.k_proj.bias] Loading weights: 19%|β–ˆβ–‰ | 77/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.4.self_attn.k_proj.weight] Loading weights: 19%|β–ˆβ–‰ | 77/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.4.self_attn.k_proj.weight] Loading weights: 20%|β–ˆβ–‰ | 78/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.4.self_attn.out_proj.bias] Loading weights: 20%|β–ˆβ–‰ | 78/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.4.self_attn.out_proj.bias] Loading weights: 20%|β–ˆβ–‰ | 79/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.4.self_attn.out_proj.weight] Loading weights: 20%|β–ˆβ–‰ | 79/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.4.self_attn.out_proj.weight] Loading weights: 20%|β–ˆβ–ˆ | 80/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.4.self_attn.q_proj.bias] Loading weights: 20%|β–ˆβ–ˆ | 80/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.4.self_attn.q_proj.bias] Loading weights: 20%|β–ˆβ–ˆ | 81/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.4.self_attn.q_proj.weight] Loading weights: 20%|β–ˆβ–ˆ | 81/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.4.self_attn.q_proj.weight] Loading weights: 21%|β–ˆβ–ˆ | 82/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.4.self_attn.v_proj.bias] Loading weights: 21%|β–ˆβ–ˆ | 82/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.4.self_attn.v_proj.bias] Loading weights: 21%|β–ˆβ–ˆ | 83/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.4.self_attn.v_proj.weight] Loading weights: 21%|β–ˆβ–ˆ | 83/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.4.self_attn.v_proj.weight] Loading weights: 21%|β–ˆβ–ˆ | 84/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.5.layer_norm1.bias] Loading weights: 21%|β–ˆβ–ˆ | 84/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.5.layer_norm1.bias] Loading weights: 21%|β–ˆβ–ˆβ– | 85/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.5.layer_norm1.weight] Loading weights: 21%|β–ˆβ–ˆβ– | 85/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.5.layer_norm1.weight] Loading weights: 22%|β–ˆβ–ˆβ– | 86/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.5.layer_norm2.bias] Loading weights: 22%|β–ˆβ–ˆβ– | 86/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.5.layer_norm2.bias] Loading weights: 22%|β–ˆβ–ˆβ– | 87/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.5.layer_norm2.weight] Loading weights: 22%|β–ˆβ–ˆβ– | 87/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.5.layer_norm2.weight] Loading weights: 22%|β–ˆβ–ˆβ– | 88/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.5.mlp.fc1.bias] Loading weights: 22%|β–ˆβ–ˆβ– | 88/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.5.mlp.fc1.bias] Loading weights: 22%|β–ˆβ–ˆβ– | 89/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.5.mlp.fc1.weight] Loading weights: 22%|β–ˆβ–ˆβ– | 89/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.5.mlp.fc1.weight] Loading weights: 23%|β–ˆβ–ˆβ–Ž | 90/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.5.mlp.fc2.bias] Loading weights: 23%|β–ˆβ–ˆβ–Ž | 90/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.5.mlp.fc2.bias] Loading weights: 23%|β–ˆβ–ˆβ–Ž | 91/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.5.mlp.fc2.weight] Loading weights: 23%|β–ˆβ–ˆβ–Ž | 91/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.5.mlp.fc2.weight] Loading weights: 23%|β–ˆβ–ˆβ–Ž | 92/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.5.self_attn.k_proj.bias] Loading weights: 23%|β–ˆβ–ˆβ–Ž | 92/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.5.self_attn.k_proj.bias] Loading weights: 23%|β–ˆβ–ˆβ–Ž | 93/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.5.self_attn.k_proj.weight] Loading weights: 23%|β–ˆβ–ˆβ–Ž | 93/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.5.self_attn.k_proj.weight] Loading weights: 24%|β–ˆβ–ˆβ–Ž | 94/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.5.self_attn.out_proj.bias] Loading weights: 24%|β–ˆβ–ˆβ–Ž | 94/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.5.self_attn.out_proj.bias] Loading weights: 24%|β–ˆβ–ˆβ– | 95/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.5.self_attn.out_proj.weight] Loading weights: 24%|β–ˆβ–ˆβ– | 95/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.5.self_attn.out_proj.weight] Loading weights: 24%|β–ˆβ–ˆβ– | 96/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.5.self_attn.q_proj.bias] Loading weights: 24%|β–ˆβ–ˆβ– | 96/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.5.self_attn.q_proj.bias] Loading weights: 24%|β–ˆβ–ˆβ– | 97/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.5.self_attn.q_proj.weight] Loading weights: 24%|β–ˆβ–ˆβ– | 97/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.5.self_attn.q_proj.weight] Loading weights: 25%|β–ˆβ–ˆβ– | 98/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.5.self_attn.v_proj.bias] Loading weights: 25%|β–ˆβ–ˆβ– | 98/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.5.self_attn.v_proj.bias] Loading weights: 25%|β–ˆβ–ˆβ– | 99/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.5.self_attn.v_proj.weight] Loading weights: 25%|β–ˆβ–ˆβ– | 99/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.5.self_attn.v_proj.weight] Loading weights: 25%|β–ˆβ–ˆβ–Œ | 100/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.6.layer_norm1.bias] Loading weights: 25%|β–ˆβ–ˆβ–Œ | 100/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.6.layer_norm1.bias] Loading weights: 25%|β–ˆβ–ˆβ–Œ | 101/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.6.layer_norm1.weight] Loading weights: 25%|β–ˆβ–ˆβ–Œ | 101/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.6.layer_norm1.weight] Loading weights: 26%|β–ˆβ–ˆβ–Œ | 102/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.6.layer_norm2.bias] Loading weights: 26%|β–ˆβ–ˆβ–Œ | 102/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.6.layer_norm2.bias] Loading weights: 26%|β–ˆβ–ˆβ–Œ | 103/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.6.layer_norm2.weight] Loading weights: 26%|β–ˆβ–ˆβ–Œ | 103/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.6.layer_norm2.weight] Loading weights: 26%|β–ˆβ–ˆβ–Œ | 104/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.6.mlp.fc1.bias] Loading weights: 26%|β–ˆβ–ˆβ–Œ | 104/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.6.mlp.fc1.bias] Loading weights: 26%|β–ˆβ–ˆβ–‹ | 105/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.6.mlp.fc1.weight] Loading weights: 26%|β–ˆβ–ˆβ–‹ | 105/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.6.mlp.fc1.weight] Loading weights: 27%|β–ˆβ–ˆβ–‹ | 106/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.6.mlp.fc2.bias] Loading weights: 27%|β–ˆβ–ˆβ–‹ | 106/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.6.mlp.fc2.bias] Loading weights: 27%|β–ˆβ–ˆβ–‹ | 107/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.6.mlp.fc2.weight] Loading weights: 27%|β–ˆβ–ˆβ–‹ | 107/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.6.mlp.fc2.weight] Loading weights: 27%|β–ˆβ–ˆβ–‹ | 108/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.6.self_attn.k_proj.bias] Loading weights: 27%|β–ˆβ–ˆβ–‹ | 108/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.6.self_attn.k_proj.bias] Loading weights: 27%|β–ˆβ–ˆβ–‹ | 109/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.6.self_attn.k_proj.weight] Loading weights: 27%|β–ˆβ–ˆβ–‹ | 109/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.6.self_attn.k_proj.weight] Loading weights: 28%|β–ˆβ–ˆβ–Š | 110/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.6.self_attn.out_proj.bias] Loading weights: 28%|β–ˆβ–ˆβ–Š | 110/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.6.self_attn.out_proj.bias] Loading weights: 28%|β–ˆβ–ˆβ–Š | 111/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.6.self_attn.out_proj.weight] Loading weights: 28%|β–ˆβ–ˆβ–Š | 111/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.6.self_attn.out_proj.weight] Loading weights: 28%|β–ˆβ–ˆβ–Š | 112/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.6.self_attn.q_proj.bias] Loading weights: 28%|β–ˆβ–ˆβ–Š | 112/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.6.self_attn.q_proj.bias] Loading weights: 28%|β–ˆβ–ˆβ–Š | 113/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.6.self_attn.q_proj.weight] Loading weights: 28%|β–ˆβ–ˆβ–Š | 113/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.6.self_attn.q_proj.weight] Loading weights: 29%|β–ˆβ–ˆβ–Š | 114/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.6.self_attn.v_proj.bias] Loading weights: 29%|β–ˆβ–ˆβ–Š | 114/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.6.self_attn.v_proj.bias] Loading weights: 29%|β–ˆβ–ˆβ–‰ | 115/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.6.self_attn.v_proj.weight] Loading weights: 29%|β–ˆβ–ˆβ–‰ | 115/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.6.self_attn.v_proj.weight] Loading weights: 29%|β–ˆβ–ˆβ–‰ | 116/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.7.layer_norm1.bias] Loading weights: 29%|β–ˆβ–ˆβ–‰ | 116/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.7.layer_norm1.bias] Loading weights: 29%|β–ˆβ–ˆβ–‰ | 117/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.7.layer_norm1.weight] Loading weights: 29%|β–ˆβ–ˆβ–‰ | 117/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.7.layer_norm1.weight] Loading weights: 30%|β–ˆβ–ˆβ–‰ | 118/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.7.layer_norm2.bias] Loading weights: 30%|β–ˆβ–ˆβ–‰ | 118/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.7.layer_norm2.bias] Loading weights: 30%|β–ˆβ–ˆβ–‰ | 119/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.7.layer_norm2.weight] Loading weights: 30%|β–ˆβ–ˆβ–‰ | 119/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.7.layer_norm2.weight] Loading weights: 30%|β–ˆβ–ˆβ–ˆ | 120/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.7.mlp.fc1.bias] Loading weights: 30%|β–ˆβ–ˆβ–ˆ | 120/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.7.mlp.fc1.bias] Loading weights: 30%|β–ˆβ–ˆβ–ˆ | 121/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.7.mlp.fc1.weight] Loading weights: 30%|β–ˆβ–ˆβ–ˆ | 121/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.7.mlp.fc1.weight] Loading weights: 31%|β–ˆβ–ˆβ–ˆ | 122/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.7.mlp.fc2.bias] Loading weights: 31%|β–ˆβ–ˆβ–ˆ | 122/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.7.mlp.fc2.bias] Loading weights: 31%|β–ˆβ–ˆβ–ˆ | 123/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.7.mlp.fc2.weight] Loading weights: 31%|β–ˆβ–ˆβ–ˆ | 123/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.7.mlp.fc2.weight] Loading weights: 31%|β–ˆβ–ˆβ–ˆ | 124/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.7.self_attn.k_proj.bias] Loading weights: 31%|β–ˆβ–ˆβ–ˆ | 124/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.7.self_attn.k_proj.bias] Loading weights: 31%|β–ˆβ–ˆβ–ˆβ– | 125/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.7.self_attn.k_proj.weight] Loading weights: 31%|β–ˆβ–ˆβ–ˆβ– | 125/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.7.self_attn.k_proj.weight] Loading weights: 32%|β–ˆβ–ˆβ–ˆβ– | 126/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.7.self_attn.out_proj.bias] Loading weights: 32%|β–ˆβ–ˆβ–ˆβ– | 126/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.7.self_attn.out_proj.bias] Loading weights: 32%|β–ˆβ–ˆβ–ˆβ– | 127/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.7.self_attn.out_proj.weight] Loading weights: 32%|β–ˆβ–ˆβ–ˆβ– | 127/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.7.self_attn.out_proj.weight] Loading weights: 32%|β–ˆβ–ˆβ–ˆβ– | 128/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.7.self_attn.q_proj.bias] Loading weights: 32%|β–ˆβ–ˆβ–ˆβ– | 128/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.7.self_attn.q_proj.bias] Loading weights: 32%|β–ˆβ–ˆβ–ˆβ– | 129/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.7.self_attn.q_proj.weight] Loading weights: 32%|β–ˆβ–ˆβ–ˆβ– | 129/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.7.self_attn.q_proj.weight] Loading weights: 33%|β–ˆβ–ˆβ–ˆβ–Ž | 130/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.7.self_attn.v_proj.bias] Loading weights: 33%|β–ˆβ–ˆβ–ˆβ–Ž | 130/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.7.self_attn.v_proj.bias] Loading weights: 33%|β–ˆβ–ˆβ–ˆβ–Ž | 131/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.7.self_attn.v_proj.weight] Loading weights: 33%|β–ˆβ–ˆβ–ˆβ–Ž | 131/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.7.self_attn.v_proj.weight] Loading weights: 33%|β–ˆβ–ˆβ–ˆβ–Ž | 132/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.8.layer_norm1.bias] Loading weights: 33%|β–ˆβ–ˆβ–ˆβ–Ž | 132/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.8.layer_norm1.bias] Loading weights: 33%|β–ˆβ–ˆβ–ˆβ–Ž | 133/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.8.layer_norm1.weight] Loading weights: 33%|β–ˆβ–ˆβ–ˆβ–Ž | 133/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.8.layer_norm1.weight] Loading weights: 34%|β–ˆβ–ˆβ–ˆβ–Ž | 134/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.8.layer_norm2.bias] Loading weights: 34%|β–ˆβ–ˆβ–ˆβ–Ž | 134/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.8.layer_norm2.bias] Loading weights: 34%|β–ˆβ–ˆβ–ˆβ– | 135/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.8.layer_norm2.weight] Loading weights: 34%|β–ˆβ–ˆβ–ˆβ– | 135/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.8.layer_norm2.weight] Loading weights: 34%|β–ˆβ–ˆβ–ˆβ– | 136/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.8.mlp.fc1.bias] Loading weights: 34%|β–ˆβ–ˆβ–ˆβ– | 136/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.8.mlp.fc1.bias] Loading weights: 34%|β–ˆβ–ˆβ–ˆβ– | 137/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.8.mlp.fc1.weight] Loading weights: 34%|β–ˆβ–ˆβ–ˆβ– | 137/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.8.mlp.fc1.weight] Loading weights: 35%|β–ˆβ–ˆβ–ˆβ– | 138/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.8.mlp.fc2.bias] Loading weights: 35%|β–ˆβ–ˆβ–ˆβ– | 138/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.8.mlp.fc2.bias] Loading weights: 35%|β–ˆβ–ˆβ–ˆβ– | 139/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.8.mlp.fc2.weight] Loading weights: 35%|β–ˆβ–ˆβ–ˆβ– | 139/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.8.mlp.fc2.weight] Loading weights: 35%|β–ˆβ–ˆβ–ˆβ–Œ | 140/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.8.self_attn.k_proj.bias] Loading weights: 35%|β–ˆβ–ˆβ–ˆβ–Œ | 140/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.8.self_attn.k_proj.bias] Loading weights: 35%|β–ˆβ–ˆβ–ˆβ–Œ | 141/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.8.self_attn.k_proj.weight] Loading weights: 35%|β–ˆβ–ˆβ–ˆβ–Œ | 141/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.8.self_attn.k_proj.weight] Loading weights: 36%|β–ˆβ–ˆβ–ˆβ–Œ | 142/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.8.self_attn.out_proj.bias] Loading weights: 36%|β–ˆβ–ˆβ–ˆβ–Œ | 142/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.8.self_attn.out_proj.bias] Loading weights: 36%|β–ˆβ–ˆβ–ˆβ–Œ | 143/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.8.self_attn.out_proj.weight] Loading weights: 36%|β–ˆβ–ˆβ–ˆβ–Œ | 143/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.8.self_attn.out_proj.weight] Loading weights: 36%|β–ˆβ–ˆβ–ˆβ–Œ | 144/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.8.self_attn.q_proj.bias] Loading weights: 36%|β–ˆβ–ˆβ–ˆβ–Œ | 144/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.8.self_attn.q_proj.bias] Loading weights: 36%|β–ˆβ–ˆβ–ˆβ–‹ | 145/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.8.self_attn.q_proj.weight] Loading weights: 36%|β–ˆβ–ˆβ–ˆβ–‹ | 145/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.8.self_attn.q_proj.weight] Loading weights: 37%|β–ˆβ–ˆβ–ˆβ–‹ | 146/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.8.self_attn.v_proj.bias] Loading weights: 37%|β–ˆβ–ˆβ–ˆβ–‹ | 146/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.8.self_attn.v_proj.bias] Loading weights: 37%|β–ˆβ–ˆβ–ˆβ–‹ | 147/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.8.self_attn.v_proj.weight] Loading weights: 37%|β–ˆβ–ˆβ–ˆβ–‹ | 147/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.8.self_attn.v_proj.weight] Loading weights: 37%|β–ˆβ–ˆβ–ˆβ–‹ | 148/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.9.layer_norm1.bias] Loading weights: 37%|β–ˆβ–ˆβ–ˆβ–‹ | 148/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.9.layer_norm1.bias] Loading weights: 37%|β–ˆβ–ˆβ–ˆβ–‹ | 149/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.9.layer_norm1.weight] Loading weights: 37%|β–ˆβ–ˆβ–ˆβ–‹ | 149/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.9.layer_norm1.weight] Loading weights: 38%|β–ˆβ–ˆβ–ˆβ–Š | 150/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.9.layer_norm2.bias] Loading weights: 38%|β–ˆβ–ˆβ–ˆβ–Š | 150/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.9.layer_norm2.bias] Loading weights: 38%|β–ˆβ–ˆβ–ˆβ–Š | 151/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.9.layer_norm2.weight] Loading weights: 38%|β–ˆβ–ˆβ–ˆβ–Š | 151/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.9.layer_norm2.weight] Loading weights: 38%|β–ˆβ–ˆβ–ˆβ–Š | 152/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.9.mlp.fc1.bias] Loading weights: 38%|β–ˆβ–ˆβ–ˆβ–Š | 152/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.9.mlp.fc1.bias] Loading weights: 38%|β–ˆβ–ˆβ–ˆβ–Š | 153/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.9.mlp.fc1.weight] Loading weights: 38%|β–ˆβ–ˆβ–ˆβ–Š | 153/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.9.mlp.fc1.weight] Loading weights: 39%|β–ˆβ–ˆβ–ˆβ–Š | 154/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.9.mlp.fc2.bias] Loading weights: 39%|β–ˆβ–ˆβ–ˆβ–Š | 154/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.9.mlp.fc2.bias] Loading weights: 39%|β–ˆβ–ˆβ–ˆβ–‰ | 155/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.9.mlp.fc2.weight] Loading weights: 39%|β–ˆβ–ˆβ–ˆβ–‰ | 155/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.9.mlp.fc2.weight] Loading weights: 39%|β–ˆβ–ˆβ–ˆβ–‰ | 156/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.9.self_attn.k_proj.bias] Loading weights: 39%|β–ˆβ–ˆβ–ˆβ–‰ | 156/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.9.self_attn.k_proj.bias] Loading weights: 39%|β–ˆβ–ˆβ–ˆβ–‰ | 157/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.9.self_attn.k_proj.weight] Loading weights: 39%|β–ˆβ–ˆβ–ˆβ–‰ | 157/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.9.self_attn.k_proj.weight] Loading weights: 40%|β–ˆβ–ˆβ–ˆβ–‰ | 158/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.9.self_attn.out_proj.bias] Loading weights: 40%|β–ˆβ–ˆβ–ˆβ–‰ | 158/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.9.self_attn.out_proj.bias] Loading weights: 40%|β–ˆβ–ˆβ–ˆβ–‰ | 159/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.9.self_attn.out_proj.weight] Loading weights: 40%|β–ˆβ–ˆβ–ˆβ–‰ | 159/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.9.self_attn.out_proj.weight] Loading weights: 40%|β–ˆβ–ˆβ–ˆβ–ˆ | 160/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.9.self_attn.q_proj.bias] Loading weights: 40%|β–ˆβ–ˆβ–ˆβ–ˆ | 160/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.9.self_attn.q_proj.bias] Loading weights: 40%|β–ˆβ–ˆβ–ˆβ–ˆ | 161/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.9.self_attn.q_proj.weight] Loading weights: 40%|β–ˆβ–ˆβ–ˆβ–ˆ | 161/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.9.self_attn.q_proj.weight] Loading weights: 41%|β–ˆβ–ˆβ–ˆβ–ˆ | 162/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.9.self_attn.v_proj.bias] Loading weights: 41%|β–ˆβ–ˆβ–ˆβ–ˆ | 162/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.9.self_attn.v_proj.bias] Loading weights: 41%|β–ˆβ–ˆβ–ˆβ–ˆ | 163/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.9.self_attn.v_proj.weight] Loading weights: 41%|β–ˆβ–ˆβ–ˆβ–ˆ | 163/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.9.self_attn.v_proj.weight] Loading weights: 41%|β–ˆβ–ˆβ–ˆβ–ˆ | 164/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.10.layer_norm1.bias] Loading weights: 41%|β–ˆβ–ˆβ–ˆβ–ˆ | 164/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.10.layer_norm1.bias] Loading weights: 41%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 165/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.10.layer_norm1.weight] Loading weights: 41%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 165/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.10.layer_norm1.weight] Loading weights: 42%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 166/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.10.layer_norm2.bias] Loading weights: 42%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 166/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.10.layer_norm2.bias] Loading weights: 42%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 167/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.10.layer_norm2.weight] Loading weights: 42%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 167/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.10.layer_norm2.weight] Loading weights: 42%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 168/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.10.mlp.fc1.bias] Loading weights: 42%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 168/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.10.mlp.fc1.bias] Loading weights: 42%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 169/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.10.mlp.fc1.weight] Loading weights: 42%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 169/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.10.mlp.fc1.weight] Loading weights: 43%|β–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 170/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.10.mlp.fc2.bias] Loading weights: 43%|β–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 170/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.10.mlp.fc2.bias] Loading weights: 43%|β–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 171/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.10.mlp.fc2.weight] Loading weights: 43%|β–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 171/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.10.mlp.fc2.weight] Loading weights: 43%|β–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 172/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.10.self_attn.k_proj.bias] Loading weights: 43%|β–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 172/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.10.self_attn.k_proj.bias] Loading weights: 43%|β–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 173/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.10.self_attn.k_proj.weight] Loading weights: 43%|β–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 173/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.10.self_attn.k_proj.weight] Loading weights: 44%|β–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 174/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.10.self_attn.out_proj.bias] Loading weights: 44%|β–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 174/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.10.self_attn.out_proj.bias] Loading weights: 44%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 175/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.10.self_attn.out_proj.weight] Loading weights: 44%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 175/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.10.self_attn.out_proj.weight] Loading weights: 44%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 176/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.10.self_attn.q_proj.bias] Loading weights: 44%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 176/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.10.self_attn.q_proj.bias] Loading weights: 44%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 177/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.10.self_attn.q_proj.weight] Loading weights: 44%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 177/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.10.self_attn.q_proj.weight] Loading weights: 45%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 178/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.10.self_attn.v_proj.bias] Loading weights: 45%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 178/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.10.self_attn.v_proj.bias] Loading weights: 45%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 179/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.10.self_attn.v_proj.weight] Loading weights: 45%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 179/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.10.self_attn.v_proj.weight] Loading weights: 45%|β–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 180/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.11.layer_norm1.bias] Loading weights: 45%|β–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 180/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.11.layer_norm1.bias] Loading weights: 45%|β–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 181/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.11.layer_norm1.weight] Loading weights: 45%|β–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 181/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.11.layer_norm1.weight] Loading weights: 46%|β–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 182/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.11.layer_norm2.bias] Loading weights: 46%|β–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 182/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.11.layer_norm2.bias] Loading weights: 46%|β–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 183/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.11.layer_norm2.weight] Loading weights: 46%|β–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 183/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.11.layer_norm2.weight] Loading weights: 46%|β–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 184/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.11.mlp.fc1.bias] Loading weights: 46%|β–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 184/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.11.mlp.fc1.bias] Loading weights: 46%|β–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 185/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.11.mlp.fc1.weight] Loading weights: 46%|β–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 185/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.11.mlp.fc1.weight] Loading weights: 47%|β–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 186/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.11.mlp.fc2.bias] Loading weights: 47%|β–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 186/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.11.mlp.fc2.bias] Loading weights: 47%|β–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 187/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.11.mlp.fc2.weight] Loading weights: 47%|β–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 187/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.11.mlp.fc2.weight] Loading weights: 47%|β–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 188/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.11.self_attn.k_proj.bias] Loading weights: 47%|β–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 188/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.11.self_attn.k_proj.bias] Loading weights: 47%|β–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 189/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.11.self_attn.k_proj.weight] Loading weights: 47%|β–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 189/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.11.self_attn.k_proj.weight] Loading weights: 48%|β–ˆβ–ˆβ–ˆβ–ˆβ–Š | 190/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.11.self_attn.out_proj.bias] Loading weights: 48%|β–ˆβ–ˆβ–ˆβ–ˆβ–Š | 190/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.11.self_attn.out_proj.bias] Loading weights: 48%|β–ˆβ–ˆβ–ˆβ–ˆβ–Š | 191/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.11.self_attn.out_proj.weight] Loading weights: 48%|β–ˆβ–ˆβ–ˆβ–ˆβ–Š | 191/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.11.self_attn.out_proj.weight] Loading weights: 48%|β–ˆβ–ˆβ–ˆβ–ˆβ–Š | 192/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.11.self_attn.q_proj.bias] Loading weights: 48%|β–ˆβ–ˆβ–ˆβ–ˆβ–Š | 192/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.11.self_attn.q_proj.bias] Loading weights: 48%|β–ˆβ–ˆβ–ˆβ–ˆβ–Š | 193/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.11.self_attn.q_proj.weight] Loading weights: 48%|β–ˆβ–ˆβ–ˆβ–ˆβ–Š | 193/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.11.self_attn.q_proj.weight] Loading weights: 49%|β–ˆβ–ˆβ–ˆβ–ˆβ–Š | 194/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.11.self_attn.v_proj.bias] Loading weights: 49%|β–ˆβ–ˆβ–ˆβ–ˆβ–Š | 194/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.11.self_attn.v_proj.bias] Loading weights: 49%|β–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 195/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.11.self_attn.v_proj.weight] Loading weights: 49%|β–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 195/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.encoder.layers.11.self_attn.v_proj.weight] Loading weights: 49%|β–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 196/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.final_layer_norm.bias] Loading weights: 49%|β–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 196/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.final_layer_norm.bias] Loading weights: 49%|β–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 197/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.final_layer_norm.weight] Loading weights: 49%|β–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 197/398 [00:00<00:00, 381.71it/s, Materializing param=text_model.final_layer_norm.weight] Loading weights: 50%|β–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 198/398 [00:00<00:00, 381.71it/s, Materializing param=text_projection.weight] Loading weights: 50%|β–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 198/398 [00:00<00:00, 381.71it/s, Materializing param=text_projection.weight] Loading weights: 50%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 199/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.embeddings.class_embedding] Loading weights: 50%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 199/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.embeddings.class_embedding] Loading weights: 50%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 200/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.embeddings.patch_embedding.weight] Loading weights: 50%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 200/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.embeddings.patch_embedding.weight] Loading weights: 51%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 201/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.embeddings.position_embedding.weight] Loading weights: 51%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 201/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.embeddings.position_embedding.weight] Loading weights: 51%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 202/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.0.layer_norm1.bias] Loading weights: 51%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 202/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.0.layer_norm1.bias] Loading weights: 51%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 203/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.0.layer_norm1.weight] Loading weights: 51%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 203/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.0.layer_norm1.weight] Loading weights: 51%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 204/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.0.layer_norm2.bias] Loading weights: 51%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 204/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.0.layer_norm2.bias] Loading weights: 52%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 205/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.0.layer_norm2.weight] Loading weights: 52%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 205/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.0.layer_norm2.weight] Loading weights: 52%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 206/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.0.mlp.fc1.bias] Loading weights: 52%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 206/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.0.mlp.fc1.bias] Loading weights: 52%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 207/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.0.mlp.fc1.weight] Loading weights: 52%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 207/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.0.mlp.fc1.weight] Loading weights: 52%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 208/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.0.mlp.fc2.bias] Loading weights: 52%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 208/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.0.mlp.fc2.bias] Loading weights: 53%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 209/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.0.mlp.fc2.weight] Loading weights: 53%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 209/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.0.mlp.fc2.weight] Loading weights: 53%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 210/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.0.self_attn.k_proj.bias] Loading weights: 53%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 210/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.0.self_attn.k_proj.bias] Loading weights: 53%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 211/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.0.self_attn.k_proj.weight] Loading weights: 53%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 211/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.0.self_attn.k_proj.weight] Loading weights: 53%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 212/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.0.self_attn.out_proj.bias] Loading weights: 53%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 212/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.0.self_attn.out_proj.bias] Loading weights: 54%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 213/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.0.self_attn.out_proj.weight] Loading weights: 54%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 213/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.0.self_attn.out_proj.weight] Loading weights: 54%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 214/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.0.self_attn.q_proj.bias] Loading weights: 54%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 214/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.0.self_attn.q_proj.bias] Loading weights: 54%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 215/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.0.self_attn.q_proj.weight] Loading weights: 54%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 215/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.0.self_attn.q_proj.weight] Loading weights: 54%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 216/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.0.self_attn.v_proj.bias] Loading weights: 54%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 216/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.0.self_attn.v_proj.bias] Loading weights: 55%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 217/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.0.self_attn.v_proj.weight] Loading weights: 55%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 217/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.0.self_attn.v_proj.weight] Loading weights: 55%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 218/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.1.layer_norm1.bias] Loading weights: 55%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 218/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.1.layer_norm1.bias] Loading weights: 55%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 219/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.1.layer_norm1.weight] Loading weights: 55%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 219/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.1.layer_norm1.weight] Loading weights: 55%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 220/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.1.layer_norm2.bias] Loading weights: 55%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 220/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.1.layer_norm2.bias] Loading weights: 56%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 221/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.1.layer_norm2.weight] Loading weights: 56%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 221/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.1.layer_norm2.weight] Loading weights: 56%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 222/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.1.mlp.fc1.bias] Loading weights: 56%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 222/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.1.mlp.fc1.bias] Loading weights: 56%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 223/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.1.mlp.fc1.weight] Loading weights: 56%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 223/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.1.mlp.fc1.weight] Loading weights: 56%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 224/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.1.mlp.fc2.bias] Loading weights: 56%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 224/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.1.mlp.fc2.bias] Loading weights: 57%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 225/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.1.mlp.fc2.weight] Loading weights: 57%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 225/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.1.mlp.fc2.weight] Loading weights: 57%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 226/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.1.self_attn.k_proj.bias] Loading weights: 57%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 226/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.1.self_attn.k_proj.bias] Loading weights: 57%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 227/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.1.self_attn.k_proj.weight] Loading weights: 57%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 227/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.1.self_attn.k_proj.weight] Loading weights: 57%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 228/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.1.self_attn.out_proj.bias] Loading weights: 57%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 228/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.1.self_attn.out_proj.bias] Loading weights: 58%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 229/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.1.self_attn.out_proj.weight] Loading weights: 58%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 229/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.1.self_attn.out_proj.weight] Loading weights: 58%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 230/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.1.self_attn.q_proj.bias] Loading weights: 58%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 230/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.1.self_attn.q_proj.bias] Loading weights: 58%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 231/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.1.self_attn.q_proj.weight] Loading weights: 58%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 231/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.1.self_attn.q_proj.weight] Loading weights: 58%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 232/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.1.self_attn.v_proj.bias] Loading weights: 58%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 232/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.1.self_attn.v_proj.bias] Loading weights: 59%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 233/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.1.self_attn.v_proj.weight] Loading weights: 59%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 233/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.1.self_attn.v_proj.weight] Loading weights: 59%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 234/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.2.layer_norm1.bias] Loading weights: 59%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 234/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.2.layer_norm1.bias] Loading weights: 59%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 235/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.2.layer_norm1.weight] Loading weights: 59%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 235/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.2.layer_norm1.weight] Loading weights: 59%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 236/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.2.layer_norm2.bias] Loading weights: 59%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 236/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.2.layer_norm2.bias] Loading weights: 60%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 237/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.2.layer_norm2.weight] Loading weights: 60%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 237/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.2.layer_norm2.weight] Loading weights: 60%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 238/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.2.mlp.fc1.bias] Loading weights: 60%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 238/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.2.mlp.fc1.bias] Loading weights: 60%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 239/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.2.mlp.fc1.weight] Loading weights: 60%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 239/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.2.mlp.fc1.weight] Loading weights: 60%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 240/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.2.mlp.fc2.bias] Loading weights: 60%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 240/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.2.mlp.fc2.bias] Loading weights: 61%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 241/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.2.mlp.fc2.weight] Loading weights: 61%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 241/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.2.mlp.fc2.weight] Loading weights: 61%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 242/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.2.self_attn.k_proj.bias] Loading weights: 61%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 242/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.2.self_attn.k_proj.bias] Loading weights: 61%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 243/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.2.self_attn.k_proj.weight] Loading weights: 61%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 243/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.2.self_attn.k_proj.weight] Loading weights: 61%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 244/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.2.self_attn.out_proj.bias] Loading weights: 61%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 244/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.2.self_attn.out_proj.bias] Loading weights: 62%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 245/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.2.self_attn.out_proj.weight] Loading weights: 62%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 245/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.2.self_attn.out_proj.weight] Loading weights: 62%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 246/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.2.self_attn.q_proj.bias] Loading weights: 62%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 246/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.2.self_attn.q_proj.bias] Loading weights: 62%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 247/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.2.self_attn.q_proj.weight] Loading weights: 62%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 247/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.2.self_attn.q_proj.weight] Loading weights: 62%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 248/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.2.self_attn.v_proj.bias] Loading weights: 62%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 248/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.2.self_attn.v_proj.bias] Loading weights: 63%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 249/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.2.self_attn.v_proj.weight] Loading weights: 63%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 249/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.2.self_attn.v_proj.weight] Loading weights: 63%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 250/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.3.layer_norm1.bias] Loading weights: 63%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 250/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.3.layer_norm1.bias] Loading weights: 63%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 251/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.3.layer_norm1.weight] Loading weights: 63%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 251/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.3.layer_norm1.weight] Loading weights: 63%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 252/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.3.layer_norm2.bias] Loading weights: 63%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 252/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.3.layer_norm2.bias] Loading weights: 64%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 253/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.3.layer_norm2.weight] Loading weights: 64%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 253/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.3.layer_norm2.weight] Loading weights: 64%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 254/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.3.mlp.fc1.bias] Loading weights: 64%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 254/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.3.mlp.fc1.bias] Loading weights: 64%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 255/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.3.mlp.fc1.weight] Loading weights: 64%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 255/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.3.mlp.fc1.weight] Loading weights: 64%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 256/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.3.mlp.fc2.bias] Loading weights: 64%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 256/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.3.mlp.fc2.bias] Loading weights: 65%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 257/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.3.mlp.fc2.weight] Loading weights: 65%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 257/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.3.mlp.fc2.weight] Loading weights: 65%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 258/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.3.self_attn.k_proj.bias] Loading weights: 65%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 258/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.3.self_attn.k_proj.bias] Loading weights: 65%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 259/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.3.self_attn.k_proj.weight] Loading weights: 65%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 259/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.3.self_attn.k_proj.weight] Loading weights: 65%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 260/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.3.self_attn.out_proj.bias] Loading weights: 65%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 260/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.3.self_attn.out_proj.bias] Loading weights: 66%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 261/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.3.self_attn.out_proj.weight] Loading weights: 66%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 261/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.3.self_attn.out_proj.weight] Loading weights: 66%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 262/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.3.self_attn.q_proj.bias] Loading weights: 66%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 262/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.3.self_attn.q_proj.bias] Loading weights: 66%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 263/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.3.self_attn.q_proj.weight] Loading weights: 66%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 263/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.3.self_attn.q_proj.weight] Loading weights: 66%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 264/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.3.self_attn.v_proj.bias] Loading weights: 66%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 264/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.3.self_attn.v_proj.bias] Loading weights: 67%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 265/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.3.self_attn.v_proj.weight] Loading weights: 67%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 265/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.3.self_attn.v_proj.weight] Loading weights: 67%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 266/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.4.layer_norm1.bias] Loading weights: 67%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 266/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.4.layer_norm1.bias] Loading weights: 67%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 267/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.4.layer_norm1.weight] Loading weights: 67%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 267/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.4.layer_norm1.weight] Loading weights: 67%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 268/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.4.layer_norm2.bias] Loading weights: 67%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 268/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.4.layer_norm2.bias] Loading weights: 68%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 269/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.4.layer_norm2.weight] Loading weights: 68%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 269/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.4.layer_norm2.weight] Loading weights: 68%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 270/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.4.mlp.fc1.bias] Loading weights: 68%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 270/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.4.mlp.fc1.bias] Loading weights: 68%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 271/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.4.mlp.fc1.weight] Loading weights: 68%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 271/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.4.mlp.fc1.weight] Loading weights: 68%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 272/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.4.mlp.fc2.bias] Loading weights: 68%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 272/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.4.mlp.fc2.bias] Loading weights: 69%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 273/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.4.mlp.fc2.weight] Loading weights: 69%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 273/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.4.mlp.fc2.weight] Loading weights: 69%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 274/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.4.self_attn.k_proj.bias] Loading weights: 69%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 274/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.4.self_attn.k_proj.bias] Loading weights: 69%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 275/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.4.self_attn.k_proj.weight] Loading weights: 69%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 275/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.4.self_attn.k_proj.weight] Loading weights: 69%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 276/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.4.self_attn.out_proj.bias] Loading weights: 69%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 276/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.4.self_attn.out_proj.bias] Loading weights: 70%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 277/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.4.self_attn.out_proj.weight] Loading weights: 70%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 277/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.4.self_attn.out_proj.weight] Loading weights: 70%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 278/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.4.self_attn.q_proj.bias] Loading weights: 70%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 278/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.4.self_attn.q_proj.bias] Loading weights: 70%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 279/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.4.self_attn.q_proj.weight] Loading weights: 70%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 279/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.4.self_attn.q_proj.weight] Loading weights: 70%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 280/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.4.self_attn.v_proj.bias] Loading weights: 70%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 280/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.4.self_attn.v_proj.bias] Loading weights: 71%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 281/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.4.self_attn.v_proj.weight] Loading weights: 71%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 281/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.4.self_attn.v_proj.weight] Loading weights: 71%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 282/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.5.layer_norm1.bias] Loading weights: 71%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 282/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.5.layer_norm1.bias] Loading weights: 71%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 283/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.5.layer_norm1.weight] Loading weights: 71%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 283/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.5.layer_norm1.weight] Loading weights: 71%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 284/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.5.layer_norm2.bias] Loading weights: 71%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 284/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.5.layer_norm2.bias] Loading weights: 72%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 285/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.5.layer_norm2.weight] Loading weights: 72%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 285/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.5.layer_norm2.weight] Loading weights: 72%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 286/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.5.mlp.fc1.bias] Loading weights: 72%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 286/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.5.mlp.fc1.bias] Loading weights: 72%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 287/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.5.mlp.fc1.weight] Loading weights: 72%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 287/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.5.mlp.fc1.weight] Loading weights: 72%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 288/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.5.mlp.fc2.bias] Loading weights: 72%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 288/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.5.mlp.fc2.bias] Loading weights: 73%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 289/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.5.mlp.fc2.weight] Loading weights: 73%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 289/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.5.mlp.fc2.weight] Loading weights: 73%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 290/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.5.self_attn.k_proj.bias] Loading weights: 73%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 290/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.5.self_attn.k_proj.bias] Loading weights: 73%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 291/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.5.self_attn.k_proj.weight] Loading weights: 73%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 291/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.5.self_attn.k_proj.weight] Loading weights: 73%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 292/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.5.self_attn.out_proj.bias] Loading weights: 73%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 292/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.5.self_attn.out_proj.bias] Loading weights: 74%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 293/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.5.self_attn.out_proj.weight] Loading weights: 74%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 293/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.5.self_attn.out_proj.weight] Loading weights: 74%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 294/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.5.self_attn.q_proj.bias] Loading weights: 74%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 294/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.5.self_attn.q_proj.bias] Loading weights: 74%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 295/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.5.self_attn.q_proj.weight] Loading weights: 74%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 295/398 [00:00<00:00, 381.71it/s, Materializing param=vision_model.encoder.layers.5.self_attn.q_proj.weight] Loading weights: 74%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 296/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.5.self_attn.q_proj.weight] Loading weights: 74%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 296/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.5.self_attn.v_proj.bias] Loading weights: 74%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 296/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.5.self_attn.v_proj.bias] Loading weights: 75%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 297/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.5.self_attn.v_proj.weight] Loading weights: 75%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 297/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.5.self_attn.v_proj.weight] Loading weights: 75%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 298/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.6.layer_norm1.bias] Loading weights: 75%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 298/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.6.layer_norm1.bias] Loading weights: 75%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 299/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.6.layer_norm1.weight] Loading weights: 75%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 299/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.6.layer_norm1.weight] Loading weights: 75%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 300/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.6.layer_norm2.bias] Loading weights: 75%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 300/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.6.layer_norm2.bias] Loading weights: 76%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 301/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.6.layer_norm2.weight] Loading weights: 76%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 301/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.6.layer_norm2.weight] Loading weights: 76%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 302/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.6.mlp.fc1.bias] Loading weights: 76%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 302/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.6.mlp.fc1.bias] Loading weights: 76%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 303/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.6.mlp.fc1.weight] Loading weights: 76%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 303/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.6.mlp.fc1.weight] Loading weights: 76%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 304/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.6.mlp.fc2.bias] Loading weights: 76%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 304/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.6.mlp.fc2.bias] Loading weights: 77%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 305/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.6.mlp.fc2.weight] Loading weights: 77%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 305/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.6.mlp.fc2.weight] Loading weights: 77%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 306/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.6.self_attn.k_proj.bias] Loading weights: 77%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 306/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.6.self_attn.k_proj.bias] Loading weights: 77%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 307/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.6.self_attn.k_proj.weight] Loading weights: 77%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 307/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.6.self_attn.k_proj.weight] Loading weights: 77%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 308/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.6.self_attn.out_proj.bias] Loading weights: 77%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 308/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.6.self_attn.out_proj.bias] Loading weights: 78%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 309/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.6.self_attn.out_proj.weight] Loading weights: 78%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 309/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.6.self_attn.out_proj.weight] Loading weights: 78%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 310/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.6.self_attn.q_proj.bias] Loading weights: 78%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 310/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.6.self_attn.q_proj.bias] Loading weights: 78%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 311/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.6.self_attn.q_proj.weight] Loading weights: 78%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 311/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.6.self_attn.q_proj.weight] Loading weights: 78%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 312/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.6.self_attn.v_proj.bias] Loading weights: 78%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 312/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.6.self_attn.v_proj.bias] Loading weights: 79%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 313/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.6.self_attn.v_proj.weight] Loading weights: 79%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 313/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.6.self_attn.v_proj.weight] Loading weights: 79%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 314/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.7.layer_norm1.bias] Loading weights: 79%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 314/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.7.layer_norm1.bias] Loading weights: 79%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 315/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.7.layer_norm1.weight] Loading weights: 79%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 315/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.7.layer_norm1.weight] Loading weights: 79%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 316/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.7.layer_norm2.bias] Loading weights: 79%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 316/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.7.layer_norm2.bias] Loading weights: 80%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 317/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.7.layer_norm2.weight] Loading weights: 80%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 317/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.7.layer_norm2.weight] Loading weights: 80%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 318/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.7.mlp.fc1.bias] Loading weights: 80%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 318/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.7.mlp.fc1.bias] Loading weights: 80%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 319/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.7.mlp.fc1.weight] Loading weights: 80%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 319/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.7.mlp.fc1.weight] Loading weights: 80%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 320/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.7.mlp.fc2.bias] Loading weights: 80%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 320/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.7.mlp.fc2.bias] Loading weights: 81%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 321/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.7.mlp.fc2.weight] Loading weights: 81%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 321/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.7.mlp.fc2.weight] Loading weights: 81%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 322/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.7.self_attn.k_proj.bias] Loading weights: 81%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 322/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.7.self_attn.k_proj.bias] Loading weights: 81%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 323/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.7.self_attn.k_proj.weight] Loading weights: 81%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 323/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.7.self_attn.k_proj.weight] Loading weights: 81%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 324/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.7.self_attn.out_proj.bias] Loading weights: 81%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 324/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.7.self_attn.out_proj.bias] Loading weights: 82%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 325/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.7.self_attn.out_proj.weight] Loading weights: 82%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 325/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.7.self_attn.out_proj.weight] Loading weights: 82%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 326/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.7.self_attn.q_proj.bias] Loading weights: 82%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 326/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.7.self_attn.q_proj.bias] Loading weights: 82%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 327/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.7.self_attn.q_proj.weight] Loading weights: 82%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 327/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.7.self_attn.q_proj.weight] Loading weights: 82%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 328/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.7.self_attn.v_proj.bias] Loading weights: 82%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 328/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.7.self_attn.v_proj.bias] Loading weights: 83%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 329/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.7.self_attn.v_proj.weight] Loading weights: 83%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 329/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.7.self_attn.v_proj.weight] Loading weights: 83%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 330/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.8.layer_norm1.bias] Loading weights: 83%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 330/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.8.layer_norm1.bias] Loading weights: 83%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 331/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.8.layer_norm1.weight] Loading weights: 83%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 331/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.8.layer_norm1.weight] Loading weights: 83%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 332/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.8.layer_norm2.bias] Loading weights: 83%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 332/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.8.layer_norm2.bias] Loading weights: 84%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 333/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.8.layer_norm2.weight] Loading weights: 84%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 333/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.8.layer_norm2.weight] Loading weights: 84%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 334/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.8.mlp.fc1.bias] Loading weights: 84%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 334/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.8.mlp.fc1.bias] Loading weights: 84%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 335/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.8.mlp.fc1.weight] Loading weights: 84%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 335/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.8.mlp.fc1.weight] Loading weights: 84%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 336/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.8.mlp.fc2.bias] Loading weights: 84%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 336/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.8.mlp.fc2.bias] Loading weights: 85%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 337/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.8.mlp.fc2.weight] Loading weights: 85%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 337/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.8.mlp.fc2.weight] Loading weights: 85%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 338/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.8.self_attn.k_proj.bias] Loading weights: 85%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 338/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.8.self_attn.k_proj.bias] Loading weights: 85%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 339/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.8.self_attn.k_proj.weight] Loading weights: 85%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 339/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.8.self_attn.k_proj.weight] Loading weights: 85%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 340/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.8.self_attn.out_proj.bias] Loading weights: 85%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 340/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.8.self_attn.out_proj.bias] Loading weights: 86%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 341/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.8.self_attn.out_proj.weight] Loading weights: 86%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 341/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.8.self_attn.out_proj.weight] Loading weights: 86%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 342/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.8.self_attn.q_proj.bias] Loading weights: 86%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 342/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.8.self_attn.q_proj.bias] Loading weights: 86%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 343/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.8.self_attn.q_proj.weight] Loading weights: 86%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 343/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.8.self_attn.q_proj.weight] Loading weights: 86%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 344/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.8.self_attn.v_proj.bias] Loading weights: 86%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 344/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.8.self_attn.v_proj.bias] Loading weights: 87%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 345/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.8.self_attn.v_proj.weight] Loading weights: 87%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 345/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.8.self_attn.v_proj.weight] Loading weights: 87%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 346/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.9.layer_norm1.bias] Loading weights: 87%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 346/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.9.layer_norm1.bias] Loading weights: 87%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 347/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.9.layer_norm1.weight] Loading weights: 87%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 347/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.9.layer_norm1.weight] Loading weights: 87%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 348/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.9.layer_norm2.bias] Loading weights: 87%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 348/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.9.layer_norm2.bias] Loading weights: 88%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 349/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.9.layer_norm2.weight] Loading weights: 88%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 349/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.9.layer_norm2.weight] Loading weights: 88%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 350/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.9.mlp.fc1.bias] Loading weights: 88%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 350/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.9.mlp.fc1.bias] Loading weights: 88%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 351/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.9.mlp.fc1.weight] Loading weights: 88%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 351/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.9.mlp.fc1.weight] Loading weights: 88%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 352/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.9.mlp.fc2.bias] Loading weights: 88%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 352/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.9.mlp.fc2.bias] Loading weights: 89%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 353/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.9.mlp.fc2.weight] Loading weights: 89%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 353/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.9.mlp.fc2.weight] Loading weights: 89%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 354/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.9.self_attn.k_proj.bias] Loading weights: 89%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 354/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.9.self_attn.k_proj.bias] Loading weights: 89%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 355/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.9.self_attn.k_proj.weight] Loading weights: 89%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 355/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.9.self_attn.k_proj.weight] Loading weights: 89%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 356/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.9.self_attn.out_proj.bias] Loading weights: 89%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 356/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.9.self_attn.out_proj.bias] Loading weights: 90%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 357/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.9.self_attn.out_proj.weight] Loading weights: 90%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 357/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.9.self_attn.out_proj.weight] Loading weights: 90%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 358/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.9.self_attn.q_proj.bias] Loading weights: 90%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 358/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.9.self_attn.q_proj.bias] Loading weights: 90%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 359/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.9.self_attn.q_proj.weight] Loading weights: 90%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 359/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.9.self_attn.q_proj.weight] Loading weights: 90%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 360/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.9.self_attn.v_proj.bias] Loading weights: 90%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 360/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.9.self_attn.v_proj.bias] Loading weights: 91%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 361/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.9.self_attn.v_proj.weight] Loading weights: 91%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 361/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.9.self_attn.v_proj.weight] Loading weights: 91%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 362/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.10.layer_norm1.bias] Loading weights: 91%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 362/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.10.layer_norm1.bias] Loading weights: 91%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 363/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.10.layer_norm1.weight] Loading weights: 91%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 363/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.10.layer_norm1.weight] Loading weights: 91%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 364/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.10.layer_norm2.bias] Loading weights: 91%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 364/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.10.layer_norm2.bias] Loading weights: 92%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 365/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.10.layer_norm2.weight] Loading weights: 92%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 365/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.10.layer_norm2.weight] Loading weights: 92%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 366/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.10.mlp.fc1.bias] Loading weights: 92%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 366/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.10.mlp.fc1.bias] Loading weights: 92%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 367/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.10.mlp.fc1.weight] Loading weights: 92%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 367/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.10.mlp.fc1.weight] Loading weights: 92%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 368/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.10.mlp.fc2.bias] Loading weights: 92%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 368/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.10.mlp.fc2.bias] Loading weights: 93%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž| 369/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.10.mlp.fc2.weight] Loading weights: 93%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž| 369/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.10.mlp.fc2.weight] Loading weights: 93%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž| 370/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.10.self_attn.k_proj.bias] Loading weights: 93%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž| 370/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.10.self_attn.k_proj.bias] Loading weights: 93%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž| 371/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.10.self_attn.k_proj.weight] Loading weights: 93%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž| 371/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.10.self_attn.k_proj.weight] Loading weights: 93%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž| 372/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.10.self_attn.out_proj.bias] Loading weights: 93%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž| 372/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.10.self_attn.out_proj.bias] Loading weights: 94%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž| 373/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.10.self_attn.out_proj.weight] Loading weights: 94%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž| 373/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.10.self_attn.out_proj.weight] Loading weights: 94%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 374/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.10.self_attn.q_proj.bias] Loading weights: 94%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 374/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.10.self_attn.q_proj.bias] Loading weights: 94%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 375/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.10.self_attn.q_proj.weight] Loading weights: 94%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 375/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.10.self_attn.q_proj.weight] Loading weights: 94%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 376/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.10.self_attn.v_proj.bias] Loading weights: 94%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 376/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.10.self_attn.v_proj.bias] Loading weights: 95%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 377/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.10.self_attn.v_proj.weight] Loading weights: 95%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 377/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.10.self_attn.v_proj.weight] Loading weights: 95%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 378/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.11.layer_norm1.bias] Loading weights: 95%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 378/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.11.layer_norm1.bias] Loading weights: 95%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ| 379/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.11.layer_norm1.weight] Loading weights: 95%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ| 379/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.11.layer_norm1.weight] Loading weights: 95%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ| 380/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.11.layer_norm2.bias] Loading weights: 95%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ| 380/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.11.layer_norm2.bias] Loading weights: 96%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ| 381/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.11.layer_norm2.weight] Loading weights: 96%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ| 381/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.11.layer_norm2.weight] Loading weights: 96%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ| 382/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.11.mlp.fc1.bias] Loading weights: 96%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ| 382/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.11.mlp.fc1.bias] Loading weights: 96%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ| 383/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.11.mlp.fc1.weight] Loading weights: 96%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ| 383/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.11.mlp.fc1.weight] Loading weights: 96%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹| 384/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.11.mlp.fc2.bias] Loading weights: 96%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹| 384/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.11.mlp.fc2.bias] Loading weights: 97%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹| 385/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.11.mlp.fc2.weight] Loading weights: 97%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹| 385/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.11.mlp.fc2.weight] Loading weights: 97%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹| 386/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.11.self_attn.k_proj.bias] Loading weights: 97%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹| 386/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.11.self_attn.k_proj.bias] Loading weights: 97%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹| 387/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.11.self_attn.k_proj.weight] Loading weights: 97%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹| 387/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.11.self_attn.k_proj.weight] Loading weights: 97%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹| 388/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.11.self_attn.out_proj.bias] Loading weights: 97%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹| 388/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.11.self_attn.out_proj.bias] Loading weights: 98%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š| 389/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.11.self_attn.out_proj.weight] Loading weights: 98%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š| 389/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.11.self_attn.out_proj.weight] Loading weights: 98%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š| 390/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.11.self_attn.q_proj.bias] Loading weights: 98%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š| 390/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.11.self_attn.q_proj.bias] Loading weights: 98%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š| 391/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.11.self_attn.q_proj.weight] Loading weights: 98%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š| 391/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.11.self_attn.q_proj.weight] Loading weights: 98%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š| 392/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.11.self_attn.v_proj.bias] Loading weights: 98%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š| 392/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.11.self_attn.v_proj.bias] Loading weights: 99%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š| 393/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.11.self_attn.v_proj.weight] Loading weights: 99%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š| 393/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.encoder.layers.11.self_attn.v_proj.weight] Loading weights: 99%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 394/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.post_layernorm.bias] Loading weights: 99%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 394/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.post_layernorm.bias] Loading weights: 99%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 395/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.post_layernorm.weight] Loading weights: 99%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 395/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.post_layernorm.weight] Loading weights: 99%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 396/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.pre_layrnorm.bias] Loading weights: 99%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 396/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.pre_layrnorm.bias] Loading weights: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 397/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.pre_layrnorm.weight] Loading weights: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 397/398 [00:00<00:00, 1198.78it/s, Materializing param=vision_model.pre_layrnorm.weight] Loading weights: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 398/398 [00:00<00:00, 1198.78it/s, Materializing param=visual_projection.weight] Loading weights: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 398/398 [00:00<00:00, 1198.78it/s, Materializing param=visual_projection.weight] Loading weights: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 398/398 [00:00<00:00, 1149.65it/s, Materializing param=visual_projection.weight]
CLIPModel LOAD REPORT from: patrickjohncyh/fashion-clip
Key | Status | |
-------------------------------------+------------+--+-
vision_model.embeddings.position_ids | UNEXPECTED | |
text_model.embeddings.position_ids | UNEXPECTED | |
Notes:
- UNEXPECTED :can be ignored when loading from different task/architecture; not ok if you expect identical arch.
Warning: You are sending unauthenticated requests to the HF Hub. Please set a HF_TOKEN to enable higher rate limits and faster downloads.
The image processor of type `CLIPImageProcessor` is now loaded as a fast processor by default, even if the model checkpoint was saved with a slow processor. This is a breaking change and may produce slightly different outputs. To continue using the slow processor, instantiate this class with `use_fast=False`.
βœ… Team Assignment Model loaded.
Analysis background task finished.