InternVL2-2B / eval_mm_niah /reasoning-text-test.log
cuierfei's picture
Upload folder using huggingface_hub
b537a0f verified
raw
history blame
316 kB
language_model.model.layers.0 4
language_model.model.layers.1 4
language_model.model.layers.2 4
language_model.model.layers.3 4
language_model.model.layers.4 4
language_model.model.layers.5 4
language_model.model.layers.6 4
language_model.model.layers.7 4
language_model.model.layers.8 4
language_model.model.layers.9 4
language_model.model.layers.10 4
language_model.model.layers.11 4
language_model.model.layers.12 4
language_model.model.layers.13 4
language_model.model.layers.14 4
language_model.model.layers.15 4
language_model.model.layers.16 4
language_model.model.layers.17 4
language_model.model.layers.18 4
language_model.model.layers.19 4
language_model.model.layers.20 4
language_model.model.layers.21 4
language_model.model.layers.22 4
language_model.model.layers.23 4
vision_model.encoder.layers.0 0
vision_model.encoder.layers.1 0
vision_model.encoder.layers.2 0
vision_model.encoder.layers.3 0
vision_model.encoder.layers.4 0
vision_model.encoder.layers.5 0
vision_model.encoder.layers.6 0
vision_model.encoder.layers.7 0
vision_model.encoder.layers.8 0
vision_model.encoder.layers.9 0
vision_model.encoder.layers.10 0
vision_model.encoder.layers.11 0
vision_model.encoder.layers.12 0
vision_model.encoder.layers.13 0
vision_model.encoder.layers.14 0
vision_model.encoder.layers.15 0
vision_model.encoder.layers.16 0
vision_model.encoder.layers.17 0
vision_model.encoder.layers.18 0
vision_model.encoder.layers.19 0
vision_model.encoder.layers.20 0
vision_model.encoder.layers.21 0
vision_model.encoder.layers.22 0
vision_model.encoder.layers.23 0
vision_model.embeddings 0
mlp1 0
language_model.model.tok_embeddings 4
language_model.model.norm 4
language_model.output 4
language_model.model.embed_tokens 4
language_model.lm_head 4
The argument `trust_remote_code` is to be used with Auto classes. It has no effect here and is ignored.
The argument `trust_remote_code` is to be used with Auto classes. It has no effect here and is ignored.
The argument `trust_remote_code` is to be used with Auto classes. It has no effect here and is ignored.
The argument `trust_remote_code` is to be used with Auto classes. It has no effect here and is ignored.
Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
Rank [0] Begin to eval model work_dirs/share_internvl/InternVL2-2B on task reasoning-text-test, devices: {device(type='cuda', index=0), device(type='cuda', index=4)}
Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
Rank [2] Begin to eval model work_dirs/share_internvl/InternVL2-2B on task reasoning-text-test, devices: {device(type='cuda', index=2), device(type='cuda', index=6)}
Rank [1] Begin to eval model work_dirs/share_internvl/InternVL2-2B on task reasoning-text-test, devices: {device(type='cuda', index=1), device(type='cuda', index=5)}
Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
Rank [3] Begin to eval model work_dirs/share_internvl/InternVL2-2B on task reasoning-text-test, devices: {device(type='cuda', index=3), device(type='cuda', index=7)}
Rank 3 len(skip_idx)=0
Rank 1 len(skip_idx)=0
Rank 2 len(skip_idx)=0
Rank 0 len(skip_idx)=0
[2024-08-03 15:13:15] [Rank 3] totoal_tokens=523, outputs='Mark'
[2024-08-03 15:13:15] [Rank 1] totoal_tokens=532, outputs='orange'
[2024-08-03 15:13:15] [Rank 0] totoal_tokens=519, outputs='Tim'
[2024-08-03 15:13:15] [Rank 2] totoal_tokens=623, outputs='A chocolate cake'
[2024-08-03 15:13:15] [Rank 3] totoal_tokens=549, outputs='backyard'
[2024-08-03 15:13:15] [Rank 0] totoal_tokens=627, outputs='Hike'
[2024-08-03 15:13:15] [Rank 1] totoal_tokens=629, outputs='Lucy'
[2024-08-03 15:13:15] [Rank 2] totoal_tokens=638, outputs='Eat breakfast'
[2024-08-03 15:13:15] [Rank 0] totoal_tokens=629, outputs='Running'
[2024-08-03 15:13:15] [Rank 1] totoal_tokens=637, outputs='Editing'
[2024-08-03 15:13:15] [Rank 3] totoal_tokens=651, outputs='Dr. Allen'
[2024-08-03 15:13:15] [Rank 2] totoal_tokens=639, outputs='Walls'
[2024-08-03 15:13:15] [Rank 1] totoal_tokens=663, outputs='Region X'
[2024-08-03 15:13:15] [Rank 0] totoal_tokens=654, outputs='Eat breakfast'
[2024-08-03 15:13:15] [Rank 3] totoal_tokens=654, outputs='Investment C'
[2024-08-03 15:13:15] [Rank 1] totoal_tokens=678, outputs='Bob'
[2024-08-03 15:13:16] [Rank 2] totoal_tokens=644, outputs='Chocolate cake'
[2024-08-03 15:13:16] [Rank 3] totoal_tokens=671, outputs='Running'
[2024-08-03 15:13:16] [Rank 0] totoal_tokens=681, outputs='The sales team'
[2024-08-03 15:13:16] [Rank 1] totoal_tokens=794, outputs='Jupiter'
[2024-08-03 15:13:16] [Rank 2] totoal_tokens=653, outputs='Linda'
[2024-08-03 15:13:16] [Rank 0] totoal_tokens=685, outputs='Region X'
[2024-08-03 15:13:16] [Rank 3] totoal_tokens=758, outputs='Wakes up'
[2024-08-03 15:13:16] [Rank 1] totoal_tokens=822, outputs='dog'
[2024-08-03 15:13:16] [Rank 0] totoal_tokens=757, outputs='orange'
[2024-08-03 15:13:16] [Rank 2] totoal_tokens=670, outputs='Project Alpha'
[2024-08-03 15:13:16] [Rank 3] totoal_tokens=764, outputs='Tim'
[2024-08-03 15:13:16] [Rank 1] totoal_tokens=834, outputs='The main course'
[2024-08-03 15:13:16] [Rank 0] totoal_tokens=776, outputs='orange'
[2024-08-03 15:13:16] [Rank 2] totoal_tokens=671, outputs='Lunch'
[2024-08-03 15:13:16] [Rank 3] totoal_tokens=765, outputs='Store Z'
[2024-08-03 15:13:16] [Rank 1] totoal_tokens=841, outputs='dog'
[2024-08-03 15:13:16] [Rank 2] totoal_tokens=675, outputs='Launch'
[2024-08-03 15:13:16] [Rank 0] totoal_tokens=778, outputs='Chocolate cake'
[2024-08-03 15:13:16] [Rank 3] totoal_tokens=780, outputs='Car A'
[2024-08-03 15:13:16] [Rank 1] totoal_tokens=849, outputs='Gamma'
[2024-08-03 15:13:16] [Rank 2] totoal_tokens=702, outputs='Lunch'
[2024-08-03 15:13:16] [Rank 0] totoal_tokens=821, outputs='Linda'
[2024-08-03 15:13:16] [Rank 1] totoal_tokens=851, outputs='Sue'
[2024-08-03 15:13:16] [Rank 3] totoal_tokens=815, outputs='Impressionist'
[2024-08-03 15:13:16] [Rank 2] totoal_tokens=739, outputs='Sales team'
[2024-08-03 15:13:16] [Rank 0] totoal_tokens=856, outputs='Dr. Smith'
[2024-08-03 15:13:16] [Rank 3] totoal_tokens=823, outputs='dragonfly'
[2024-08-03 15:13:16] [Rank 1] totoal_tokens=878, outputs='Sweat'
[2024-08-03 15:13:16] [Rank 0] totoal_tokens=882, outputs='Tara'
Processing InternVL2-2B_reasoning-text-test.jsonl: 0%| | 0/751 [00:00<?, ?it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 0%| | 1/751 [00:00<10:40, 1.17it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 0%| | 2/751 [00:00<05:19, 2.34it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 1%| | 4/751 [00:01<02:57, 4.20it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 1%| | 5/751 [00:01<02:35, 4.81it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 1%| | 6/751 [00:01<02:13, 5.58it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 1%| | 8/751 [00:01<01:46, 6.97it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 1%| | 9/751 [00:01<01:45, 7.02it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 1%|▏ | 10/751 [00:01<01:40, 7.34it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 1%|▏ | 11/751 [00:02<01:41, 7.32it/s] Processing InternVL2-2B_reasoning-t[2024-08-03 15:13:16] [Rank 3] totoal_tokens=835, outputs='Initiative Alpha'
[2024-08-03 15:13:16] [Rank 2] totoal_tokens=797, outputs='at the end of the day'
[2024-08-03 15:13:16] [Rank 1] totoal_tokens=879, outputs='Funicular'
[2024-08-03 15:13:17] [Rank 0] totoal_tokens=885, outputs='Walls'
[2024-08-03 15:13:17] [Rank 3] totoal_tokens=836, outputs='Initiative Alpha'
[2024-08-03 15:13:17] [Rank 1] totoal_tokens=883, outputs='Charlie'
[2024-08-03 15:13:17] [Rank 2] totoal_tokens=802, outputs='Dr. Carter'
[2024-08-03 15:13:17] [Rank 0] totoal_tokens=885, outputs='Walls'
[2024-08-03 15:13:17] [Rank 1] totoal_tokens=897, outputs='Region C'
[2024-08-03 15:13:17] [Rank 3] totoal_tokens=839, outputs='Painting the model'
[2024-08-03 15:13:17] [Rank 0] totoal_tokens=913, outputs='Company Z'
[2024-08-03 15:13:17] [Rank 2] totoal_tokens=809, outputs='Diet Plan C'
[2024-08-03 15:13:17] [Rank 3] totoal_tokens=850, outputs='Sales team'
[2024-08-03 15:13:17] [Rank 0] totoal_tokens=914, outputs='Maple'
[2024-08-03 15:13:17] [Rank 2] totoal_tokens=812, outputs='Backyard'
[2024-08-03 15:13:17] [Rank 1] totoal_tokens=898, outputs='At the end of the day'
[2024-08-03 15:13:17] [Rank 0] totoal_tokens=916, outputs='red'
[2024-08-03 15:13:17] [Rank 3] totoal_tokens=882, outputs='Lucas'
[2024-08-03 15:13:17] [Rank 2] totoal_tokens=832, outputs='Project Alpha'
[2024-08-03 15:13:17] [Rank 1] totoal_tokens=898, outputs='Chocolate cake'
[2024-08-03 15:13:17] [Rank 3] totoal_tokens=885, outputs='The main course'
[2024-08-03 15:13:17] [Rank 2] totoal_tokens=843, outputs='under the magazines'
[2024-08-03 15:13:17] [Rank 0] totoal_tokens=930, outputs='The product is launched to the public'
[2024-08-03 15:13:17] [Rank 3] totoal_tokens=887, outputs='Hike'
[2024-08-03 15:13:17] [Rank 1] totoal_tokens=903, outputs='Fireworks display'
[2024-08-03 15:13:17] [Rank 2] totoal_tokens=847, outputs='Frank'
[2024-08-03 15:13:17] [Rank 3] totoal_tokens=893, outputs='Tara'
[2024-08-03 15:13:17] [Rank 1] totoal_tokens=905, outputs='Fireworks display'
[2024-08-03 15:13:17] [Rank 0] totoal_tokens=944, outputs='Ursula von der'
[2024-08-03 15:13:17] [Rank 2] totoal_tokens=880, outputs='Painting the model'
[2024-08-03 15:13:17] [Rank 3] totoal_tokens=897, outputs='Dr. Smith'
[2024-08-03 15:13:17] [Rank 1] totoal_tokens=926, outputs='daisy'
[2024-08-03 15:13:18] [Rank 0] totoal_tokens=951, outputs='Linda'
[2024-08-03 15:13:18] [Rank 3] totoal_tokens=901, outputs='Frank'
[2024-08-03 15:13:18] [Rank 2] totoal_tokens=886, outputs='Cara'
[2024-08-03 15:13:18] [Rank 1] totoal_tokens=928, outputs='Bob'
[2024-08-03 15:13:18] [Rank 3] totoal_tokens=904, outputs='Red'
[2024-08-03 15:13:18] [Rank 0] totoal_tokens=953, outputs="Linda's app"
[2024-08-03 15:13:18] [Rank 2] totoal_tokens=890, outputs='Dr. Carter'
[2024-08-03 15:13:18] [Rank 3] totoal_tokens=914, outputs='Company Y'
[2024-08-03 15:13:18] [Rank 1] totoal_tokens=932, outputs='At the end of the day'
[2024-08-03 15:13:18] [Rank 2] totoal_tokens=895, outputs='Bob'
[2024-08-03 15:13:18] [Rank 0] totoal_tokens=983, outputs='Maple'
[2024-08-03 15:13:18] [Rank 3] totoal_tokens=944, outputs='Tara'
[2024-08-03 15:13:18] [Rank 2] totoal_tokens=897, outputs='The main course'
[2024-08-03 15:13:18] [Rank 1] totoal_tokens=938, outputs='Painting the model'
[2024-08-03 15:13:18] [Rank 0] totoal_tokens=1016, outputs='Adaptive Grid Snap'
ext-test.jsonl: 2%|▏ | 12/751 [00:02<01:38, 7.54it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 2%|▏ | 13/751 [00:02<01:34, 7.80it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 2%|▏ | 14/751 [00:02<01:31, 8.01it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 2%|▏ | 15/751 [00:02<01:30, 8.16it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 2%|▏ | 16/751 [00:02<01:27, 8.40it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 2%|▏ | 18/751 [00:03<01:45, 6.95it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 3%|β–Ž | 19/751 [00:03<01:56, 6.26it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 3%|β–Ž | 20/751 [00:03<01:49, 6.65it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 3%|β–Ž | 21/751 [00:03<01:54, 6.38it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 3%|β–Ž | 22/751 [00:03<01:46, 6.84it/s] Processing InternVL2-2B_reasoning-text-[2024-08-03 15:13:18] [Rank 3] totoal_tokens=950, outputs='Tulip'
[2024-08-03 15:13:18] [Rank 2] totoal_tokens=906, outputs='Alpha'
[2024-08-03 15:13:18] [Rank 1] totoal_tokens=939, outputs='Spider plant'
[2024-08-03 15:13:18] [Rank 0] totoal_tokens=1021, outputs='Region Y'
[2024-08-03 15:13:18] [Rank 3] totoal_tokens=953, outputs='Product B'
[2024-08-03 15:13:18] [Rank 2] totoal_tokens=907, outputs='Dr. Smith'
[2024-08-03 15:13:18] [Rank 3] totoal_tokens=953, outputs='owl'
[2024-08-03 15:13:18] [Rank 1] totoal_tokens=943, outputs='Painting the model'
[2024-08-03 15:13:18] [Rank 0] totoal_tokens=1034, outputs='Pepinster'
[2024-08-03 15:13:18] [Rank 2] totoal_tokens=914, outputs='Spider plant'
[2024-08-03 15:13:18] [Rank 3] totoal_tokens=958, outputs='oak'
[2024-08-03 15:13:18] [Rank 1] totoal_tokens=944, outputs='orange'
[2024-08-03 15:13:18] [Rank 0] totoal_tokens=1040, outputs='dragonfly'
[2024-08-03 15:13:18] [Rank 2] totoal_tokens=921, outputs='Rome'
[2024-08-03 15:13:18] [Rank 3] totoal_tokens=975, outputs='Brush his teeth'
[2024-08-03 15:13:18] [Rank 1] totoal_tokens=961, outputs='Store Z'
[2024-08-03 15:13:19] [Rank 2] totoal_tokens=935, outputs='Store B'
[2024-08-03 15:13:19] [Rank 3] totoal_tokens=983, outputs='Maple'
[2024-08-03 15:13:19] [Rank 1] totoal_tokens=973, outputs='Get bus'
[2024-08-03 15:13:19] [Rank 0] totoal_tokens=1041, outputs='At the end of the day'
[2024-08-03 15:13:19] [Rank 3] totoal_tokens=994, outputs='Company Z'
[2024-08-03 15:13:19] [Rank 2] totoal_tokens=949, outputs='Fireworks display'
[2024-08-03 15:13:19] [Rank 1] totoal_tokens=999, outputs='Walls'
[2024-08-03 15:13:19] [Rank 0] totoal_tokens=1042, outputs='daisy'
[2024-08-03 15:13:19] [Rank 2] totoal_tokens=958, outputs='backyard'
[2024-08-03 15:13:19] [Rank 3] totoal_tokens=998, outputs='Brush his teeth'
[2024-08-03 15:13:19] [Rank 0] totoal_tokens=1046, outputs='Walls'
[2024-08-03 15:13:19] [Rank 2] totoal_tokens=978, outputs='Starter'
[2024-08-03 15:13:19] [Rank 3] totoal_tokens=1007, outputs='Dessert'
[2024-08-03 15:13:19] [Rank 0] totoal_tokens=1047, outputs='Cara'
[2024-08-03 15:13:19] [Rank 2] totoal_tokens=985, outputs='Product A'
[2024-08-03 15:13:19] [Rank 0] totoal_tokens=1050, outputs='Cara'
[2024-08-03 15:13:19] [Rank 3] totoal_tokens=1016, outputs='Under the magazines'
[2024-08-03 15:13:19] [Rank 2] totoal_tokens=1002, outputs='Madrid'
[2024-08-03 15:13:19] [Rank 1] totoal_tokens=1031, outputs='owl'
[2024-08-03 15:13:19] [Rank 0] totoal_tokens=1060, outputs='The backyard'
[2024-08-03 15:13:19] [Rank 3] totoal_tokens=1044, outputs='Dessert'
[2024-08-03 15:13:19] [Rank 2] totoal_tokens=1015, outputs='visit the museum'
[2024-08-03 15:13:19] [Rank 1] totoal_tokens=1033, outputs='Lucy'
[2024-08-03 15:13:19] [Rank 0] totoal_tokens=1062, outputs='John'
test.jsonl: 3%|β–Ž | 23/751 [00:03<01:54, 6.34it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 3%|β–Ž | 24/751 [00:03<01:46, 6.82it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 3%|β–Ž | 25/751 [00:04<01:51, 6.53it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 3%|β–Ž | 26/751 [00:04<01:46, 6.83it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 4%|β–Ž | 27/751 [00:04<02:01, 5.94it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 4%|β–Ž | 28/751 [00:04<01:50, 6.56it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 4%|▍ | 29/751 [00:04<01:45, 6.87it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 4%|▍ | 30/751 [00:04<01:40, 7.15it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 4%|▍ | 31/751 [00:04<01:36, 7.47it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 4%|▍ | 32/751 [00:05<01:37, 7.37it/s] Processing InternVL2-2B_reasoning-text-test[2024-08-03 15:13:19] [Rank 3] totoal_tokens=1045, outputs='Get the bus'
[2024-08-03 15:13:19] [Rank 1] totoal_tokens=1036, outputs='Alpha'
[2024-08-03 15:13:20] [Rank 0] totoal_tokens=1080, outputs='Project Alpha'
[2024-08-03 15:13:20] [Rank 2] totoal_tokens=1039, outputs='Tara'
[2024-08-03 15:13:20] [Rank 1] totoal_tokens=1045, outputs='pine'
[2024-08-03 15:13:20] [Rank 3] totoal_tokens=1057, outputs='Dr. Lee'
[2024-08-03 15:13:20] [Rank 0] totoal_tokens=1080, outputs='Store B'
[2024-08-03 15:13:20] [Rank 1] totoal_tokens=1076, outputs='red'
[2024-08-03 15:13:20] [Rank 3] totoal_tokens=1072, outputs='Bob'
[2024-08-03 15:13:20] [Rank 2] totoal_tokens=1044, outputs='Dr. Johnson'
[2024-08-03 15:13:20] [Rank 0] totoal_tokens=1086, outputs='Eats breakfast'
[2024-08-03 15:13:20] [Rank 3] totoal_tokens=1076, outputs='Alice'
[2024-08-03 15:13:20] [Rank 2] totoal_tokens=1047, outputs='Cara'
[2024-08-03 15:13:20] [Rank 1] totoal_tokens=1077, outputs='The product is launched to the public'
[2024-08-03 15:13:20] [Rank 3] totoal_tokens=1086, outputs='Jones'
[2024-08-03 15:13:20] [Rank 0] totoal_tokens=1086, outputs='Frank'
[2024-08-03 15:13:20] [Rank 2] totoal_tokens=1052, outputs='Red'
[2024-08-03 15:13:20] [Rank 1] totoal_tokens=1086, outputs='Y'
[2024-08-03 15:13:20] [Rank 3] totoal_tokens=1094, outputs='Liam'
[2024-08-03 15:13:20] [Rank 2] totoal_tokens=1052, outputs='Fireworks display'
[2024-08-03 15:13:20] [Rank 0] totoal_tokens=1094, outputs='brushes his teeth'
[2024-08-03 15:13:20] [Rank 1] totoal_tokens=1088, outputs='Impressionist'
[2024-08-03 15:13:20] [Rank 3] totoal_tokens=1097, outputs='daisy'
[2024-08-03 15:13:20] [Rank 2] totoal_tokens=1052, outputs='Maple tree'
[2024-08-03 15:13:20] [Rank 1] totoal_tokens=1088, outputs='Gamma'
[2024-08-03 15:13:20] [Rank 0] totoal_tokens=1094, outputs='Watched the movie'
[2024-08-03 15:13:20] [Rank 2] totoal_tokens=1077, outputs='Catch the bus'
[2024-08-03 15:13:21] [Rank 2] totoal_tokens=1080, outputs='Editing'
[2024-08-03 15:13:21] [Rank 3] totoal_tokens=1109, outputs='Diet Plan C'
[2024-08-03 15:13:21] [Rank 2] totoal_tokens=1084, outputs='Frank'
[2024-08-03 15:13:21] [Rank 1] totoal_tokens=1092, outputs='The wings are assembled first.'
[2024-08-03 15:13:21] [Rank 2] totoal_tokens=1130, outputs='dragonfly'
[2024-08-03 15:13:21] [Rank 3] totoal_tokens=1134, outputs='A chocolate cake'
[2024-08-03 15:13:21] [Rank 0] totoal_tokens=1098, outputs='Tim'
[2024-08-03 15:13:21] [Rank 1] totoal_tokens=1096, outputs='Watched the movie'
[2024-08-03 15:13:21] [Rank 3] totoal_tokens=1140, outputs='Mia'
[2024-08-03 15:13:21] [Rank 2] totoal_tokens=1136, outputs='Watched the movie'
[2024-08-03 15:13:21] [Rank 0] totoal_tokens=1099, outputs='The product is launched to the public'
[2024-08-03 15:13:21] [Rank 3] totoal_tokens=1144, outputs='Dessert'
[2024-08-03 15:13:21] [Rank 2] totoal_tokens=1141, outputs='Store Z'
[2024-08-03 15:13:21] [Rank 0] totoal_tokens=1129, outputs='Daniel Arnold'
[2024-08-03 15:13:21] [Rank 2] totoal_tokens=1143, outputs='Editing'
[2024-08-03 15:13:21] [Rank 2] totoal_tokens=1149, outputs='orange'
[2024-08-03 15:13:21] [Rank 0] totoal_tokens=1141, outputs='pine tree'
.jsonl: 4%|▍ | 33/751 [00:05<01:30, 7.97it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 5%|▍ | 34/751 [00:05<01:28, 8.12it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 5%|▍ | 35/751 [00:05<01:28, 8.14it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 5%|▍ | 36/751 [00:05<01:35, 7.48it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 5%|▍ | 37/751 [00:05<01:39, 7.19it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 5%|β–Œ | 38/751 [00:05<01:49, 6.53it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 5%|β–Œ | 39/751 [00:06<01:54, 6.23it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 5%|β–Œ | 40/751 [00:06<03:04, 3.85it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 5%|β–Œ | 41/751 [00:06<03:03, 3.87it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 6%|β–Œ | 42/751 [00:06<02:36, 4.53it/s] Processing InternVL2-2B_reasoning-text-test.jso[2024-08-03 15:13:21] [Rank 1] totoal_tokens=1100, outputs='At the end of the day'
[2024-08-03 15:13:21] [Rank 2] totoal_tokens=1151, outputs='Noah'
[2024-08-03 15:13:21] [Rank 0] totoal_tokens=1144, outputs='Company Y'
[2024-08-03 15:13:21] [Rank 3] totoal_tokens=1163, outputs='Decorations and plants are added to enhance the ambiance.'
[2024-08-03 15:13:22] [Rank 1] totoal_tokens=1102, outputs='lunch'
[2024-08-03 15:13:22] [Rank 2] totoal_tokens=1156, outputs='owl'
[2024-08-03 15:13:22] [Rank 0] totoal_tokens=1148, outputs='Region C'
[2024-08-03 15:13:22] [Rank 1] totoal_tokens=1108, outputs='orange'
[2024-08-03 15:13:22] [Rank 3] totoal_tokens=1172, outputs='Diet Plan C'
[2024-08-03 15:13:22] [Rank 2] totoal_tokens=1173, outputs='Bob'
[2024-08-03 15:13:22] [Rank 0] totoal_tokens=1173, outputs='Watch the movie'
[2024-08-03 15:13:22] [Rank 3] totoal_tokens=1172, outputs='Jupiter'
[2024-08-03 15:13:22] [Rank 1] totoal_tokens=1115, outputs='maple tree'
[2024-08-03 15:13:22] [Rank 0] totoal_tokens=1177, outputs='Chocolate cake'
[2024-08-03 15:13:22] [Rank 1] totoal_tokens=1122, outputs='Diet Plan C'
[2024-08-03 15:13:22] [Rank 0] totoal_tokens=1203, outputs='Region C'
[2024-08-03 15:13:22] [Rank 2] totoal_tokens=1177, outputs='Take a yoga class'
[2024-08-03 15:13:22] [Rank 1] totoal_tokens=1140, outputs='Car A'
[2024-08-03 15:13:22] [Rank 3] totoal_tokens=1188, outputs='yellow book'
[2024-08-03 15:13:22] [Rank 0] totoal_tokens=1206, outputs='Filming'
[2024-08-03 15:13:22] [Rank 1] totoal_tokens=1151, outputs='The sales team'
[2024-08-03 15:13:22] [Rank 0] totoal_tokens=1210, outputs='Region C'
[2024-08-03 15:13:22] [Rank 2] totoal_tokens=1180, outputs='Francois Clement'
[2024-08-03 15:13:22] [Rank 2] totoal_tokens=1204, outputs='Charlie'
[2024-08-03 15:13:22] [Rank 1] totoal_tokens=1153, outputs='Initiative Beta'
[2024-08-03 15:13:22] [Rank 0] totoal_tokens=1226, outputs='Charlie'
[2024-08-03 15:13:23] [Rank 3] totoal_tokens=1190, outputs='Investment C'
[2024-08-03 15:13:23] [Rank 1] totoal_tokens=1164, outputs='orange'
[2024-08-03 15:13:23] [Rank 2] totoal_tokens=1226, outputs='orange'
[2024-08-03 15:13:23] [Rank 3] totoal_tokens=1204, outputs='brush his teeth'
[2024-08-03 15:13:23] [Rank 0] totoal_tokens=1234, outputs='The product is launched to the public'
[2024-08-03 15:13:23] [Rank 1] totoal_tokens=1175, outputs='Dr. Johnson'
[2024-08-03 15:13:23] [Rank 3] totoal_tokens=1209, outputs='daisy'
[2024-08-03 15:13:23] [Rank 2] totoal_tokens=1232, outputs='At the end of the day'
[2024-08-03 15:13:23] [Rank 0] totoal_tokens=1237, outputs='Linda'
nl: 6%|β–Œ | 43/751 [00:07<02:15, 5.22it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 6%|β–Œ | 44/751 [00:07<02:02, 5.80it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 6%|β–Œ | 45/751 [00:07<01:52, 6.26it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 6%|β–Œ | 46/751 [00:07<01:54, 6.17it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 6%|β–‹ | 47/751 [00:07<01:52, 6.26it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 6%|β–‹ | 48/751 [00:07<01:47, 6.56it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 7%|β–‹ | 49/751 [00:07<01:41, 6.93it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 7%|β–‹ | 50/751 [00:08<01:40, 6.98it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 7%|β–‹ | 51/751 [00:08<01:40, 6.97it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 7%|β–‹ | 52/751 [00:08<02:03, 5.68it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: [2024-08-03 15:13:23] [Rank 1] totoal_tokens=1175, outputs='Tara'
[2024-08-03 15:13:23] [Rank 3] totoal_tokens=1209, outputs='Impressionist'
[2024-08-03 15:13:23] [Rank 2] totoal_tokens=1257, outputs='Painting the model'
[2024-08-03 15:13:23] [Rank 1] totoal_tokens=1182, outputs='Impressionist'
[2024-08-03 15:13:23] [Rank 3] totoal_tokens=1210, outputs='dog'
[2024-08-03 15:13:23] [Rank 1] totoal_tokens=1210, outputs='orange'
[2024-08-03 15:13:23] [Rank 2] totoal_tokens=1264, outputs='daisy'
[2024-08-03 15:13:23] [Rank 0] totoal_tokens=1250, outputs='Get off the plane if possible at stopovers'
[2024-08-03 15:13:23] [Rank 3] totoal_tokens=1220, outputs='Route X'
[2024-08-03 15:13:23] [Rank 2] totoal_tokens=1265, outputs='John'
[2024-08-03 15:13:23] [Rank 1] totoal_tokens=1215, outputs='Store B'
[2024-08-03 15:13:23] [Rank 3] totoal_tokens=1222, outputs='Store B'
[2024-08-03 15:13:23] [Rank 1] totoal_tokens=1216, outputs='Dr. Smith'
[2024-08-03 15:13:23] [Rank 3] totoal_tokens=1224, outputs='Alpha'
[2024-08-03 15:13:23] [Rank 2] totoal_tokens=1316, outputs='Jupiter'
[2024-08-03 15:13:24] [Rank 1] totoal_tokens=1230, outputs='Fireworks display'
[2024-08-03 15:13:24] [Rank 0] totoal_tokens=1264, outputs='She finishes her Bachelor’s Degree in Mathematics at the American University of Beir'
[2024-08-03 15:13:24] [Rank 3] totoal_tokens=1227, outputs='There was a lunch'
[2024-08-03 15:13:24] [Rank 1] totoal_tokens=1250, outputs='Work'
[2024-08-03 15:13:24] [Rank 3] totoal_tokens=1227, outputs='Alice'
[2024-08-03 15:13:24] [Rank 2] totoal_tokens=1351, outputs='Farm & Gin Show'
[2024-08-03 15:13:24] [Rank 0] totoal_tokens=1268, outputs='lunch'
[2024-08-03 15:13:24] [Rank 1] totoal_tokens=1252, outputs='orange'
[2024-08-03 15:13:24] [Rank 2] totoal_tokens=1368, outputs='salad'
[2024-08-03 15:13:24] [Rank 3] totoal_tokens=1254, outputs='The main course'
[2024-08-03 15:13:24] [Rank 0] totoal_tokens=1326, outputs='sunflower'
[2024-08-03 15:13:24] [Rank 1] totoal_tokens=1306, outputs='Running'
[2024-08-03 15:13:24] [Rank 2] totoal_tokens=1372, outputs='Walls'
[2024-08-03 15:13:24] [Rank 3] totoal_tokens=1383, outputs='sunflower'
[2024-08-03 15:13:24] [Rank 0] totoal_tokens=1349, outputs='oak'
[2024-08-03 15:13:24] [Rank 1] totoal_tokens=1346, outputs='Tulip'
[2024-08-03 15:13:24] [Rank 3] totoal_tokens=1397, outputs='Dragonfly'
[2024-08-03 15:13:24] [Rank 2] totoal_tokens=1399, outputs='Brush his teeth'
[2024-08-03 15:13:24] [Rank 3] totoal_tokens=1406, outputs='Oak'
[2024-08-03 15:13:24] [Rank 1] totoal_tokens=1452, outputs='brush his teeth'
[2024-08-03 15:13:24] [Rank 2] totoal_tokens=1452, outputs='Under the magazines'
[2024-08-03 15:13:24] [Rank 1] totoal_tokens=1453, outputs='Earth'
[2024-08-03 15:13:24] [Rank 3] totoal_tokens=1454, outputs='Competition'
[2024-08-03 15:13:24] [Rank 2] totoal_tokens=1456, outputs='Cara'
[2024-08-03 15:13:25] [Rank 0] totoal_tokens=1350, outputs='Decorations and plants are added to enhance the ambiance.'
[2024-08-03 15:13:25] [Rank 1] totoal_tokens=1455, outputs='Region C'
[2024-08-03 15:13:25] [Rank 3] totoal_tokens=1455, outputs='dessert'
[2024-08-03 15:13:25] [Rank 0] totoal_tokens=1354, outputs='sunflower'
[2024-08-03 15:13:25] [Rank 2] totoal_tokens=1460, outputs='Alexa Chung'
[2024-08-03 15:13:25] [Rank 1] totoal_tokens=1457, outputs='Banana'
[2024-08-03 15:13:25] [Rank 3] totoal_tokens=1462, outputs='Visit the museum'
[2024-08-03 15:13:25] [Rank 1] totoal_tokens=1482, outputs='Cook'
[2024-08-03 15:13:25] [Rank 0] totoal_tokens=1374, outputs='The main course'
[2024-08-03 15:13:25] [Rank 2] totoal_tokens=1480, outputs='Wash his face'
[2024-08-03 15:13:25] [Rank 3] totoal_tokens=1470, outputs='backyard'
[2024-08-03 15:13:25] [Rank 1] totoal_tokens=1492, outputs='dragonfly'
[2024-08-03 15:13:25] [Rank 3] totoal_tokens=1510, outputs='Alice'
[2024-08-03 15:13:25] [Rank 2] totoal_tokens=1491, outputs='catch the bus'
[2024-08-03 15:13:25] [Rank 0] totoal_tokens=1455, outputs='Painting the model is the final step.'
[2024-08-03 15:13:25] [Rank 3] totoal_tokens=1540, outputs='Eat breakfast'
[2024-08-03 15:13:25] [Rank 1] totoal_tokens=1499, outputs='Impressionist painting'
[2024-08-03 15:13:25] [Rank 2] totoal_tokens=1494, outputs='Cactus'
[2024-08-03 15:13:25] [Rank 3] totoal_tokens=1584, outputs='Running'
[2024-08-03 15:13:25] [Rank 2] totoal_tokens=1544, outputs='Bob'
[2024-08-03 15:13:25] [Rank 0] totoal_tokens=1460, outputs='Watch a movie'
7%|β–‹ | 53/751 [00:08<01:51, 6.27it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 7%|β–‹ | 54/751 [00:08<02:23, 4.85it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 7%|β–‹ | 55/751 [00:09<03:15, 3.56it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 7%|β–‹ | 56/751 [00:09<02:44, 4.22it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 8%|β–Š | 57/751 [00:09<02:44, 4.22it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 8%|β–Š | 58/751 [00:09<02:24, 4.81it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 8%|β–Š | 59/751 [00:10<03:09, 3.66it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 8%|β–Š | 60/751 [00:10<02:39, 4.32it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 8%|β–Š | 61/751 [00:10<02:24, 4.77it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 8%|β–Š | 62/751 [00:10<02:45, 4.16it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 8%[2024-08-03 15:13:25] [Rank 1] totoal_tokens=1500, outputs='pine tree'
[2024-08-03 15:13:25] [Rank 3] totoal_tokens=1588, outputs='Alpha'
[2024-08-03 15:13:25] [Rank 0] totoal_tokens=1464, outputs='Car A'
[2024-08-03 15:13:25] [Rank 2] totoal_tokens=1551, outputs='Car A'
[2024-08-03 15:13:25] [Rank 1] totoal_tokens=1501, outputs='Initiative Gamma'
[2024-08-03 15:13:25] [Rank 3] totoal_tokens=1744, outputs='Alice'
[2024-08-03 15:13:26] [Rank 2] totoal_tokens=1599, outputs='John'
[2024-08-03 15:13:26] [Rank 0] totoal_tokens=1465, outputs='Dr. Carter'
[2024-08-03 15:13:26] [Rank 3] totoal_tokens=1747, outputs='end of the day'
[2024-08-03 15:13:26] [Rank 0] totoal_tokens=1467, outputs='Linda'
[2024-08-03 15:13:26] [Rank 1] totoal_tokens=1520, outputs='Eat, Pray, Love'
[2024-08-03 15:13:26] [Rank 2] totoal_tokens=1631, outputs='on the coffee table'
[2024-08-03 15:13:26] [Rank 3] totoal_tokens=1755, outputs='Store B'
[2024-08-03 15:13:26] [Rank 0] totoal_tokens=1477, outputs='Wash it'
[2024-08-03 15:13:26] [Rank 1] totoal_tokens=1595, outputs='Chocolate cake'
[2024-08-03 15:13:26] [Rank 2] totoal_tokens=1640, outputs='The lights are turned on'
[2024-08-03 15:13:26] [Rank 3] totoal_tokens=1899, outputs='Banana'
[2024-08-03 15:13:26] [Rank 0] totoal_tokens=1485, outputs='running'
[2024-08-03 15:13:26] [Rank 1] totoal_tokens=1596, outputs='Dr. Johnson'
[2024-08-03 15:13:26] [Rank 3] totoal_tokens=1912, outputs='Sue'
[2024-08-03 15:13:26] [Rank 2] totoal_tokens=1753, outputs='Decorations and plants'
[2024-08-03 15:13:26] [Rank 0] totoal_tokens=1489, outputs='Crowdfunding'
[2024-08-03 15:13:26] [Rank 1] totoal_tokens=1625, outputs='hike'
[2024-08-03 15:13:26] [Rank 2] totoal_tokens=1855, outputs='Route Y'
[2024-08-03 15:13:26] [Rank 0] totoal_tokens=1509, outputs='Walls'
[2024-08-03 15:13:26] [Rank 3] totoal_tokens=1926, outputs='La-mina Monoyra'
[2024-08-03 15:13:26] [Rank 1] totoal_tokens=1703, outputs='The dragonfly'
[2024-08-03 15:13:26] [Rank 2] totoal_tokens=1903, outputs='Dragonfly'
[2024-08-03 15:13:26] [Rank 0] totoal_tokens=1545, outputs='Charlie'
[2024-08-03 15:13:27] [Rank 3] totoal_tokens=1948, outputs='Region X'
[2024-08-03 15:13:27] [Rank 1] totoal_tokens=1707, outputs='Linda'
[2024-08-03 15:13:27] [Rank 2] totoal_tokens=1909, outputs='Liam'
[2024-08-03 15:13:27] [Rank 0] totoal_tokens=1712, outputs='Dr. Allen'
[2024-08-03 15:13:27] [Rank 3] totoal_tokens=1974, outputs='Gamma'
[2024-08-03 15:13:27] [Rank 1] totoal_tokens=1907, outputs='Brushes his teeth'
[2024-08-03 15:13:27] [Rank 2] totoal_tokens=1929, outputs='lunch'
[2024-08-03 15:13:27] [Rank 0] totoal_tokens=1841, outputs='cat'
|β–Š | 63/751 [00:11<02:28, 4.63it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 9%|β–Š | 64/751 [00:11<02:11, 5.21it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 9%|β–Š | 65/751 [00:11<02:04, 5.49it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 9%|β–‰ | 66/751 [00:11<01:54, 5.97it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 9%|β–‰ | 67/751 [00:11<01:54, 6.00it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 9%|β–‰ | 68/751 [00:11<01:43, 6.57it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 9%|β–‰ | 69/751 [00:11<01:50, 6.18it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 9%|β–‰ | 70/751 [00:12<01:44, 6.53it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 9%|β–‰ | 71/751 [00:12<01:42, 6.61it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 10%|β–‰ | 72/751 [00:12<01:48, 6.24it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 10%|β–‰[2024-08-03 15:13:27] [Rank 3] totoal_tokens=2035, outputs='under the magazines'
[2024-08-03 15:13:27] [Rank 2] totoal_tokens=1932, outputs='Bob'
[2024-08-03 15:13:27] [Rank 1] totoal_tokens=1911, outputs='pine tree'
[2024-08-03 15:13:27] [Rank 0] totoal_tokens=1853, outputs='Linda'
[2024-08-03 15:13:27] [Rank 3] totoal_tokens=2038, outputs='Route X'
[2024-08-03 15:13:27] [Rank 2] totoal_tokens=2042, outputs='Bob'
[2024-08-03 15:13:27] [Rank 1] totoal_tokens=1975, outputs='Red'
[2024-08-03 15:13:27] [Rank 0] totoal_tokens=1893, outputs='Salad'
[2024-08-03 15:13:27] [Rank 3] totoal_tokens=2046, outputs='salad'
[2024-08-03 15:13:27] [Rank 1] totoal_tokens=1997, outputs='Route X'
[2024-08-03 15:13:27] [Rank 2] totoal_tokens=2043, outputs='brush his teeth'
[2024-08-03 15:13:27] [Rank 0] totoal_tokens=1902, outputs='Store Z'
[2024-08-03 15:13:27] [Rank 3] totoal_tokens=2049, outputs='Cara'
[2024-08-03 15:13:27] [Rank 0] totoal_tokens=1905, outputs='owl'
[2024-08-03 15:13:27] [Rank 1] totoal_tokens=2026, outputs='Brush his teeth'
[2024-08-03 15:13:27] [Rank 2] totoal_tokens=2046, outputs='Initiative Gamma'
[2024-08-03 15:13:27] [Rank 0] totoal_tokens=1906, outputs='orange'
[2024-08-03 15:13:27] [Rank 3] totoal_tokens=2067, outputs='Tara'
[2024-08-03 15:13:27] [Rank 1] totoal_tokens=2027, outputs='Main course'
[2024-08-03 15:13:28] [Rank 2] totoal_tokens=2047, outputs='Investment C'
[2024-08-03 15:13:28] [Rank 0] totoal_tokens=1909, outputs='Watched the movie'
[2024-08-03 15:13:28] [Rank 3] totoal_tokens=2071, outputs='The yellow book'
[2024-08-03 15:13:28] [Rank 1] totoal_tokens=2033, outputs='Decorations and plants'
[2024-08-03 15:13:28] [Rank 2] totoal_tokens=2047, outputs='Region X'
[2024-08-03 15:13:28] [Rank 3] totoal_tokens=2072, outputs='Linda'
[2024-08-03 15:13:28] [Rank 0] totoal_tokens=1919, outputs='in the backyard'
[2024-08-03 15:13:28] [Rank 1] totoal_tokens=2035, outputs='Runs'
[2024-08-03 15:13:28] [Rank 2] totoal_tokens=2051, outputs='Cara'
[2024-08-03 15:13:28] [Rank 3] totoal_tokens=2078, outputs='Region Z'
[2024-08-03 15:13:28] [Rank 0] totoal_tokens=1927, outputs='Paint the model'
[2024-08-03 15:13:28] [Rank 1] totoal_tokens=2037, outputs='Cara'
[2024-08-03 15:13:28] [Rank 2] totoal_tokens=2065, outputs='Dr. Liu'
[2024-08-03 15:13:28] [Rank 3] totoal_tokens=2079, outputs='Pack their school bag'
[2024-08-03 15:13:28] [Rank 1] totoal_tokens=2039, outputs='dessert'
[2024-08-03 15:13:28] [Rank 0] totoal_tokens=1928, outputs='Dessert'
[2024-08-03 15:13:28] [Rank 2] totoal_tokens=2075, outputs='Diet Plan C'
[2024-08-03 15:13:28] [Rank 3] totoal_tokens=2080, outputs='Hike'
[2024-08-03 15:13:28] [Rank 0] totoal_tokens=1950, outputs='dragonfly'
| 73/751 [00:12<01:39, 6.83it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 10%|β–‰ | 74/751 [00:12<01:37, 6.92it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 10%|β–‰ | 75/751 [00:12<01:37, 6.91it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 10%|β–ˆ | 76/751 [00:12<01:38, 6.84it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 10%|β–ˆ | 77/751 [00:13<01:32, 7.28it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 10%|β–ˆ | 78/751 [00:13<01:27, 7.66it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 11%|β–ˆ | 79/751 [00:13<01:39, 6.73it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 11%|β–ˆ | 80/751 [00:13<01:45, 6.34it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 11%|β–ˆ | 81/751 [00:13<01:50, 6.09it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 11%|β–ˆ | 82/751 [00:13<01:52, 5.96it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 11%|β–ˆ [2024-08-03 15:13:28] [Rank 1] totoal_tokens=2070, outputs='Lunch'
[2024-08-03 15:13:28] [Rank 2] totoal_tokens=2076, outputs='Mark'
[2024-08-03 15:13:28] [Rank 3] totoal_tokens=2080, outputs='dragonfly'
[2024-08-03 15:13:28] [Rank 0] totoal_tokens=1956, outputs='Lucy'
[2024-08-03 15:13:29] [Rank 1] totoal_tokens=2078, outputs='Northwest Blend Smoking Chips'
[2024-08-03 15:13:29] [Rank 2] totoal_tokens=2076, outputs='Decorations and plants'
[2024-08-03 15:13:29] [Rank 3] totoal_tokens=2081, outputs='walls'
[2024-08-03 15:13:29] [Rank 0] totoal_tokens=2026, outputs='Post-production'
[2024-08-03 15:13:29] [Rank 1] totoal_tokens=2090, outputs='Running'
[2024-08-03 15:13:29] [Rank 2] totoal_tokens=2080, outputs='Sales team'
[2024-08-03 15:13:29] [Rank 3] totoal_tokens=2082, outputs='Cradle Of Filth'
[2024-08-03 15:13:29] [Rank 1] totoal_tokens=2092, outputs='Dr. Liu'
[2024-08-03 15:13:29] [Rank 0] totoal_tokens=2039, outputs="Conseil de l'Europe"
[2024-08-03 15:13:29] [Rank 2] totoal_tokens=2088, outputs='jogging'
[2024-08-03 15:13:29] [Rank 3] totoal_tokens=2101, outputs='The sales team'
[2024-08-03 15:13:29] [Rank 1] totoal_tokens=2105, outputs='Initiative Alpha'
[2024-08-03 15:13:29] [Rank 2] totoal_tokens=2095, outputs='Store B'
[2024-08-03 15:13:29] [Rank 0] totoal_tokens=2047, outputs='The red book'
[2024-08-03 15:13:29] [Rank 3] totoal_tokens=2106, outputs='Brush his teeth'
[2024-08-03 15:13:29] [Rank 0] totoal_tokens=2050, outputs='Company Y'
[2024-08-03 15:13:29] [Rank 3] totoal_tokens=2107, outputs='dessert'
[2024-08-03 15:13:29] [Rank 1] totoal_tokens=2133, outputs='Across the street'
[2024-08-03 15:13:29] [Rank 0] totoal_tokens=2063, outputs='Alice'
[2024-08-03 15:13:30] [Rank 1] totoal_tokens=2143, outputs='Sunflower'
[2024-08-03 15:13:30] [Rank 3] totoal_tokens=2108, outputs='Store Z'
[2024-08-03 15:13:30] [Rank 2] totoal_tokens=2107, outputs='Dr. Liu'
[2024-08-03 15:13:30] [Rank 0] totoal_tokens=2065, outputs='Cara'
[2024-08-03 15:13:30] [Rank 3] totoal_tokens=2109, outputs='Store B'
[2024-08-03 15:13:30] [Rank 2] totoal_tokens=2127, outputs='dragonfly'
[2024-08-03 15:13:30] [Rank 1] totoal_tokens=2148, outputs="Linda's"
[2024-08-03 15:13:30] [Rank 0] totoal_tokens=2076, outputs='oak'
[2024-08-03 15:13:30] [Rank 3] totoal_tokens=2110, outputs='Visit the museum'
[2024-08-03 15:13:30] [Rank 1] totoal_tokens=2166, outputs='Chocolate cake'
[2024-08-03 15:13:30] [Rank 0] totoal_tokens=2079, outputs='Earth'
[2024-08-03 15:13:30] [Rank 2] totoal_tokens=2130, outputs='Go to the museum'
[2024-08-03 15:13:30] [Rank 3] totoal_tokens=2115, outputs='red book'
[2024-08-03 15:13:30] [Rank 1] totoal_tokens=2189, outputs='Investment C'
[2024-08-03 15:13:30] [Rank 0] totoal_tokens=2080, outputs='Read the book'
| 83/751 [00:14<01:48, 6.15it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 11%|β–ˆ | 84/751 [00:14<01:46, 6.28it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 11%|β–ˆβ– | 85/751 [00:14<01:56, 5.72it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 11%|β–ˆβ– | 86/751 [00:14<02:27, 4.51it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 12%|β–ˆβ– | 87/751 [00:14<02:20, 4.71it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 12%|β–ˆβ– | 88/751 [00:15<02:13, 4.97it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 12%|β–ˆβ– | 89/751 [00:15<01:56, 5.68it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 12%|β–ˆβ– | 90/751 [00:15<01:51, 5.92it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 12%|β–ˆβ– | 91/751 [00:15<01:54, 5.74it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 12%|β–ˆβ– | 92/751 [00:15<01:45, 6.27it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: [2024-08-03 15:13:30] [Rank 2] totoal_tokens=2130, outputs='Editing'
[2024-08-03 15:13:30] [Rank 3] totoal_tokens=2139, outputs='Lucy'
[2024-08-03 15:13:30] [Rank 1] totoal_tokens=2191, outputs='Store B'
[2024-08-03 15:13:30] [Rank 0] totoal_tokens=2085, outputs='Julie'
[2024-08-03 15:13:30] [Rank 3] totoal_tokens=2171, outputs='Bob'
[2024-08-03 15:13:30] [Rank 2] totoal_tokens=2133, outputs='Next to the couch'
[2024-08-03 15:13:30] [Rank 1] totoal_tokens=2198, outputs='Store Z'
[2024-08-03 15:13:30] [Rank 0] totoal_tokens=2098, outputs='A chocolate cake'
[2024-08-03 15:13:30] [Rank 3] totoal_tokens=2175, outputs='C'
[2024-08-03 15:13:31] [Rank 1] totoal_tokens=2204, outputs='brush his teeth'
[2024-08-03 15:13:31] [Rank 0] totoal_tokens=2101, outputs='Rome'
[2024-08-03 15:13:31] [Rank 3] totoal_tokens=2178, outputs='Store Z'
[2024-08-03 15:13:31] [Rank 2] totoal_tokens=2135, outputs='A founding member of the group'
[2024-08-03 15:13:31] [Rank 3] totoal_tokens=2204, outputs='Editing'
[2024-08-03 15:13:31] [Rank 1] totoal_tokens=2275, outputs='banana'
[2024-08-03 15:13:31] [Rank 2] totoal_tokens=2161, outputs='Initiative Alpha'
[2024-08-03 15:13:31] [Rank 0] totoal_tokens=2104, outputs='Decorations and plants are added'
[2024-08-03 15:13:31] [Rank 3] totoal_tokens=2285, outputs='Diet Plan C'
[2024-08-03 15:13:31] [Rank 1] totoal_tokens=2286, outputs='Eat breakfast'
[2024-08-03 15:13:31] [Rank 0] totoal_tokens=2106, outputs='Bob'
[2024-08-03 15:13:31] [Rank 2] totoal_tokens=2185, outputs='Diet Plan C'
[2024-08-03 15:13:31] [Rank 1] totoal_tokens=2291, outputs='owl'
[2024-08-03 15:13:31] [Rank 3] totoal_tokens=2436, outputs='Investment C'
[2024-08-03 15:13:31] [Rank 0] totoal_tokens=2116, outputs='backyard'
[2024-08-03 15:13:31] [Rank 2] totoal_tokens=2186, outputs='Fireworks display'
[2024-08-03 15:13:31] [Rank 3] totoal_tokens=2482, outputs='Store C'
[2024-08-03 15:13:31] [Rank 1] totoal_tokens=2293, outputs='Owl'
[2024-08-03 15:13:31] [Rank 2] totoal_tokens=2193, outputs='dog'
[2024-08-03 15:13:31] [Rank 0] totoal_tokens=2130, outputs='Mia'
[2024-08-03 15:13:32] [Rank 0] totoal_tokens=2134, outputs='Lucy'
[2024-08-03 15:13:32] [Rank 2] totoal_tokens=2194, outputs='a chocolate cake'
[2024-08-03 15:13:32] [Rank 3] totoal_tokens=2487, outputs='Dr. Kildare'
[2024-08-03 15:13:32] [Rank 1] totoal_tokens=2297, outputs='Store Z'
[2024-08-03 15:13:32] [Rank 0] totoal_tokens=2135, outputs='Bob'
[2024-08-03 15:13:32] [Rank 2] totoal_tokens=2216, outputs='Watched the movie'
[2024-08-03 15:13:32] [Rank 1] totoal_tokens=2483, outputs='Walls'
[2024-08-03 15:13:32] [Rank 3] totoal_tokens=2525, outputs='Watched the movie'
[2024-08-03 15:13:32] [Rank 0] totoal_tokens=2135, outputs='Watched the movie'
12%|β–ˆβ– | 93/751 [00:15<01:48, 6.07it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 13%|β–ˆβ–Ž | 94/751 [00:16<01:52, 5.86it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 13%|β–ˆβ–Ž | 95/751 [00:16<01:53, 5.77it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 13%|β–ˆβ–Ž | 96/751 [00:16<01:49, 5.98it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 13%|β–ˆβ–Ž | 97/751 [00:16<02:06, 5.15it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 13%|β–ˆβ–Ž | 98/751 [00:16<01:53, 5.74it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 13%|β–ˆβ–Ž | 99/751 [00:16<01:51, 5.82it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 13%|β–ˆβ–Ž | 100/751 [00:17<02:03, 5.27it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 13%|β–ˆβ–Ž | 101/751 [00:17<02:02, 5.29it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 14%|β–ˆβ–Ž | 102/751 [00:17<01:51, 5.84it/s] Processing InternVL2-2B_reasonin[2024-08-03 15:13:32] [Rank 2] totoal_tokens=2295, outputs='sales team'
[2024-08-03 15:13:32] [Rank 1] totoal_tokens=2484, outputs='Susan'
[2024-08-03 15:13:32] [Rank 3] totoal_tokens=2532, outputs='walls'
[2024-08-03 15:13:32] [Rank 0] totoal_tokens=2148, outputs='sunflower'
[2024-08-03 15:13:32] [Rank 1] totoal_tokens=2498, outputs='Lunch'
[2024-08-03 15:13:32] [Rank 2] totoal_tokens=2332, outputs='Region X'
[2024-08-03 15:13:32] [Rank 3] totoal_tokens=2630, outputs='The game is released'
[2024-08-03 15:13:32] [Rank 0] totoal_tokens=2220, outputs='Store Z'
[2024-08-03 15:13:32] [Rank 2] totoal_tokens=2381, outputs='Product C'
[2024-08-03 15:13:32] [Rank 1] totoal_tokens=2520, outputs='Under the magazines'
[2024-08-03 15:13:33] [Rank 0] totoal_tokens=2228, outputs='salad'
[2024-08-03 15:13:33] [Rank 3] totoal_tokens=2633, outputs='tulip'
[2024-08-03 15:13:33] [Rank 1] totoal_tokens=2533, outputs='B'
[2024-08-03 15:13:33] [Rank 2] totoal_tokens=2467, outputs='salad'
[2024-08-03 15:13:33] [Rank 0] totoal_tokens=2272, outputs='owl'
[2024-08-03 15:13:33] [Rank 0] totoal_tokens=2273, outputs='Alice'
[2024-08-03 15:13:33] [Rank 1] totoal_tokens=2546, outputs='Cactus'
[2024-08-03 15:13:33] [Rank 2] totoal_tokens=2521, outputs='Impressionist'
[2024-08-03 15:13:33] [Rank 1] totoal_tokens=2567, outputs='Get bus'
[2024-08-03 15:13:33] [Rank 0] totoal_tokens=2290, outputs='Project Alpha'
[2024-08-03 15:13:33] [Rank 3] totoal_tokens=2642, outputs='Combine one price based indicator and one non-price based indicator to identify short-term'
[2024-08-03 15:13:33] [Rank 2] totoal_tokens=2529, outputs='Dr. Carter'
[2024-08-03 15:13:33] [Rank 1] totoal_tokens=2569, outputs='Dragonfly'
[2024-08-03 15:13:33] [Rank 3] totoal_tokens=2644, outputs='Catch the bus'
[2024-08-03 15:13:33] [Rank 2] totoal_tokens=2531, outputs='Route X'
[2024-08-03 15:13:33] [Rank 0] totoal_tokens=2294, outputs='At the end of the day'
[2024-08-03 15:13:33] [Rank 1] totoal_tokens=2571, outputs='Selling'
[2024-08-03 15:13:33] [Rank 0] totoal_tokens=2302, outputs='Store B'
[2024-08-03 15:13:34] [Rank 3] totoal_tokens=2651, outputs='At the end of the day'
[2024-08-03 15:13:34] [Rank 1] totoal_tokens=2612, outputs='Eat breakfast'
[2024-08-03 15:13:34] [Rank 2] totoal_tokens=2538, outputs='Northern Michigan'
[2024-08-03 15:13:34] [Rank 0] totoal_tokens=2303, outputs='Store B'
[2024-08-03 15:13:34] [Rank 3] totoal_tokens=2665, outputs='chocolate cake'
[2024-08-03 15:13:34] [Rank 1] totoal_tokens=2633, outputs='Dr. Liu'
[2024-08-03 15:13:34] [Rank 2] totoal_tokens=2580, outputs='The blue book'
[2024-08-03 15:13:34] [Rank 3] totoal_tokens=2671, outputs='Owl'
[2024-08-03 15:13:34] [Rank 0] totoal_tokens=2324, outputs='OΓ­che'
g-text-test.jsonl: 14%|β–ˆβ–Ž | 103/751 [00:17<02:05, 5.18it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 14%|β–ˆβ– | 104/751 [00:17<02:03, 5.24it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 14%|β–ˆβ– | 105/751 [00:18<02:01, 5.32it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 14%|β–ˆβ– | 106/751 [00:18<01:56, 5.53it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 14%|β–ˆβ– | 107/751 [00:18<01:46, 6.02it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 14%|β–ˆβ– | 108/751 [00:18<01:39, 6.43it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 15%|β–ˆβ– | 109/751 [00:18<01:59, 5.37it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 15%|β–ˆβ– | 110/751 [00:19<02:17, 4.67it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 15%|β–ˆβ– | 111/751 [00:19<02:09, 4.94it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 15%|β–ˆβ– | 112/751 [00:19<02:04, 5.14it/s] Proces[2024-08-03 15:13:34] [Rank 1] totoal_tokens=2673, outputs='Lunch'
[2024-08-03 15:13:34] [Rank 2] totoal_tokens=2590, outputs='yellow book'
[2024-08-03 15:13:34] [Rank 3] totoal_tokens=2672, outputs='Dog'
[2024-08-03 15:13:34] [Rank 1] totoal_tokens=2676, outputs='Sales team'
[2024-08-03 15:13:34] [Rank 0] totoal_tokens=2375, outputs='Lucy'
[2024-08-03 15:13:34] [Rank 2] totoal_tokens=2669, outputs='Dr. Johnson'
[2024-08-03 15:13:34] [Rank 0] totoal_tokens=2427, outputs='Bob'
[2024-08-03 15:13:34] [Rank 1] totoal_tokens=2677, outputs='Route X'
[2024-08-03 15:13:34] [Rank 3] totoal_tokens=2675, outputs='Impressionist painting'
[2024-08-03 15:13:34] [Rank 2] totoal_tokens=2671, outputs='Disaster Crimes'
[2024-08-03 15:13:34] [Rank 0] totoal_tokens=2487, outputs='Royals'
[2024-08-03 15:13:34] [Rank 3] totoal_tokens=2678, outputs='Brush his teeth'
[2024-08-03 15:13:34] [Rank 1] totoal_tokens=2685, outputs='Anorexia'
[2024-08-03 15:13:35] [Rank 2] totoal_tokens=2672, outputs='Banana'
[2024-08-03 15:13:35] [Rank 3] totoal_tokens=2687, outputs='C'
[2024-08-03 15:13:35] [Rank 0] totoal_tokens=2496, outputs='Dr. Zhang'
[2024-08-03 15:13:35] [Rank 2] totoal_tokens=2678, outputs='Region C'
[2024-08-03 15:13:35] [Rank 1] totoal_tokens=2688, outputs="Chrys Fey's place"
[2024-08-03 15:13:35] [Rank 3] totoal_tokens=2697, outputs='Nick Serota'
[2024-08-03 15:13:35] [Rank 0] totoal_tokens=2524, outputs='Region C'
[2024-08-03 15:13:35] [Rank 2] totoal_tokens=2683, outputs='dessert'
[2024-08-03 15:13:35] [Rank 1] totoal_tokens=2782, outputs='pine tree'
[2024-08-03 15:13:35] [Rank 3] totoal_tokens=2697, outputs='Julie'
[2024-08-03 15:13:35] [Rank 0] totoal_tokens=2528, outputs='Aerobatics'
[2024-08-03 15:13:35] [Rank 2] totoal_tokens=2697, outputs='Catch the bus'
[2024-08-03 15:13:35] [Rank 1] totoal_tokens=2848, outputs='Tulip'
[2024-08-03 15:13:35] [Rank 3] totoal_tokens=2701, outputs='Bob'
[2024-08-03 15:13:35] [Rank 0] totoal_tokens=2544, outputs='Investment C'
[2024-08-03 15:13:35] [Rank 2] totoal_tokens=2703, outputs='Lunch'
[2024-08-03 15:13:35] [Rank 3] totoal_tokens=2746, outputs='orange'
[2024-08-03 15:13:35] [Rank 1] totoal_tokens=2856, outputs='parrot'
[2024-08-03 15:13:35] [Rank 0] totoal_tokens=2629, outputs='Next to'
[2024-08-03 15:13:36] [Rank 2] totoal_tokens=2703, outputs='Dr. Liu'
[2024-08-03 15:13:36] [Rank 3] totoal_tokens=2746, outputs='Initiative Alpha'
[2024-08-03 15:13:36] [Rank 1] totoal_tokens=2861, outputs='Initiative Alpha'
[2024-08-03 15:13:36] [Rank 3] totoal_tokens=2750, outputs='Mark'
[2024-08-03 15:13:36] [Rank 0] totoal_tokens=2634, outputs='Initiative Alpha'
sing InternVL2-2B_reasoning-text-test.jsonl: 15%|β–ˆβ–Œ | 113/751 [00:19<02:16, 4.67it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 15%|β–ˆβ–Œ | 114/751 [00:19<02:13, 4.79it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 15%|β–ˆβ–Œ | 115/751 [00:20<01:59, 5.30it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 15%|β–ˆβ–Œ | 116/751 [00:20<02:04, 5.11it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 16%|β–ˆβ–Œ | 117/751 [00:20<02:06, 4.99it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 16%|β–ˆβ–Œ | 118/751 [00:20<02:09, 4.90it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 16%|β–ˆβ–Œ | 119/751 [00:20<02:13, 4.72it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 16%|β–ˆβ–Œ | 120/751 [00:21<02:09, 4.87it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 16%|β–ˆβ–Œ | 121/751 [00:21<02:06, 5.00it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 16%|β–ˆβ–Œ | 122/751 [00:2[2024-08-03 15:13:36] [Rank 2] totoal_tokens=2704, outputs='A chocolate cake'
[2024-08-03 15:13:36] [Rank 1] totoal_tokens=2863, outputs='dog'
[2024-08-03 15:13:36] [Rank 3] totoal_tokens=2757, outputs='backyard'
[2024-08-03 15:13:36] [Rank 0] totoal_tokens=2663, outputs='Project Gamma'
[2024-08-03 15:13:36] [Rank 2] totoal_tokens=2716, outputs='Lunch'
[2024-08-03 15:13:36] [Rank 1] totoal_tokens=2869, outputs='Region X'
[2024-08-03 15:13:36] [Rank 0] totoal_tokens=2669, outputs='Bob'
[2024-08-03 15:13:36] [Rank 3] totoal_tokens=2772, outputs='Company X'
[2024-08-03 15:13:36] [Rank 2] totoal_tokens=2758, outputs='Store B'
[2024-08-03 15:13:36] [Rank 1] totoal_tokens=2873, outputs='Mars'
[2024-08-03 15:13:36] [Rank 0] totoal_tokens=2677, outputs='Car A'
[2024-08-03 15:13:36] [Rank 3] totoal_tokens=2792, outputs='The roof'
[2024-08-03 15:13:36] [Rank 2] totoal_tokens=2790, outputs='Lucas'
[2024-08-03 15:13:36] [Rank 1] totoal_tokens=2894, outputs='dog'
[2024-08-03 15:13:36] [Rank 0] totoal_tokens=2697, outputs='Company X'
[2024-08-03 15:13:36] [Rank 3] totoal_tokens=2848, outputs='Marketing department'
[2024-08-03 15:13:36] [Rank 1] totoal_tokens=2918, outputs='orange'
[2024-08-03 15:13:36] [Rank 2] totoal_tokens=2853, outputs='Walls'
[2024-08-03 15:13:37] [Rank 3] totoal_tokens=2854, outputs='Bob'
[2024-08-03 15:13:37] [Rank 1] totoal_tokens=2925, outputs='Fireworks display'
[2024-08-03 15:13:37] [Rank 2] totoal_tokens=2860, outputs='Catch the bus to school'
[2024-08-03 15:13:37] [Rank 3] totoal_tokens=2868, outputs='Impressionist'
[2024-08-03 15:13:37] [Rank 1] totoal_tokens=2950, outputs='Store B'
[2024-08-03 15:13:37] [Rank 0] totoal_tokens=2700, outputs='Mohammad Gharazi, former oil minister and one of the founders of'
[2024-08-03 15:13:37] [Rank 3] totoal_tokens=2914, outputs='lunch'
[2024-08-03 15:13:37] [Rank 2] totoal_tokens=2864, outputs='decorations and plants'
[2024-08-03 15:13:37] [Rank 0] totoal_tokens=2746, outputs='Jupiter'
[2024-08-03 15:13:37] [Rank 1] totoal_tokens=2957, outputs='Take a yoga class'
[2024-08-03 15:13:37] [Rank 3] totoal_tokens=2934, outputs='John'
[2024-08-03 15:13:37] [Rank 2] totoal_tokens=2871, outputs='Tim'
[2024-08-03 15:13:37] [Rank 0] totoal_tokens=2758, outputs='Store B'
[2024-08-03 15:13:37] [Rank 3] totoal_tokens=2943, outputs='Frank'
[2024-08-03 15:13:37] [Rank 1] totoal_tokens=2994, outputs='Dragonfly'
[2024-08-03 15:13:37] [Rank 2] totoal_tokens=2886, outputs='Cycling'
[2024-08-03 15:13:37] [Rank 0] totoal_tokens=2765, outputs='Company Y'
[2024-08-03 15:13:38] [Rank 1] totoal_tokens=2999, outputs='Lucy'
[2024-08-03 15:13:38] [Rank 3] totoal_tokens=2962, outputs='Minnesota Street Project'
[2024-08-03 15:13:38] [Rank 2] totoal_tokens=2892, outputs='Cornhole'
[2024-08-03 15:13:38] [Rank 0] totoal_tokens=2843, outputs='dog'
[2024-08-03 15:13:38] [Rank 3] totoal_tokens=3019, outputs='Store Z'
[2024-08-03 15:13:38] [Rank 1] totoal_tokens=3000, outputs='executives'
[2024-08-03 15:13:38] [Rank 2] totoal_tokens=2908, outputs='Store B'
[2024-08-03 15:13:38] [Rank 0] totoal_tokens=2866, outputs='Cara'
1<02:09, 4.86it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 16%|β–ˆβ–‹ | 123/751 [00:21<02:04, 5.03it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 17%|β–ˆβ–‹ | 124/751 [00:21<01:53, 5.50it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 17%|β–ˆβ–‹ | 125/751 [00:22<01:52, 5.56it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 17%|β–ˆβ–‹ | 126/751 [00:22<01:54, 5.47it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 17%|β–ˆβ–‹ | 127/751 [00:22<02:55, 3.56it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 17%|β–ˆβ–‹ | 128/751 [00:22<02:35, 4.00it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 17%|β–ˆβ–‹ | 129/751 [00:23<02:21, 4.39it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 17%|β–ˆβ–‹ | 130/751 [00:23<02:11, 4.72it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 17%|β–ˆβ–‹ | 131/751 [00:23<02:01, 5.10it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 18%|β–ˆ[2024-08-03 15:13:38] [Rank 3] totoal_tokens=3022, outputs='Store Z'
[2024-08-03 15:13:38] [Rank 2] totoal_tokens=2913, outputs='Madrid'
[2024-08-03 15:13:38] [Rank 1] totoal_tokens=3016, outputs='Dr. Liu'
[2024-08-03 15:13:38] [Rank 0] totoal_tokens=2870, outputs='Cooking'
[2024-08-03 15:13:38] [Rank 2] totoal_tokens=2920, outputs='Owl'
[2024-08-03 15:13:38] [Rank 3] totoal_tokens=3066, outputs='Decorations and plants'
[2024-08-03 15:13:38] [Rank 0] totoal_tokens=2871, outputs='Lucas'
[2024-08-03 15:13:38] [Rank 2] totoal_tokens=2921, outputs='dessert'
[2024-08-03 15:13:38] [Rank 1] totoal_tokens=3043, outputs='At the end of the day'
[2024-08-03 15:13:38] [Rank 0] totoal_tokens=2899, outputs='Lucy'
[2024-08-03 15:13:38] [Rank 3] totoal_tokens=3093, outputs='The car is driven away'
[2024-08-03 15:13:38] [Rank 2] totoal_tokens=2926, outputs='Oak'
[2024-08-03 15:13:39] [Rank 1] totoal_tokens=3044, outputs='dragonfly'
[2024-08-03 15:13:39] [Rank 0] totoal_tokens=2951, outputs='on the coffee table'
[2024-08-03 15:13:39] [Rank 3] totoal_tokens=3100, outputs='Alpha'
[2024-08-03 15:13:39] [Rank 2] totoal_tokens=3005, outputs='Decorations and plants'
[2024-08-03 15:13:39] [Rank 0] totoal_tokens=2983, outputs='Read the newspaper'
[2024-08-03 15:13:39] [Rank 3] totoal_tokens=3105, outputs="Cara's backyard"
[2024-08-03 15:13:39] [Rank 2] totoal_tokens=3010, outputs='Dr. Lee'
[2024-08-03 15:13:39] [Rank 0] totoal_tokens=3025, outputs='Red Book'
[2024-08-03 15:13:39] [Rank 3] totoal_tokens=3113, outputs='Investment C'
[2024-08-03 15:13:39] [Rank 1] totoal_tokens=3049, outputs='The sales team'
[2024-08-03 15:13:39] [Rank 2] totoal_tokens=3056, outputs='Dr. Lee'
[2024-08-03 15:13:39] [Rank 0] totoal_tokens=3040, outputs='Brush his teeth'
[2024-08-03 15:13:39] [Rank 3] totoal_tokens=3116, outputs='Trees'
[2024-08-03 15:13:39] [Rank 1] totoal_tokens=3053, outputs='Take a yoga class'
[2024-08-03 15:13:39] [Rank 2] totoal_tokens=3095, outputs='Organise'
[2024-08-03 15:13:40] [Rank 0] totoal_tokens=3041, outputs='Fireworks display'
[2024-08-03 15:13:40] [Rank 3] totoal_tokens=3196, outputs='Eagle'
[2024-08-03 15:13:40] [Rank 1] totoal_tokens=3054, outputs='Lucas'
[2024-08-03 15:13:40] [Rank 2] totoal_tokens=3098, outputs='Charlie'
[2024-08-03 15:13:40] [Rank 0] totoal_tokens=3043, outputs='Bob'
[2024-08-03 15:13:40] [Rank 3] totoal_tokens=3240, outputs='pack school bag'
[2024-08-03 15:13:40] [Rank 1] totoal_tokens=3065, outputs='dessert'
[2024-08-03 15:13:40] [Rank 2] totoal_tokens=3105, outputs='The shell'
[2024-08-03 15:13:40] [Rank 0] totoal_tokens=3046, outputs='Car A'
β–Š | 132/751 [00:23<02:01, 5.10it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 18%|β–ˆβ–Š | 133/751 [00:23<01:59, 5.19it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 18%|β–ˆβ–Š | 134/751 [00:23<01:57, 5.27it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 18%|β–ˆβ–Š | 135/751 [00:24<01:56, 5.30it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 18%|β–ˆβ–Š | 136/751 [00:24<02:05, 4.92it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 18%|β–ˆβ–Š | 137/751 [00:24<02:06, 4.87it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 18%|β–ˆβ–Š | 138/751 [00:24<02:02, 5.02it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 19%|β–ˆβ–Š | 139/751 [00:25<02:21, 4.34it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 19%|β–ˆβ–Š | 140/751 [00:25<02:17, 4.45it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 19%|β–ˆβ–‰ | 141/751 [00:25<02:05, 4.85it/s] Processing InternVL2-2B_reasoning[2024-08-03 15:13:40] [Rank 3] totoal_tokens=3253, outputs='Investment C'
[2024-08-03 15:13:40] [Rank 1] totoal_tokens=3077, outputs='The parrot'
[2024-08-03 15:13:40] [Rank 0] totoal_tokens=3073, outputs='Brake calipers'
[2024-08-03 15:13:40] [Rank 2] totoal_tokens=3116, outputs='Hike'
[2024-08-03 15:13:40] [Rank 3] totoal_tokens=3274, outputs='Jupiter'
[2024-08-03 15:13:40] [Rank 1] totoal_tokens=3077, outputs='Yoga class'
[2024-08-03 15:13:40] [Rank 2] totoal_tokens=3139, outputs='Region C'
[2024-08-03 15:13:40] [Rank 0] totoal_tokens=3082, outputs='ZARA'
[2024-08-03 15:13:40] [Rank 3] totoal_tokens=3316, outputs='The roof'
[2024-08-03 15:13:41] [Rank 0] totoal_tokens=3082, outputs='dessert'
[2024-08-03 15:13:41] [Rank 1] totoal_tokens=3081, outputs='At the end of the day'
[2024-08-03 15:13:41] [Rank 2] totoal_tokens=3141, outputs='The software is launched to the public'
[2024-08-03 15:13:41] [Rank 3] totoal_tokens=3382, outputs='B.B. King'
[2024-08-03 15:13:41] [Rank 0] totoal_tokens=3085, outputs='Decorations and plants'
[2024-08-03 15:13:41] [Rank 2] totoal_tokens=3173, outputs='Product B'
[2024-08-03 15:13:41] [Rank 1] totoal_tokens=3083, outputs='At the end of the day'
[2024-08-03 15:13:41] [Rank 0] totoal_tokens=3098, outputs='Owl'
[2024-08-03 15:13:41] [Rank 2] totoal_tokens=3225, outputs='Liu'
[2024-08-03 15:13:41] [Rank 3] totoal_tokens=3413, outputs='The product is launched to the public.'
[2024-08-03 15:13:41] [Rank 1] totoal_tokens=3098, outputs='Region X'
[2024-08-03 15:13:41] [Rank 0] totoal_tokens=3101, outputs='Lucas'
[2024-08-03 15:13:41] [Rank 2] totoal_tokens=3227, outputs='The sales team'
[2024-08-03 15:13:41] [Rank 1] totoal_tokens=3183, outputs='Charlie'
[2024-08-03 15:13:41] [Rank 3] totoal_tokens=3423, outputs='Diet Plan C'
[2024-08-03 15:13:42] [Rank 2] totoal_tokens=3318, outputs='Dr. Liu'
[2024-08-03 15:13:42] [Rank 1] totoal_tokens=3240, outputs='Tara'
[2024-08-03 15:13:42] [Rank 0] totoal_tokens=3102, outputs='Norah Diana Maude'
[2024-08-03 15:13:42] [Rank 3] totoal_tokens=3427, outputs='Cactus'
[2024-08-03 15:13:42] [Rank 1] totoal_tokens=3252, outputs='Winter'
[2024-08-03 15:13:42] [Rank 2] totoal_tokens=3452, outputs='lunch'
[2024-08-03 15:13:42] [Rank 0] totoal_tokens=3156, outputs='Painting the model'
[2024-08-03 15:13:42] [Rank 3] totoal_tokens=3434, outputs='Diet Plan C'
[2024-08-03 15:13:42] [Rank 2] totoal_tokens=3475, outputs='Store B'
[2024-08-03 15:13:42] [Rank 1] totoal_tokens=3336, outputs='tulip'
[2024-08-03 15:13:42] [Rank 0] totoal_tokens=3167, outputs='Charlie'
[2024-08-03 15:13:42] [Rank 3] totoal_tokens=3442, outputs='Impressionist'
[2024-08-03 15:13:42] [Rank 0] totoal_tokens=3175, outputs='Store B'
-text-test.jsonl: 19%|β–ˆβ–‰ | 142/751 [00:25<02:02, 4.95it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 19%|β–ˆβ–‰ | 143/751 [00:25<02:13, 4.56it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 19%|β–ˆβ–‰ | 144/751 [00:26<02:22, 4.27it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 19%|β–ˆβ–‰ | 145/751 [00:26<02:17, 4.40it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 19%|β–ˆβ–‰ | 146/751 [00:26<02:23, 4.22it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 20%|β–ˆβ–‰ | 147/751 [00:26<02:18, 4.38it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 20%|β–ˆβ–‰ | 148/751 [00:27<02:15, 4.43it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 20%|β–ˆβ–‰ | 149/751 [00:27<02:23, 4.19it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 20%|β–ˆβ–‰ | 150/751 [00:27<02:23, 4.18it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 20%|β–ˆβ–ˆ | 151/751 [00:27<02:14, 4.47it/s] Process[2024-08-03 15:13:42] [Rank 2] totoal_tokens=3494, outputs='Z'
[2024-08-03 15:13:42] [Rank 1] totoal_tokens=3388, outputs='Initiative Gamma'
[2024-08-03 15:13:42] [Rank 3] totoal_tokens=3485, outputs='Bob'
[2024-08-03 15:13:42] [Rank 0] totoal_tokens=3247, outputs='Cara'
[2024-08-03 15:13:43] [Rank 1] totoal_tokens=3412, outputs='Company X'
[2024-08-03 15:13:43] [Rank 2] totoal_tokens=3506, outputs='She takes a yoga class.'
[2024-08-03 15:13:43] [Rank 3] totoal_tokens=3492, outputs='Eat at the casino'
[2024-08-03 15:13:43] [Rank 0] totoal_tokens=3299, outputs='Next to the couch'
[2024-08-03 15:13:43] [Rank 1] totoal_tokens=3426, outputs='lunch'
[2024-08-03 15:13:43] [Rank 2] totoal_tokens=3567, outputs='parade'
[2024-08-03 15:13:43] [Rank 0] totoal_tokens=3324, outputs='Impressionist'
[2024-08-03 15:13:43] [Rank 2] totoal_tokens=3617, outputs='Earth'
[2024-08-03 15:13:43] [Rank 3] totoal_tokens=3566, outputs='Chocolate cake'
[2024-08-03 15:13:43] [Rank 1] totoal_tokens=3473, outputs='Rome'
[2024-08-03 15:13:43] [Rank 0] totoal_tokens=3404, outputs='Jogging'
[2024-08-03 15:13:43] [Rank 2] totoal_tokens=3622, outputs='Initiative Gamma'
[2024-08-03 15:13:43] [Rank 3] totoal_tokens=3595, outputs='Volunteer'
[2024-08-03 15:13:43] [Rank 1] totoal_tokens=3629, outputs='The QA team'
[2024-08-03 15:13:43] [Rank 0] totoal_tokens=3416, outputs='Under the magazines'
[2024-08-03 15:13:43] [Rank 2] totoal_tokens=3623, outputs='Store Z'
[2024-08-03 15:13:43] [Rank 3] totoal_tokens=3798, outputs='Caucasian'
[2024-08-03 15:13:44] [Rank 1] totoal_tokens=3665, outputs='Paint'
[2024-08-03 15:13:44] [Rank 0] totoal_tokens=3427, outputs='Store Z'
[2024-08-03 15:13:44] [Rank 2] totoal_tokens=3633, outputs='brush his teeth'
[2024-08-03 15:13:44] [Rank 3] totoal_tokens=3799, outputs='orange'
[2024-08-03 15:13:44] [Rank 1] totoal_tokens=3665, outputs='Dr. Allen'
[2024-08-03 15:13:44] [Rank 0] totoal_tokens=3432, outputs='Diet Plan C'
[2024-08-03 15:13:44] [Rank 2] totoal_tokens=3638, outputs='Renaissance'
[2024-08-03 15:13:44] [Rank 1] totoal_tokens=3667, outputs='Diet Plan C'
[2024-08-03 15:13:44] [Rank 0] totoal_tokens=3442, outputs='Dragonfly'
[2024-08-03 15:13:44] [Rank 3] totoal_tokens=3830, outputs='Computers and other tech equipment are set up'
[2024-08-03 15:13:44] [Rank 2] totoal_tokens=3660, outputs='Liam'
[2024-08-03 15:13:44] [Rank 1] totoal_tokens=3670, outputs='Store B'
[2024-08-03 15:13:44] [Rank 3] totoal_tokens=3918, outputs='Invest C'
[2024-08-03 15:13:44] [Rank 0] totoal_tokens=3484, outputs='Impressionist'
ing InternVL2-2B_reasoning-text-test.jsonl: 20%|β–ˆβ–ˆ | 152/751 [00:28<02:12, 4.53it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 20%|β–ˆβ–ˆ | 153/751 [00:28<02:09, 4.62it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 21%|β–ˆβ–ˆ | 154/751 [00:28<02:14, 4.43it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 21%|β–ˆβ–ˆ | 155/751 [00:28<02:18, 4.32it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 21%|β–ˆβ–ˆ | 156/751 [00:28<02:16, 4.36it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 21%|β–ˆβ–ˆ | 157/751 [00:29<02:15, 4.38it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 21%|β–ˆβ–ˆ | 158/751 [00:29<02:12, 4.49it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 21%|β–ˆβ–ˆ | 159/751 [00:29<02:21, 4.17it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 21%|β–ˆβ–ˆβ– | 160/751 [00:29<02:17, 4.29it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 21%|β–ˆβ–ˆβ– | 161/751 [0[2024-08-03 15:13:44] [Rank 2] totoal_tokens=3663, outputs='Tan France'
[2024-08-03 15:13:45] [Rank 0] totoal_tokens=3540, outputs='Region C'
[2024-08-03 15:13:45] [Rank 3] totoal_tokens=3998, outputs='Cara'
[2024-08-03 15:13:45] [Rank 1] totoal_tokens=3671, outputs='Get ice cream'
[2024-08-03 15:13:45] [Rank 2] totoal_tokens=3675, outputs='NET-A-PORTER'
[2024-08-03 15:13:45] [Rank 0] totoal_tokens=3673, outputs='Frank'
[2024-08-03 15:13:45] [Rank 1] totoal_tokens=3795, outputs='Alice'
[2024-08-03 15:13:45] [Rank 3] totoal_tokens=4031, outputs='Banana'
[2024-08-03 15:13:45] [Rank 2] totoal_tokens=3687, outputs='Company X'
[2024-08-03 15:13:45] [Rank 1] totoal_tokens=3799, outputs='Charlie'
[2024-08-03 15:13:45] [Rank 0] totoal_tokens=3868, outputs='Next to the couch'
[2024-08-03 15:13:45] [Rank 2] totoal_tokens=3800, outputs='Cara'
[2024-08-03 15:13:45] [Rank 1] totoal_tokens=3868, outputs='MW'
[2024-08-03 15:13:45] [Rank 3] totoal_tokens=4161, outputs='Impressionist'
[2024-08-03 15:13:45] [Rank 0] totoal_tokens=3884, outputs='Store Z'
[2024-08-03 15:13:45] [Rank 2] totoal_tokens=3859, outputs='Charlie'
[2024-08-03 15:13:45] [Rank 1] totoal_tokens=3930, outputs='Crow'
[2024-08-03 15:13:45] [Rank 0] totoal_tokens=3901, outputs='Editing'
[2024-08-03 15:13:46] [Rank 2] totoal_tokens=3925, outputs='Pack their school bag'
[2024-08-03 15:13:46] [Rank 1] totoal_tokens=3974, outputs='Project Alpha'
[2024-08-03 15:13:46] [Rank 0] totoal_tokens=3904, outputs='Bob'
[2024-08-03 15:13:46] [Rank 3] totoal_tokens=4162, outputs='The software is first developed and coded by the tech team.'
[2024-08-03 15:13:46] [Rank 2] totoal_tokens=3927, outputs='yoga'
[2024-08-03 15:13:46] [Rank 1] totoal_tokens=3984, outputs='Dr. Allen'
[2024-08-03 15:13:46] [Rank 0] totoal_tokens=4035, outputs='Mark'
[2024-08-03 15:13:46] [Rank 3] totoal_tokens=4169, outputs='dragonfly'
[2024-08-03 15:13:46] [Rank 2] totoal_tokens=3965, outputs='Store X'
[2024-08-03 15:13:46] [Rank 1] totoal_tokens=3987, outputs='Linda'
[2024-08-03 15:13:46] [Rank 0] totoal_tokens=4098, outputs='Liam'
[2024-08-03 15:13:46] [Rank 3] totoal_tokens=4196, outputs='pine'
[2024-08-03 15:13:46] [Rank 1] totoal_tokens=4092, outputs='Alpha'
[2024-08-03 15:13:46] [Rank 2] totoal_tokens=4032, outputs='Dr. Johnson'
[2024-08-03 15:13:46] [Rank 0] totoal_tokens=4110, outputs='Charlie'
[2024-08-03 15:13:47] [Rank 3] totoal_tokens=4212, outputs='Bleach Realm Death Battle'
[2024-08-03 15:13:47] [Rank 1] totoal_tokens=4096, outputs='a chocolate cake'
[2024-08-03 15:13:47] [Rank 0] totoal_tokens=4154, outputs='Christa Yona'
0:30<02:19, 4.21it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 22%|β–ˆβ–ˆβ– | 162/751 [00:30<02:15, 4.35it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 22%|β–ˆβ–ˆβ– | 163/751 [00:30<02:08, 4.57it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 22%|β–ˆβ–ˆβ– | 164/751 [00:30<02:19, 4.22it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 22%|β–ˆβ–ˆβ– | 165/751 [00:31<02:29, 3.92it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 22%|β–ˆβ–ˆβ– | 166/751 [00:31<02:21, 4.13it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 22%|β–ˆβ–ˆβ– | 167/751 [00:31<02:14, 4.33it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 22%|β–ˆβ–ˆβ– | 168/751 [00:31<02:11, 4.43it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 23%|β–ˆβ–ˆβ–Ž | 169/751 [00:31<02:18, 4.22it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 23%|β–ˆβ–ˆβ–Ž | 170/751 [00:32<02:17, 4.21it/s] Processing InternVL2-2B_reasoning-text[2024-08-03 15:13:47] [Rank 3] totoal_tokens=4214, outputs='Jupiter'
[2024-08-03 15:13:47] [Rank 2] totoal_tokens=4034, outputs='At the end of the day'
[2024-08-03 15:13:47] [Rank 0] totoal_tokens=4221, outputs='Store Z'
[2024-08-03 15:13:47] [Rank 1] totoal_tokens=4131, outputs='Walls'
[2024-08-03 15:13:47] [Rank 3] totoal_tokens=4248, outputs='Get her backpack'
[2024-08-03 15:13:47] [Rank 2] totoal_tokens=4035, outputs='dessert'
[2024-08-03 15:13:47] [Rank 0] totoal_tokens=4244, outputs='Butler'
[2024-08-03 15:13:47] [Rank 1] totoal_tokens=4198, outputs='Product B'
[2024-08-03 15:13:47] [Rank 3] totoal_tokens=4273, outputs='owl'
[2024-08-03 15:13:47] [Rank 2] totoal_tokens=4042, outputs='Dr. Allen'
[2024-08-03 15:13:47] [Rank 0] totoal_tokens=4251, outputs='Mars'
[2024-08-03 15:13:48] [Rank 3] totoal_tokens=4453, outputs='meeting'
[2024-08-03 15:13:48] [Rank 1] totoal_tokens=4211, outputs='Harrison Ford'
[2024-08-03 15:13:48] [Rank 2] totoal_tokens=4192, outputs='Charlie'
[2024-08-03 15:13:48] [Rank 0] totoal_tokens=4383, outputs='Tara'
[2024-08-03 15:13:48] [Rank 3] totoal_tokens=4454, outputs='pine tree'
[2024-08-03 15:13:48] [Rank 1] totoal_tokens=4212, outputs='work in the office'
[2024-08-03 15:13:48] [Rank 2] totoal_tokens=4219, outputs='Get bus to school'
[2024-08-03 15:13:48] [Rank 0] totoal_tokens=4408, outputs='Dr. Smith'
[2024-08-03 15:13:48] [Rank 1] totoal_tokens=4221, outputs='Wash his face'
[2024-08-03 15:13:48] [Rank 3] totoal_tokens=4502, outputs='At the end of the day'
[2024-08-03 15:13:48] [Rank 2] totoal_tokens=4220, outputs='dragonfly'
[2024-08-03 15:13:48] [Rank 0] totoal_tokens=4529, outputs='Route X'
[2024-08-03 15:13:48] [Rank 2] totoal_tokens=4348, outputs='Rome'
[2024-08-03 15:13:48] [Rank 3] totoal_tokens=4506, outputs='Dr. Johnson'
[2024-08-03 15:13:49] [Rank 1] totoal_tokens=4268, outputs='Feast on the Tea & Talk Table'
[2024-08-03 15:13:49] [Rank 0] totoal_tokens=4530, outputs='Aloe Vera'
[2024-08-03 15:13:49] [Rank 3] totoal_tokens=4523, outputs='Store Z'
[2024-08-03 15:13:49] [Rank 2] totoal_tokens=4520, outputs='Joyce Meyer'
[2024-08-03 15:13:49] [Rank 1] totoal_tokens=4449, outputs='Bob'
[2024-08-03 15:13:49] [Rank 0] totoal_tokens=4607, outputs='Cara'
[2024-08-03 15:13:49] [Rank 3] totoal_tokens=4549, outputs='Cara'
[2024-08-03 15:13:49] [Rank 2] totoal_tokens=4521, outputs='Initiative Alpha'
[2024-08-03 15:13:49] [Rank 1] totoal_tokens=4502, outputs='Liam'
[2024-08-03 15:13:49] [Rank 0] totoal_tokens=4702, outputs='Frank'
-test.jsonl: 23%|β–ˆβ–ˆβ–Ž | 171/751 [00:32<02:36, 3.70it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 23%|β–ˆβ–ˆβ–Ž | 172/751 [00:32<02:30, 3.85it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 23%|β–ˆβ–ˆβ–Ž | 173/751 [00:33<02:26, 3.95it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 23%|β–ˆβ–ˆβ–Ž | 174/751 [00:33<02:26, 3.94it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 23%|β–ˆβ–ˆβ–Ž | 175/751 [00:33<02:24, 3.98it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 23%|β–ˆβ–ˆβ–Ž | 176/751 [00:33<02:25, 3.94it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 24%|β–ˆβ–ˆβ–Ž | 177/751 [00:34<02:27, 3.88it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 24%|β–ˆβ–ˆβ–Ž | 178/751 [00:34<02:37, 3.63it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 24%|β–ˆβ–ˆβ– | 179/751 [00:34<02:36, 3.66it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 24%|β–ˆβ–ˆβ– | 180/751 [00:34<02:33, 3.[2024-08-03 15:13:49] [Rank 3] totoal_tokens=4593, outputs='Next to the couch'
[2024-08-03 15:13:49] [Rank 2] totoal_tokens=4524, outputs='at the end of the day'
[2024-08-03 15:13:49] [Rank 1] totoal_tokens=4514, outputs='Jogging'
[2024-08-03 15:13:49] [Rank 0] totoal_tokens=4860, outputs='JΓ‘chym Topol'
[2024-08-03 15:13:50] [Rank 2] totoal_tokens=4524, outputs='oak'
[2024-08-03 15:13:50] [Rank 1] totoal_tokens=4517, outputs='Owl'
[2024-08-03 15:13:50] [Rank 3] totoal_tokens=4595, outputs='Charlie'
[2024-08-03 15:13:50] [Rank 0] totoal_tokens=4861, outputs='Region X'
[2024-08-03 15:13:50] [Rank 2] totoal_tokens=4525, outputs='X'
[2024-08-03 15:13:50] [Rank 1] totoal_tokens=4520, outputs='Roof'
[2024-08-03 15:13:50] [Rank 3] totoal_tokens=4597, outputs='owl'
[2024-08-03 15:13:50] [Rank 0] totoal_tokens=4903, outputs='Dr. Liu'
[2024-08-03 15:13:50] [Rank 1] totoal_tokens=4614, outputs='Kylian Mbappe'
[2024-08-03 15:13:50] [Rank 3] totoal_tokens=4755, outputs='At the end of the day'
[2024-08-03 15:13:50] [Rank 0] totoal_tokens=4922, outputs='Region B'
[2024-08-03 15:13:50] [Rank 2] totoal_tokens=4599, outputs='The trip to Ely was largely uneventful after that.'
[2024-08-03 15:13:51] [Rank 0] totoal_tokens=5060, outputs='Jupiter'
[2024-08-03 15:13:51] [Rank 3] totoal_tokens=4822, outputs='Dr. Carter'
[2024-08-03 15:13:51] [Rank 1] totoal_tokens=4618, outputs='dessert'
[2024-08-03 15:13:51] [Rank 2] totoal_tokens=4652, outputs='Elizabeth'
[2024-08-03 15:13:51] [Rank 0] totoal_tokens=5061, outputs='Charlie'
[2024-08-03 15:13:51] [Rank 3] totoal_tokens=4845, outputs='backyard'
[2024-08-03 15:13:51] [Rank 1] totoal_tokens=4628, outputs='Google'
[2024-08-03 15:13:51] [Rank 2] totoal_tokens=4802, outputs='Store Z'
[2024-08-03 15:13:51] [Rank 0] totoal_tokens=5066, outputs='pine tree'
[2024-08-03 15:13:51] [Rank 1] totoal_tokens=4665, outputs='Harry Potter'
[2024-08-03 15:13:51] [Rank 2] totoal_tokens=4848, outputs='Walls'
[2024-08-03 15:13:51] [Rank 3] totoal_tokens=4882, outputs='Next to the couch'
[2024-08-03 15:13:51] [Rank 0] totoal_tokens=5075, outputs='Diet Plan C'
[2024-08-03 15:13:51] [Rank 2] totoal_tokens=4851, outputs='Running'
[2024-08-03 15:13:51] [Rank 1] totoal_tokens=4801, outputs='Painting the model'
[2024-08-03 15:13:52] [Rank 3] totoal_tokens=4954, outputs='The Political Psychology of Democratic Citizenship'
[2024-08-03 15:13:52] [Rank 0] totoal_tokens=5102, outputs='Abbey Scholarship'
[2024-08-03 15:13:52] [Rank 1] totoal_tokens=4864, outputs='The test is passed'
[2024-08-03 15:13:52] [Rank 3] totoal_tokens=5002, outputs='Music'
[2024-08-03 15:13:52] [Rank 0] totoal_tokens=5105, outputs='oak'
73it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 24%|β–ˆβ–ˆβ– | 181/751 [00:35<02:53, 3.29it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 24%|β–ˆβ–ˆβ– | 182/751 [00:35<02:44, 3.47it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 24%|β–ˆβ–ˆβ– | 183/751 [00:35<02:45, 3.44it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 25%|β–ˆβ–ˆβ– | 184/751 [00:36<02:41, 3.51it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 25%|β–ˆβ–ˆβ– | 185/751 [00:36<02:38, 3.58it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 25%|β–ˆβ–ˆβ– | 186/751 [00:36<02:34, 3.66it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 25%|β–ˆβ–ˆβ– | 187/751 [00:36<02:31, 3.72it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 25%|β–ˆβ–ˆβ–Œ | 188/751 [00:37<02:38, 3.55it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 25%|β–ˆβ–ˆβ–Œ | 189/751 [00:37<02:44, 3.41it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 2[2024-08-03 15:13:52] [Rank 1] totoal_tokens=5001, outputs='Rome'
[2024-08-03 15:13:52] [Rank 2] totoal_tokens=5011, outputs='Automated monitoring of the activity on connected devices, their use and the traffic they'
[2024-08-03 15:13:52] [Rank 3] totoal_tokens=5057, outputs='Eat breakfast'
[2024-08-03 15:13:52] [Rank 0] totoal_tokens=5141, outputs='C'
[2024-08-03 15:13:52] [Rank 1] totoal_tokens=5034, outputs='Running'
[2024-08-03 15:13:52] [Rank 2] totoal_tokens=5089, outputs='cat'
[2024-08-03 15:13:52] [Rank 3] totoal_tokens=5072, outputs='Store B'
[2024-08-03 15:13:53] [Rank 1] totoal_tokens=5035, outputs='Madrid'
[2024-08-03 15:13:53] [Rank 2] totoal_tokens=5137, outputs='Store C'
[2024-08-03 15:13:53] [Rank 3] totoal_tokens=5090, outputs='dragonfly'
[2024-08-03 15:13:53] [Rank 0] totoal_tokens=5143, outputs='PATSCAN'
[2024-08-03 15:13:53] [Rank 1] totoal_tokens=5096, outputs='Bird'
[2024-08-03 15:13:53] [Rank 2] totoal_tokens=5271, outputs='Sue'
[2024-08-03 15:13:53] [Rank 3] totoal_tokens=5090, outputs='Bob'
[2024-08-03 15:13:53] [Rank 1] totoal_tokens=5129, outputs='dog'
[2024-08-03 15:13:53] [Rank 0] totoal_tokens=5200, outputs='Maria Ressa'
[2024-08-03 15:13:53] [Rank 2] totoal_tokens=5309, outputs='Impressionist'
[2024-08-03 15:13:54] [Rank 3] totoal_tokens=5126, outputs='The Florida Bar is now investigating Gaetz for witness tampering.'
[2024-08-03 15:13:54] [Rank 0] totoal_tokens=5265, outputs='Dr. Johnson'
[2024-08-03 15:13:54] [Rank 2] totoal_tokens=5380, outputs='dinner'
[2024-08-03 15:13:54] [Rank 1] totoal_tokens=5145, outputs='Route X'
[2024-08-03 15:13:54] [Rank 0] totoal_tokens=5360, outputs='Route X'
[2024-08-03 15:13:54] [Rank 3] totoal_tokens=5288, outputs='At the end of the day'
[2024-08-03 15:13:54] [Rank 2] totoal_tokens=5384, outputs='Car A'
[2024-08-03 15:13:54] [Rank 1] totoal_tokens=5205, outputs='Cacti'
[2024-08-03 15:13:54] [Rank 0] totoal_tokens=5388, outputs='Lunch'
[2024-08-03 15:13:54] [Rank 3] totoal_tokens=5320, outputs='John Wurdeman'
[2024-08-03 15:13:54] [Rank 2] totoal_tokens=5445, outputs='tulip'
[2024-08-03 15:13:55] [Rank 1] totoal_tokens=5305, outputs='Region X'
[2024-08-03 15:13:55] [Rank 0] totoal_tokens=5390, outputs='tulip'
[2024-08-03 15:13:55] [Rank 3] totoal_tokens=5384, outputs='Painting the model'
[2024-08-03 15:13:55] [Rank 2] totoal_tokens=5446, outputs='Painting the model'
[2024-08-03 15:13:55] [Rank 0] totoal_tokens=5395, outputs='dog'
[2024-08-03 15:13:55] [Rank 1] totoal_tokens=5322, outputs='Company Z'
[2024-08-03 15:13:55] [Rank 3] totoal_tokens=5408, outputs='Chocolate cake'
[2024-08-03 15:13:55] [Rank 2] totoal_tokens=5451, outputs='Bird'
[2024-08-03 15:13:55] [Rank 0] totoal_tokens=5405, outputs='Earth'
[2024-08-03 15:13:55] [Rank 2] totoal_tokens=5460, outputs='Running'
[2024-08-03 15:13:55] [Rank 3] totoal_tokens=5416, outputs='Diet Plan C'
[2024-08-03 15:13:55] [Rank 1] totoal_tokens=5382, outputs='Get watermelon from the other person'
[2024-08-03 15:13:56] [Rank 0] totoal_tokens=5476, outputs='They pack their school bag'
5%|β–ˆβ–ˆβ–Œ | 190/751 [00:37<02:39, 3.53it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 25%|β–ˆβ–ˆβ–Œ | 191/751 [00:38<02:32, 3.67it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 26%|β–ˆβ–ˆβ–Œ | 192/751 [00:38<03:21, 2.77it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 26%|β–ˆβ–ˆβ–Œ | 193/751 [00:39<03:50, 2.42it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 26%|β–ˆβ–ˆβ–Œ | 194/751 [00:39<03:30, 2.64it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 26%|β–ˆβ–ˆβ–Œ | 195/751 [00:39<03:15, 2.84it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 26%|β–ˆβ–ˆβ–Œ | 196/751 [00:40<03:15, 2.84it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 26%|β–ˆβ–ˆβ–Œ | 197/751 [00:40<03:11, 2.89it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 26%|β–ˆβ–ˆβ–‹ | 198/751 [00:40<03:01, 3.04it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 26%|β–ˆβ–ˆβ–‹ | 199/751 [00:40<02:51, 3.22it/s] Process[2024-08-03 15:13:56] [Rank 2] totoal_tokens=5507, outputs='Dragonfly'
[2024-08-03 15:13:56] [Rank 3] totoal_tokens=5462, outputs='A fireworks display'
[2024-08-03 15:13:56] [Rank 1] totoal_tokens=5456, outputs='Region X'
[2024-08-03 15:13:56] [Rank 0] totoal_tokens=5583, outputs='Bob'
[2024-08-03 15:13:56] [Rank 2] totoal_tokens=5587, outputs='Modern'
[2024-08-03 15:13:56] [Rank 3] totoal_tokens=5510, outputs='Tulip'
[2024-08-03 15:13:56] [Rank 1] totoal_tokens=5459, outputs='Donald Trump'
[2024-08-03 15:13:56] [Rank 0] totoal_tokens=5625, outputs='Jogging'
[2024-08-03 15:13:56] [Rank 2] totoal_tokens=5594, outputs='Alpha'
[2024-08-03 15:13:56] [Rank 1] totoal_tokens=5462, outputs='City'
[2024-08-03 15:13:56] [Rank 3] totoal_tokens=5527, outputs='Yoga'
[2024-08-03 15:13:57] [Rank 1] totoal_tokens=5471, outputs='Route X'
[2024-08-03 15:13:57] [Rank 3] totoal_tokens=5615, outputs='Company Z'
[2024-08-03 15:13:57] [Rank 2] totoal_tokens=5617, outputs='oak'
[2024-08-03 15:13:57] [Rank 0] totoal_tokens=5701, outputs='The world’s longest-running musical revue β€” known for its thin plot'
[2024-08-03 15:13:57] [Rank 1] totoal_tokens=5500, outputs='Madrid'
[2024-08-03 15:13:57] [Rank 3] totoal_tokens=5626, outputs='Product B'
[2024-08-03 15:13:57] [Rank 2] totoal_tokens=5631, outputs='Watched movie'
[2024-08-03 15:13:57] [Rank 1] totoal_tokens=5527, outputs='Route X'
[2024-08-03 15:13:57] [Rank 3] totoal_tokens=5710, outputs='Go for a walk'
[2024-08-03 15:13:57] [Rank 2] totoal_tokens=5683, outputs='A restaurant'
[2024-08-03 15:13:57] [Rank 0] totoal_tokens=5736, outputs='Bryan Maniotakis'
[2024-08-03 15:13:58] [Rank 1] totoal_tokens=5528, outputs='Company Y'
[2024-08-03 15:13:58] [Rank 2] totoal_tokens=5765, outputs='banana'
[2024-08-03 15:13:58] [Rank 0] totoal_tokens=5739, outputs='Dr. Johnson'
[2024-08-03 15:13:58] [Rank 3] totoal_tokens=5718, outputs='salad'
[2024-08-03 15:13:58] [Rank 1] totoal_tokens=5606, outputs='Route X'
[2024-08-03 15:13:58] [Rank 0] totoal_tokens=5769, outputs='Impressionist'
[2024-08-03 15:13:58] [Rank 3] totoal_tokens=5752, outputs='under the magazines'
[2024-08-03 15:13:58] [Rank 2] totoal_tokens=5786, outputs='Alice'
[2024-08-03 15:13:58] [Rank 3] totoal_tokens=5837, outputs='Cara'
[2024-08-03 15:13:59] [Rank 0] totoal_tokens=5787, outputs='Region X'
[2024-08-03 15:13:59] [Rank 1] totoal_tokens=5614, outputs='PATSCAN Platform'
[2024-08-03 15:13:59] [Rank 2] totoal_tokens=5850, outputs='The Backyard Scientist'
[2024-08-03 15:13:59] [Rank 3] totoal_tokens=5841, outputs='Get coffee'
[2024-08-03 15:13:59] [Rank 0] totoal_tokens=5858, outputs='Charlie'
[2024-08-03 15:13:59] [Rank 1] totoal_tokens=5620, outputs='Lunch'
[2024-08-03 15:13:59] [Rank 2] totoal_tokens=5915, outputs='Car A'
[2024-08-03 15:13:59] [Rank 3] totoal_tokens=5904, outputs='banana'
[2024-08-03 15:13:59] [Rank 1] totoal_tokens=5905, outputs='Alice'
[2024-08-03 15:13:59] [Rank 0] totoal_tokens=5938, outputs='Owl'
ing InternVL2-2B_reasoning-text-test.jsonl: 27%|β–ˆβ–ˆβ–‹ | 200/751 [00:41<03:05, 2.98it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 27%|β–ˆβ–ˆβ–‹ | 201/751 [00:41<03:03, 2.99it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 27%|β–ˆβ–ˆβ–‹ | 202/751 [00:42<03:01, 3.02it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 27%|β–ˆβ–ˆβ–‹ | 203/751 [00:42<03:58, 2.30it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 27%|β–ˆβ–ˆβ–‹ | 204/751 [00:43<04:24, 2.07it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 27%|β–ˆβ–ˆβ–‹ | 205/751 [00:43<04:00, 2.27it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 27%|β–ˆβ–ˆβ–‹ | 206/751 [00:43<03:43, 2.44it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 28%|β–ˆβ–ˆβ–Š | 207/751 [00:44<03:41, 2.45it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 28%|β–ˆβ–ˆβ–Š | 208/751 [00:44<03:31, 2.56it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 28%|β–ˆβ–ˆβ–Š [2024-08-03 15:13:59] [Rank 2] totoal_tokens=6021, outputs='tulip'
[2024-08-03 15:13:59] [Rank 3] totoal_tokens=5911, outputs='Cara'
[2024-08-03 15:14:00] [Rank 1] totoal_tokens=6019, outputs='Charlie'
[2024-08-03 15:14:00] [Rank 0] totoal_tokens=5946, outputs='Layla Rose Hanbury'
[2024-08-03 15:14:00] [Rank 2] totoal_tokens=6043, outputs='Walls'
[2024-08-03 15:14:00] [Rank 3] totoal_tokens=5961, outputs='tulip'
[2024-08-03 15:14:00] [Rank 1] totoal_tokens=6019, outputs='Painting the model'
[2024-08-03 15:14:00] [Rank 0] totoal_tokens=6024, outputs='watch movie'
[2024-08-03 15:14:00] [Rank 2] totoal_tokens=6043, outputs='Cactus'
[2024-08-03 15:14:00] [Rank 3] totoal_tokens=5963, outputs='Hike'
[2024-08-03 15:14:00] [Rank 1] totoal_tokens=6036, outputs='Mr. Obi'
[2024-08-03 15:14:00] [Rank 0] totoal_tokens=6050, outputs='Impressionist'
[2024-08-03 15:14:00] [Rank 2] totoal_tokens=6068, outputs='Go to bed'
[2024-08-03 15:14:00] [Rank 3] totoal_tokens=6057, outputs='Cummings'
[2024-08-03 15:14:01] [Rank 2] totoal_tokens=6074, outputs='owl'
[2024-08-03 15:14:01] [Rank 0] totoal_tokens=6081, outputs='Renaissance'
[2024-08-03 15:14:01] [Rank 3] totoal_tokens=6201, outputs='Impressionist'
[2024-08-03 15:14:01] [Rank 1] totoal_tokens=6041, outputs='The final step is to use the model to predict future extreme rainfall trends.'
[2024-08-03 15:14:01] [Rank 2] totoal_tokens=6076, outputs='Cacti'
[2024-08-03 15:14:01] [Rank 0] totoal_tokens=6083, outputs='Swimming'
[2024-08-03 15:14:01] [Rank 3] totoal_tokens=6402, outputs='Fireworks display'
[2024-08-03 15:14:01] [Rank 1] totoal_tokens=6047, outputs='tulip'
[2024-08-03 15:14:01] [Rank 2] totoal_tokens=6169, outputs='Car A'
[2024-08-03 15:14:02] [Rank 0] totoal_tokens=6107, outputs='Diet Plan C'
[2024-08-03 15:14:02] [Rank 3] totoal_tokens=6409, outputs='Coffee'
[2024-08-03 15:14:02] [Rank 1] totoal_tokens=6054, outputs='She takes a yoga class.'
[2024-08-03 15:14:02] [Rank 2] totoal_tokens=6258, outputs='Toilet paper'
[2024-08-03 15:14:02] [Rank 3] totoal_tokens=6534, outputs='The Five Obstructions'
[2024-08-03 15:14:02] [Rank 0] totoal_tokens=6121, outputs='John McAfee'
[2024-08-03 15:14:02] [Rank 1] totoal_tokens=6060, outputs='Get her backpack'
[2024-08-03 15:14:02] [Rank 2] totoal_tokens=6337, outputs='Cactus'
[2024-08-03 15:14:02] [Rank 3] totoal_tokens=6568, outputs='Initiative Alpha'
[2024-08-03 15:14:02] [Rank 0] totoal_tokens=6265, outputs='Nikky Finney'
[2024-08-03 15:14:03] [Rank 2] totoal_tokens=6447, outputs='The office is set up'
[2024-08-03 15:14:03] [Rank 3] totoal_tokens=6639, outputs='Liam'
[2024-08-03 15:14:03] [Rank 0] totoal_tokens=6335, outputs='Owl'
[2024-08-03 15:14:03] [Rank 1] totoal_tokens=6129, outputs='The festival ends with a closing performance by the St. Petersburg Symphony Orchestra.'
[2024-08-03 15:14:03] [Rank 3] totoal_tokens=6728, outputs='Company X'
[2024-08-03 15:14:03] [Rank 0] totoal_tokens=6389, outputs='Car B'
| 209/751 [00:45<03:32, 2.55it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 28%|β–ˆβ–ˆβ–Š | 210/751 [00:45<03:35, 2.51it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 28%|β–ˆβ–ˆβ–Š | 211/751 [00:45<03:18, 2.72it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 28%|β–ˆβ–ˆβ–Š | 212/751 [00:46<03:12, 2.80it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 28%|β–ˆβ–ˆβ–Š | 213/751 [00:46<03:23, 2.64it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 28%|β–ˆβ–ˆβ–Š | 214/751 [00:46<03:15, 2.74it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 29%|β–ˆβ–ˆβ–Š | 215/751 [00:47<03:21, 2.66it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 29%|β–ˆβ–ˆβ–‰ | 216/751 [00:47<03:33, 2.51it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 29%|β–ˆβ–ˆβ–‰ | 217/751 [00:48<03:35, 2.48it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 29%|β–ˆβ–ˆβ–‰ | 218/751 [00:48<03:23, 2.63it/s] Processing InternVL2-2[2024-08-03 15:14:03] [Rank 0] totoal_tokens=6401, outputs='Tulip'
[2024-08-03 15:14:03] [Rank 2] totoal_tokens=6510, outputs='fern'
[2024-08-03 15:14:03] [Rank 3] totoal_tokens=6735, outputs='at the end of the day'
[2024-08-03 15:14:04] [Rank 1] totoal_tokens=6203, outputs='adaptation into a 13 episode anime television series by Hal Film Maker in'
[2024-08-03 15:14:04] [Rank 2] totoal_tokens=6518, outputs='Google'
[2024-08-03 15:14:04] [Rank 1] totoal_tokens=6204, outputs='oak'
[2024-08-03 15:14:04] [Rank 3] totoal_tokens=6812, outputs='Cactus'
[2024-08-03 15:14:04] [Rank 0] totoal_tokens=6428, outputs='Charles C. Owen'
[2024-08-03 15:14:04] [Rank 1] totoal_tokens=6299, outputs='Walls'
[2024-08-03 15:14:04] [Rank 0] totoal_tokens=6527, outputs='Alice'
[2024-08-03 15:14:04] [Rank 2] totoal_tokens=6522, outputs='Cupcakes'
[2024-08-03 15:14:04] [Rank 3] totoal_tokens=6860, outputs='Next to it'
[2024-08-03 15:14:05] [Rank 1] totoal_tokens=6308, outputs='The sales team'
[2024-08-03 15:14:05] [Rank 0] totoal_tokens=6573, outputs='Store A'
[2024-08-03 15:14:05] [Rank 2] totoal_tokens=6534, outputs='Diet Plan C'
[2024-08-03 15:14:05] [Rank 3] totoal_tokens=6936, outputs='on top of the couch'
[2024-08-03 15:14:05] [Rank 0] totoal_tokens=6738, outputs='Diet Plan C'
[2024-08-03 15:14:05] [Rank 1] totoal_tokens=6441, outputs='Diet Plan B'
[2024-08-03 15:14:05] [Rank 3] totoal_tokens=6964, outputs='Rent'
[2024-08-03 15:14:05] [Rank 2] totoal_tokens=6566, outputs='Shannon'
[2024-08-03 15:14:05] [Rank 0] totoal_tokens=6842, outputs='Store Z'
[2024-08-03 15:14:06] [Rank 2] totoal_tokens=6723, outputs='Diet Plan C'
[2024-08-03 15:14:06] [Rank 3] totoal_tokens=6994, outputs='Painting the model'
[2024-08-03 15:14:06] [Rank 0] totoal_tokens=6888, outputs='The dessert'
[2024-08-03 15:14:06] [Rank 1] totoal_tokens=6493, outputs='Sarah'
[2024-08-03 15:14:06] [Rank 3] totoal_tokens=6998, outputs='Brush his teeth'
[2024-08-03 15:14:06] [Rank 2] totoal_tokens=6745, outputs='Diet Plan C'
[2024-08-03 15:14:06] [Rank 0] totoal_tokens=7040, outputs='Bob'
[2024-08-03 15:14:06] [Rank 1] totoal_tokens=6520, outputs='Earth'
[2024-08-03 15:14:06] [Rank 3] totoal_tokens=7042, outputs='Modern art'
[2024-08-03 15:14:06] [Rank 2] totoal_tokens=7026, outputs='Diet Plan C'
[2024-08-03 15:14:07] [Rank 0] totoal_tokens=7058, outputs='The family goes for a hike.'
B_reasoning-text-test.jsonl: 29%|β–ˆβ–ˆβ–‰ | 219/751 [00:48<03:15, 2.72it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 29%|β–ˆβ–ˆβ–‰ | 220/751 [00:49<03:13, 2.74it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 29%|β–ˆβ–ˆβ–‰ | 221/751 [00:49<03:38, 2.42it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 30%|β–ˆβ–ˆβ–‰ | 222/751 [00:50<03:24, 2.59it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 30%|β–ˆβ–ˆβ–‰ | 223/751 [00:50<03:19, 2.65it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 30%|β–ˆβ–ˆβ–‰ | 224/751 [00:50<03:20, 2.63it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 30%|β–ˆβ–ˆβ–‰ | 225/751 [00:51<03:12, 2.73it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 30%|β–ˆβ–ˆβ–ˆ | 226/751 [00:51<03:23, 2.58it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 30%|β–ˆβ–ˆβ–ˆ | 227/751 [00:51<03:12, 2.72it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 30%|β–ˆβ–ˆβ–ˆ | 228/751 [[2024-08-03 15:14:07] [Rank 1] totoal_tokens=6604, outputs='Region C'
[2024-08-03 15:14:07] [Rank 3] totoal_tokens=7082, outputs='Rome'
[2024-08-03 15:14:07] [Rank 2] totoal_tokens=7034, outputs='The awards are given out at the end of the day.'
[2024-08-03 15:14:07] [Rank 0] totoal_tokens=7088, outputs='Painting the model'
[2024-08-03 15:14:07] [Rank 1] totoal_tokens=6633, outputs='Butterfly'
[2024-08-03 15:14:07] [Rank 3] totoal_tokens=7108, outputs='Fireworks display'
[2024-08-03 15:14:07] [Rank 2] totoal_tokens=7054, outputs='Lunch'
[2024-08-03 15:14:07] [Rank 1] totoal_tokens=6866, outputs='Joe Biden'
[2024-08-03 15:14:07] [Rank 0] totoal_tokens=7144, outputs='Dessert'
[2024-08-03 15:14:08] [Rank 3] totoal_tokens=7263, outputs='at the end of the day'
[2024-08-03 15:14:08] [Rank 2] totoal_tokens=7258, outputs='Tim'
[2024-08-03 15:14:08] [Rank 1] totoal_tokens=6895, outputs='Alice'
[2024-08-03 15:14:08] [Rank 0] totoal_tokens=7189, outputs='The Outsider'
[2024-08-03 15:14:08] [Rank 3] totoal_tokens=7299, outputs='Watch TV'
[2024-08-03 15:14:08] [Rank 1] totoal_tokens=6992, outputs='Rome'
[2024-08-03 15:14:08] [Rank 2] totoal_tokens=7265, outputs='Animal Kingdom'
[2024-08-03 15:14:08] [Rank 0] totoal_tokens=7240, outputs='Citrus'
[2024-08-03 15:14:08] [Rank 1] totoal_tokens=7001, outputs='Tulip'
[2024-08-03 15:14:09] [Rank 3] totoal_tokens=7344, outputs='Alan Sirois'
[2024-08-03 15:14:09] [Rank 2] totoal_tokens=7351, outputs='Charlie'
[2024-08-03 15:14:09] [Rank 0] totoal_tokens=7265, outputs='Russia'
[2024-08-03 15:14:09] [Rank 2] totoal_tokens=7521, outputs='apple'
[2024-08-03 15:14:09] [Rank 3] totoal_tokens=7347, outputs='Cailtyn'
[2024-08-03 15:14:09] [Rank 1] totoal_tokens=7003, outputs='The first person to travel is the man in the suit.'
[2024-08-03 15:14:09] [Rank 0] totoal_tokens=7283, outputs='Painting the model is the final step.'
[2024-08-03 15:14:09] [Rank 2] totoal_tokens=7562, outputs='Painting the model'
[2024-08-03 15:14:10] [Rank 1] totoal_tokens=7020, outputs='Revenge'
[2024-08-03 15:14:10] [Rank 3] totoal_tokens=7379, outputs='Joe Pastry'
[2024-08-03 15:14:10] [Rank 0] totoal_tokens=7496, outputs='Hike'
[2024-08-03 15:14:10] [Rank 2] totoal_tokens=7585, outputs='Dragonfly'
[2024-08-03 15:14:10] [Rank 1] totoal_tokens=7107, outputs='Cactus'
[2024-08-03 15:14:10] [Rank 3] totoal_tokens=7488, outputs='The northeast'
[2024-08-03 15:14:10] [Rank 0] totoal_tokens=7591, outputs='Elon Musk'
[2024-08-03 15:14:10] [Rank 2] totoal_tokens=7591, outputs='pine tree'
[2024-08-03 15:14:10] [Rank 1] totoal_tokens=7141, outputs='Queen'
[2024-08-03 15:14:10] [Rank 3] totoal_tokens=7496, outputs='Maple'
[2024-08-03 15:14:11] [Rank 0] totoal_tokens=7796, outputs='The fern'
[2024-08-03 15:14:11] [Rank 3] totoal_tokens=7523, outputs='dessert'
[2024-08-03 15:14:11] [Rank 1] totoal_tokens=7318, outputs='ZoΓ«'
[2024-08-03 15:14:11] [Rank 2] totoal_tokens=7625, outputs='at the end of the day'
[2024-08-03 15:14:11] [Rank 0] totoal_tokens=7942, outputs='Cara'
00:52<03:30, 2.49it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 30%|β–ˆβ–ˆβ–ˆ | 229/751 [00:52<03:31, 2.46it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 31%|β–ˆβ–ˆβ–ˆ | 230/751 [00:53<03:35, 2.42it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 31%|β–ˆβ–ˆβ–ˆ | 231/751 [00:53<03:42, 2.34it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 31%|β–ˆβ–ˆβ–ˆ | 232/751 [00:54<03:44, 2.31it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 31%|β–ˆβ–ˆβ–ˆ | 233/751 [00:54<03:31, 2.45it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 31%|β–ˆβ–ˆβ–ˆ | 234/751 [00:55<03:57, 2.18it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 31%|β–ˆβ–ˆβ–ˆβ– | 235/751 [00:55<03:51, 2.23it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 31%|β–ˆβ–ˆβ–ˆβ– | 236/751 [00:55<03:55, 2.19it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 32%|β–ˆβ–ˆβ–ˆβ– | 237/751 [00:56<03:46, 2.27it/s] Processing InternVL2-2B_reasoni[2024-08-03 15:14:11] [Rank 1] totoal_tokens=7344, outputs='Madrid'
[2024-08-03 15:14:11] [Rank 3] totoal_tokens=7589, outputs='Store B'
[2024-08-03 15:14:11] [Rank 2] totoal_tokens=7695, outputs='Company X'
[2024-08-03 15:14:11] [Rank 0] totoal_tokens=8057, outputs='Company Z'
[2024-08-03 15:14:12] [Rank 1] totoal_tokens=7514, outputs='Walls'
[2024-08-03 15:14:12] [Rank 3] totoal_tokens=7702, outputs='Jupiter'
[2024-08-03 15:14:12] [Rank 2] totoal_tokens=7727, outputs='Rome'
[2024-08-03 15:14:12] [Rank 0] totoal_tokens=8066, outputs='He will stop doing the little things'
[2024-08-03 15:14:12] [Rank 1] totoal_tokens=7516, outputs='Walls'
[2024-08-03 15:14:12] [Rank 3] totoal_tokens=7704, outputs='Dr. Liu'
[2024-08-03 15:14:12] [Rank 2] totoal_tokens=7744, outputs='Chocolate cake'
[2024-08-03 15:14:12] [Rank 0] totoal_tokens=8076, outputs='Route X'
[2024-08-03 15:14:13] [Rank 1] totoal_tokens=7576, outputs='Ellen'
[2024-08-03 15:14:13] [Rank 3] totoal_tokens=7731, outputs='brushes his teeth'
[2024-08-03 15:14:13] [Rank 2] totoal_tokens=7881, outputs='Salad'
[2024-08-03 15:14:13] [Rank 0] totoal_tokens=8182, outputs='Chocolate cake'
[2024-08-03 15:14:13] [Rank 3] totoal_tokens=7761, outputs='C'
[2024-08-03 15:14:13] [Rank 1] totoal_tokens=7748, outputs='on top of table'
[2024-08-03 15:14:13] [Rank 2] totoal_tokens=7882, outputs='The student eats breakfast.'
[2024-08-03 15:14:13] [Rank 0] totoal_tokens=8222, outputs='Region C'
[2024-08-03 15:14:13] [Rank 3] totoal_tokens=7907, outputs='C'
[2024-08-03 15:14:14] [Rank 2] totoal_tokens=7926, outputs='Region X'
[2024-08-03 15:14:14] [Rank 1] totoal_tokens=7770, outputs='Sue'
[2024-08-03 15:14:14] [Rank 3] totoal_tokens=7945, outputs='ice cream'
[2024-08-03 15:14:14] [Rank 0] totoal_tokens=8234, outputs='Painting the model'
[2024-08-03 15:14:14] [Rank 2] totoal_tokens=7939, outputs='Earth'
[2024-08-03 15:14:14] [Rank 3] totoal_tokens=7980, outputs='Label'
[2024-08-03 15:14:14] [Rank 0] totoal_tokens=8242, outputs='Chocolate cake'
[2024-08-03 15:14:15] [Rank 1] totoal_tokens=7798, outputs='Alpha'
[2024-08-03 15:14:15] [Rank 2] totoal_tokens=8007, outputs='Lunch'
[2024-08-03 15:14:15] [Rank 3] totoal_tokens=7980, outputs='salad'
[2024-08-03 15:14:15] [Rank 0] totoal_tokens=8253, outputs='Cactus'
[2024-08-03 15:14:15] [Rank 1] totoal_tokens=7800, outputs='Apple'
[2024-08-03 15:14:15] [Rank 2] totoal_tokens=8073, outputs='Car A'
[2024-08-03 15:14:15] [Rank 3] totoal_tokens=8110, outputs='The film is shown'
[2024-08-03 15:14:15] [Rank 1] totoal_tokens=7880, outputs='Tara'
[2024-08-03 15:14:15] [Rank 0] totoal_tokens=8279, outputs='Jane Riddle'
ng-text-test.jsonl: 32%|β–ˆβ–ˆβ–ˆβ– | 238/751 [00:56<03:50, 2.23it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 32%|β–ˆβ–ˆβ–ˆβ– | 239/751 [00:57<03:44, 2.28it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 32%|β–ˆβ–ˆβ–ˆβ– | 240/751 [00:57<04:03, 2.10it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 32%|β–ˆβ–ˆβ–ˆβ– | 241/751 [00:58<03:56, 2.15it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 32%|β–ˆβ–ˆβ–ˆβ– | 242/751 [00:58<04:02, 2.10it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 32%|β–ˆβ–ˆβ–ˆβ– | 243/751 [00:59<04:06, 2.06it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 32%|β–ˆβ–ˆβ–ˆβ– | 244/751 [00:59<04:07, 2.05it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 33%|β–ˆβ–ˆβ–ˆβ–Ž | 245/751 [01:00<04:04, 2.07it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 33%|β–ˆβ–ˆβ–ˆβ–Ž | 246/751 [01:00<03:56, 2.14it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 33%|β–ˆβ–ˆβ–ˆβ–Ž [2024-08-03 15:14:16] [Rank 2] totoal_tokens=8233, outputs='dessert'
[2024-08-03 15:14:16] [Rank 3] totoal_tokens=8238, outputs='Region C'
[2024-08-03 15:14:16] [Rank 1] totoal_tokens=7888, outputs='Watch TV'
[2024-08-03 15:14:16] [Rank 0] totoal_tokens=8334, outputs='watch movie'
[2024-08-03 15:14:16] [Rank 2] totoal_tokens=8249, outputs='Get her backpack'
[2024-08-03 15:14:16] [Rank 3] totoal_tokens=8242, outputs='Nandi'
[2024-08-03 15:14:16] [Rank 1] totoal_tokens=7889, outputs='John'
[2024-08-03 15:14:16] [Rank 0] totoal_tokens=8344, outputs='She works in the office after jogging.'
[2024-08-03 15:14:17] [Rank 3] totoal_tokens=8333, outputs='bee'
[2024-08-03 15:14:17] [Rank 2] totoal_tokens=8257, outputs='Owl'
[2024-08-03 15:14:17] [Rank 1] totoal_tokens=7936, outputs='Go for a hike'
[2024-08-03 15:14:17] [Rank 0] totoal_tokens=8362, outputs='Aloe Vera'
[2024-08-03 15:14:17] [Rank 3] totoal_tokens=8340, outputs='The parade'
[2024-08-03 15:14:17] [Rank 2] totoal_tokens=8367, outputs='Invest A'
[2024-08-03 15:14:17] [Rank 1] totoal_tokens=8039, outputs='Tom Arnold'
[2024-08-03 15:14:17] [Rank 3] totoal_tokens=8353, outputs='Tulip'
[2024-08-03 15:14:17] [Rank 0] totoal_tokens=8377, outputs='banana'
[2024-08-03 15:14:17] [Rank 2] totoal_tokens=8466, outputs='oak'
[2024-08-03 15:14:18] [Rank 3] totoal_tokens=8353, outputs='Tulip'
[2024-08-03 15:14:18] [Rank 0] totoal_tokens=8392, outputs='on top of the couch'
[2024-08-03 15:14:18] [Rank 1] totoal_tokens=8052, outputs='Lelouch is undeniably compelling, but Wendy is best Dragon Slayer'
[2024-08-03 15:14:18] [Rank 2] totoal_tokens=8477, outputs='Computers and other tech equipment'
[2024-08-03 15:14:18] [Rank 3] totoal_tokens=8359, outputs='B'
[2024-08-03 15:14:18] [Rank 1] totoal_tokens=8056, outputs='owl'
[2024-08-03 15:14:18] [Rank 0] totoal_tokens=8441, outputs='Cycling'
[2024-08-03 15:14:18] [Rank 2] totoal_tokens=8520, outputs='Store C'
[2024-08-03 15:14:19] [Rank 1] totoal_tokens=8083, outputs='Cactus'
[2024-08-03 15:14:19] [Rank 0] totoal_tokens=8450, outputs='banana'
[2024-08-03 15:14:19] [Rank 2] totoal_tokens=8562, outputs='Dr. Zhang'
[2024-08-03 15:14:19] [Rank 0] totoal_tokens=8500, outputs='backyard'
[2024-08-03 15:14:19] [Rank 1] totoal_tokens=8165, outputs='The product is launched to the public'
[2024-08-03 15:14:20] [Rank 2] totoal_tokens=8585, outputs='Sicario'
[2024-08-03 15:14:20] [Rank 3] totoal_tokens=8392, outputs='TT RS'
[2024-08-03 15:14:20] [Rank 0] totoal_tokens=8589, outputs='Impressionist'
[2024-08-03 15:14:20] [Rank 1] totoal_tokens=8218, outputs='The scientists were very surprised by what they saw.'
[2024-08-03 15:14:20] [Rank 2] totoal_tokens=8658, outputs='Eat breakfast'
[2024-08-03 15:14:20] [Rank 3] totoal_tokens=8463, outputs='The Panama Canal'
[2024-08-03 15:14:20] [Rank 0] totoal_tokens=8641, outputs='watch movie'
| 247/751 [01:01<04:12, 2.00it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 33%|β–ˆβ–ˆβ–ˆβ–Ž | 248/751 [01:01<04:00, 2.09it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 33%|β–ˆβ–ˆβ–ˆβ–Ž | 249/751 [01:02<04:15, 1.96it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 33%|β–ˆβ–ˆβ–ˆβ–Ž | 250/751 [01:02<04:09, 2.01it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 33%|β–ˆβ–ˆβ–ˆβ–Ž | 251/751 [01:03<04:04, 2.04it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 34%|β–ˆβ–ˆβ–ˆβ–Ž | 252/751 [01:03<04:08, 2.00it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 34%|β–ˆβ–ˆβ–ˆβ–Ž | 253/751 [01:04<03:57, 2.10it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 34%|β–ˆβ–ˆβ–ˆβ– | 254/751 [01:04<03:48, 2.17it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 34%|β–ˆβ–ˆβ–ˆβ– | 255/751 [01:05<03:51, 2.14it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 34%|β–ˆβ–ˆβ–ˆβ– | 256/751 [01:05<04:07, 2.00it/s] Processi[2024-08-03 15:14:21] [Rank 1] totoal_tokens=8269, outputs='Diet Plan C'
[2024-08-03 15:14:21] [Rank 3] totoal_tokens=8470, outputs='Lucy'
[2024-08-03 15:14:21] [Rank 2] totoal_tokens=8743, outputs='Diet Plan C'
[2024-08-03 15:14:21] [Rank 0] totoal_tokens=8693, outputs='Go to bed'
[2024-08-03 15:14:21] [Rank 1] totoal_tokens=8285, outputs='Painting the model'
[2024-08-03 15:14:21] [Rank 3] totoal_tokens=8764, outputs='Impressionist'
[2024-08-03 15:14:21] [Rank 2] totoal_tokens=8773, outputs='Dr. Zhang'
[2024-08-03 15:14:21] [Rank 0] totoal_tokens=8711, outputs='Get keys'
[2024-08-03 15:14:22] [Rank 3] totoal_tokens=8804, outputs='tulip'
[2024-08-03 15:14:22] [Rank 1] totoal_tokens=8319, outputs='Railway Relocation and Crossing Act'
[2024-08-03 15:14:22] [Rank 0] totoal_tokens=8873, outputs='smoke cannabis'
[2024-08-03 15:14:22] [Rank 2] totoal_tokens=8864, outputs='The product is launched to the public'
[2024-08-03 15:14:22] [Rank 1] totoal_tokens=8333, outputs='Joey'
[2024-08-03 15:14:22] [Rank 3] totoal_tokens=8831, outputs='Windex'
[2024-08-03 15:14:22] [Rank 0] totoal_tokens=8919, outputs='The walls'
[2024-08-03 15:14:22] [Rank 2] totoal_tokens=8894, outputs='Hastings Distillers'
[2024-08-03 15:14:22] [Rank 1] totoal_tokens=8353, outputs='June 1'
[2024-08-03 15:14:22] [Rank 3] totoal_tokens=8856, outputs='Tim Roth'
[2024-08-03 15:14:23] [Rank 0] totoal_tokens=8924, outputs='The fireworks display'
[2024-08-03 15:14:23] [Rank 2] totoal_tokens=8967, outputs='dog'
[2024-08-03 15:14:23] [Rank 1] totoal_tokens=8355, outputs='on top of the couch'
[2024-08-03 15:14:23] [Rank 0] totoal_tokens=8989, outputs='The daisy'
[2024-08-03 15:14:23] [Rank 3] totoal_tokens=8918, outputs='Install lights'
[2024-08-03 15:14:23] [Rank 2] totoal_tokens=8995, outputs='Walls'
[2024-08-03 15:14:23] [Rank 1] totoal_tokens=8367, outputs='brush his teeth'
[2024-08-03 15:14:24] [Rank 0] totoal_tokens=9030, outputs='Lysol'
[2024-08-03 15:14:24] [Rank 2] totoal_tokens=9031, outputs='Alice'
[2024-08-03 15:14:24] [Rank 3] totoal_tokens=8996, outputs='Region Y'
[2024-08-03 15:14:24] [Rank 0] totoal_tokens=9032, outputs='AngelList'
[2024-08-03 15:14:24] [Rank 3] totoal_tokens=8997, outputs='tulip'
[2024-08-03 15:14:24] [Rank 2] totoal_tokens=9043, outputs='Diet plan B'
[2024-08-03 15:14:25] [Rank 0] totoal_tokens=9042, outputs='Rome'
ng InternVL2-2B_reasoning-text-test.jsonl: 34%|β–ˆβ–ˆβ–ˆβ– | 257/751 [01:06<03:58, 2.07it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 34%|β–ˆβ–ˆβ–ˆβ– | 258/751 [01:06<03:58, 2.07it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 34%|β–ˆβ–ˆβ–ˆβ– | 259/751 [01:06<03:49, 2.15it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 35%|β–ˆβ–ˆβ–ˆβ– | 260/751 [01:07<03:54, 2.09it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 35%|β–ˆβ–ˆβ–ˆβ– | 261/751 [01:07<03:52, 2.11it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 35%|β–ˆβ–ˆβ–ˆβ– | 262/751 [01:08<03:53, 2.10it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 35%|β–ˆβ–ˆβ–ˆβ–Œ | 263/751 [01:08<03:54, 2.08it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 35%|β–ˆβ–ˆβ–ˆβ–Œ | 264/751 [01:09<03:53, 2.09it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 35%|β–ˆβ–ˆβ–ˆβ–Œ | 265/751 [01:09<03:58, 2.03it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: [2024-08-03 15:14:25] [Rank 3] totoal_tokens=9020, outputs='Banana'
[2024-08-03 15:14:25] [Rank 2] totoal_tokens=9063, outputs='Tulip'
[2024-08-03 15:14:25] [Rank 1] totoal_tokens=8398, outputs='Dr. Smith'
[2024-08-03 15:14:25] [Rank 0] totoal_tokens=9062, outputs='Impressionist'
[2024-08-03 15:14:25] [Rank 2] totoal_tokens=9070, outputs='Dog'
[2024-08-03 15:14:25] [Rank 3] totoal_tokens=9037, outputs='Windsor'
[2024-08-03 15:14:25] [Rank 1] totoal_tokens=8556, outputs='Get bus'
[2024-08-03 15:14:26] [Rank 0] totoal_tokens=9081, outputs='Walls'
[2024-08-03 15:14:26] [Rank 2] totoal_tokens=9071, outputs='left'
[2024-08-03 15:14:26] [Rank 3] totoal_tokens=9050, outputs='Diet Plan C'
[2024-08-03 15:14:26] [Rank 1] totoal_tokens=8631, outputs='Lucy Larcom'
[2024-08-03 15:14:26] [Rank 2] totoal_tokens=9091, outputs='Brent Burns'
[2024-08-03 15:14:26] [Rank 1] totoal_tokens=8824, outputs='Pizza'
[2024-08-03 15:14:27] [Rank 0] totoal_tokens=9112, outputs='A food company'
[2024-08-03 15:14:27] [Rank 3] totoal_tokens=9115, outputs='Sue'
[2024-08-03 15:14:27] [Rank 2] totoal_tokens=9160, outputs='Gidget'
[2024-08-03 15:14:27] [Rank 1] totoal_tokens=8851, outputs='The triathlon starts with swimming.'
[2024-08-03 15:14:27] [Rank 3] totoal_tokens=9168, outputs='Car A'
[2024-08-03 15:14:27] [Rank 0] totoal_tokens=9160, outputs='Walls'
[2024-08-03 15:14:27] [Rank 2] totoal_tokens=9171, outputs='Decorations and plants'
[2024-08-03 15:14:28] [Rank 1] totoal_tokens=8924, outputs='C'
[2024-08-03 15:14:28] [Rank 0] totoal_tokens=9164, outputs='The dessert'
[2024-08-03 15:14:28] [Rank 3] totoal_tokens=9185, outputs='The fieldbus is terminated'
[2024-08-03 15:14:28] [Rank 3] totoal_tokens=9196, outputs='Cactus'
[2024-08-03 15:14:29] [Rank 1] totoal_tokens=8930, outputs='The last thing prepared was a multiline text image.'
[2024-08-03 15:14:29] [Rank 0] totoal_tokens=9179, outputs='Buy property'
[2024-08-03 15:14:29] [Rank 2] totoal_tokens=9208, outputs='Caitlin Tierney'
[2024-08-03 15:14:29] [Rank 0] totoal_tokens=9307, outputs='Store Z'
[2024-08-03 15:14:29] [Rank 2] totoal_tokens=9231, outputs='Lysol'
[2024-08-03 15:14:29] [Rank 1] totoal_tokens=9027, outputs='next to the couch'
[2024-08-03 15:14:29] [Rank 3] totoal_tokens=9208, outputs='Lion'
[2024-08-03 15:14:30] [Rank 2] totoal_tokens=9330, outputs='dog'
[2024-08-03 15:14:30] [Rank 0] totoal_tokens=9355, outputs='Tamera Childers'
[2024-08-03 15:14:30] [Rank 1] totoal_tokens=9035, outputs='Dr. Johnson'
[2024-08-03 15:14:30] [Rank 3] totoal_tokens=9289, outputs='The fireworks display'
[2024-08-03 15:14:30] [Rank 2] totoal_tokens=9341, outputs='Get coffee'
[2024-08-03 15:14:30] [Rank 1] totoal_tokens=9041, outputs='Jackdaw Capital'
[2024-08-03 15:14:30] [Rank 0] totoal_tokens=9387, outputs='Mario Vargas Llosa'
35%|β–ˆβ–ˆβ–ˆβ–Œ | 266/751 [01:10<03:57, 2.05it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 36%|β–ˆβ–ˆβ–ˆβ–Œ | 267/751 [01:10<03:55, 2.05it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 36%|β–ˆβ–ˆβ–ˆβ–Œ | 268/751 [01:11<03:49, 2.11it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 36%|β–ˆβ–ˆβ–ˆβ–Œ | 269/751 [01:12<05:15, 1.53it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 36%|β–ˆβ–ˆβ–ˆβ–Œ | 270/751 [01:13<05:27, 1.47it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 36%|β–ˆβ–ˆβ–ˆβ–Œ | 271/751 [01:13<05:02, 1.59it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 36%|β–ˆβ–ˆβ–ˆβ–Œ | 272/751 [01:14<05:13, 1.53it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 36%|β–ˆβ–ˆβ–ˆβ–‹ | 273/751 [01:14<04:55, 1.62it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 36%|β–ˆβ–ˆβ–ˆβ–‹ | 274/751 [01:15<04:45, 1.67it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 37%|β–ˆβ–ˆβ–ˆβ–‹ | 275/751 [01:16<04:[2024-08-03 15:14:30] [Rank 3] totoal_tokens=9373, outputs='cactus'
[2024-08-03 15:14:31] [Rank 2] totoal_tokens=9460, outputs='Store B'
[2024-08-03 15:14:31] [Rank 0] totoal_tokens=9444, outputs='Adam Silver'
[2024-08-03 15:14:31] [Rank 1] totoal_tokens=9062, outputs='The test is done and the doctor reads the results.'
[2024-08-03 15:14:31] [Rank 2] totoal_tokens=9463, outputs='Editing'
[2024-08-03 15:14:31] [Rank 3] totoal_tokens=9464, outputs='Wash his hands'
[2024-08-03 15:14:31] [Rank 0] totoal_tokens=9476, outputs='Git'
[2024-08-03 15:14:31] [Rank 2] totoal_tokens=9464, outputs='The roof'
[2024-08-03 15:14:32] [Rank 1] totoal_tokens=9103, outputs='backyard'
[2024-08-03 15:14:32] [Rank 3] totoal_tokens=9607, outputs='Dessert'
[2024-08-03 15:14:32] [Rank 0] totoal_tokens=9586, outputs='Banana'
[2024-08-03 15:14:32] [Rank 2] totoal_tokens=9536, outputs='Walls'
[2024-08-03 15:14:32] [Rank 3] totoal_tokens=9636, outputs='Eat breakfast'
[2024-08-03 15:14:32] [Rank 1] totoal_tokens=9149, outputs='Patrick Stuart'
[2024-08-03 15:14:32] [Rank 0] totoal_tokens=9623, outputs='oak'
[2024-08-03 15:14:33] [Rank 2] totoal_tokens=9577, outputs='Orchid'
[2024-08-03 15:14:33] [Rank 3] totoal_tokens=9661, outputs='Sue'
[2024-08-03 15:14:33] [Rank 1] totoal_tokens=9311, outputs='Greentree'
[2024-08-03 15:14:33] [Rank 2] totoal_tokens=9588, outputs='Small Stuff'
[2024-08-03 15:14:33] [Rank 3] totoal_tokens=9717, outputs='Bleach'
[2024-08-03 15:14:33] [Rank 0] totoal_tokens=9657, outputs='annual'
[2024-08-03 15:14:33] [Rank 1] totoal_tokens=9322, outputs='The walls'
[2024-08-03 15:14:34] [Rank 2] totoal_tokens=9621, outputs='Cactus'
[2024-08-03 15:14:34] [Rank 3] totoal_tokens=9839, outputs='Alpha'
[2024-08-03 15:14:34] [Rank 1] totoal_tokens=9395, outputs='Impressionist'
[2024-08-03 15:14:34] [Rank 2] totoal_tokens=9641, outputs='Chocolate cake'
[2024-08-03 15:14:34] [Rank 0] totoal_tokens=9673, outputs='The final step in constructing the model is to add the final touches and make sure'
[2024-08-03 15:14:34] [Rank 3] totoal_tokens=9842, outputs='Liam'
[2024-08-03 15:14:35] [Rank 2] totoal_tokens=9679, outputs='Car A'
[2024-08-03 15:14:35] [Rank 1] totoal_tokens=9505, outputs='on top of the couch'
[2024-08-03 15:14:35] [Rank 0] totoal_tokens=9723, outputs='Daisy'
[2024-08-03 15:14:35] [Rank 3] totoal_tokens=9860, outputs='The Underground Railroad'
[2024-08-03 15:14:35] [Rank 0] totoal_tokens=10038, outputs='Lucy'
[2024-08-03 15:14:36] [Rank 1] totoal_tokens=9654, outputs='Bob'
[2024-08-03 15:14:36] [Rank 3] totoal_tokens=10080, outputs='A food blogger'
[2024-08-03 15:14:36] [Rank 0] totoal_tokens=10043, outputs='Liam'
[2024-08-03 15:14:36] [Rank 2] totoal_tokens=9718, outputs='The final step is to run the model and obtain the results.'
[2024-08-03 15:14:36] [Rank 1] totoal_tokens=9687, outputs='Macklemore'
[2024-08-03 15:14:36] [Rank 3] totoal_tokens=10195, outputs='What is done last when setting up the office?'
[2024-08-03 15:14:36] [Rank 2] totoal_tokens=9871, outputs='C'
[2024-08-03 15:14:37] [Rank 1] totoal_tokens=10041, outputs='watch the movie'
[2024-08-03 15:14:37] [Rank 0] totoal_tokens=10105, outputs='Asteroid'
47, 1.66it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 37%|β–ˆβ–ˆβ–ˆβ–‹ | 276/751 [01:16<04:27, 1.78it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 37%|β–ˆβ–ˆβ–ˆβ–‹ | 277/751 [01:17<04:25, 1.79it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 37%|β–ˆβ–ˆβ–ˆβ–‹ | 278/751 [01:17<04:34, 1.72it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 37%|β–ˆβ–ˆβ–ˆβ–‹ | 279/751 [01:18<04:21, 1.80it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 37%|β–ˆβ–ˆβ–ˆβ–‹ | 280/751 [01:19<05:14, 1.50it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 37%|β–ˆβ–ˆβ–ˆβ–‹ | 281/751 [01:19<05:36, 1.40it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 38%|β–ˆβ–ˆβ–ˆβ–Š | 282/751 [01:20<05:25, 1.44it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 38%|β–ˆβ–ˆβ–ˆβ–Š | 283/751 [01:21<05:01, 1.55it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 38%|β–ˆβ–ˆβ–ˆβ–Š | 284/751 [01:21<04:45, 1.64it/s] Processing InternVL2-2B_reas[2024-08-03 15:14:37] [Rank 2] totoal_tokens=10038, outputs='Bob'
[2024-08-03 15:14:37] [Rank 3] totoal_tokens=10225, outputs='The car is put on the road'
[2024-08-03 15:14:37] [Rank 1] totoal_tokens=10064, outputs='Store Z'
[2024-08-03 15:14:37] [Rank 0] totoal_tokens=10112, outputs='Store Z'
[2024-08-03 15:14:38] [Rank 3] totoal_tokens=10288, outputs='Wales'
[2024-08-03 15:14:38] [Rank 2] totoal_tokens=10096, outputs='The last event is running.'
[2024-08-03 15:14:38] [Rank 0] totoal_tokens=10188, outputs='Earth'
[2024-08-03 15:14:38] [Rank 1] totoal_tokens=10102, outputs='Get his backpack'
[2024-08-03 15:14:38] [Rank 3] totoal_tokens=10365, outputs='dog'
[2024-08-03 15:14:38] [Rank 2] totoal_tokens=10186, outputs='Eat breakfast'
[2024-08-03 15:14:39] [Rank 1] totoal_tokens=10106, outputs='Turn on computer'
[2024-08-03 15:14:39] [Rank 3] totoal_tokens=10367, outputs='Sunflower'
[2024-08-03 15:14:39] [Rank 0] totoal_tokens=10205, outputs='I will admit that I was not a Blaser fan until 2019.'
[2024-08-03 15:14:39] [Rank 2] totoal_tokens=10292, outputs='Car A'
[2024-08-03 15:14:39] [Rank 0] totoal_tokens=10220, outputs='The red book'
[2024-08-03 15:14:39] [Rank 3] totoal_tokens=10498, outputs='Nymphs'
[2024-08-03 15:14:39] [Rank 2] totoal_tokens=10293, outputs='President Obama'
[2024-08-03 15:14:39] [Rank 1] totoal_tokens=10190, outputs='brush his teeth'
[2024-08-03 15:14:40] [Rank 0] totoal_tokens=10348, outputs='Lunch'
[2024-08-03 15:14:40] [Rank 3] totoal_tokens=10574, outputs='Dr. Liu'
[2024-08-03 15:14:40] [Rank 2] totoal_tokens=10294, outputs='James Albert Wales'
[2024-08-03 15:14:40] [Rank 1] totoal_tokens=10201, outputs='Get her bag'
[2024-08-03 15:14:41] [Rank 0] totoal_tokens=10370, outputs='The parade'
[2024-08-03 15:14:41] [Rank 3] totoal_tokens=10770, outputs='Jupiter'
[2024-08-03 15:14:41] [Rank 2] totoal_tokens=10361, outputs='Maddy'
[2024-08-03 15:14:41] [Rank 1] totoal_tokens=10203, outputs='The walls'
[2024-08-03 15:14:41] [Rank 0] totoal_tokens=10377, outputs='Cara'
[2024-08-03 15:14:41] [Rank 2] totoal_tokens=10361, outputs='The game'
[2024-08-03 15:14:41] [Rank 3] totoal_tokens=10887, outputs='Watched movie'
[2024-08-03 15:14:41] [Rank 1] totoal_tokens=10207, outputs='Chamomile'
[2024-08-03 15:14:42] [Rank 0] totoal_tokens=10411, outputs='bee'
[2024-08-03 15:14:42] [Rank 2] totoal_tokens=10384, outputs='Lucy'
[2024-08-03 15:14:42] [Rank 3] totoal_tokens=10917, outputs='A child'
[2024-08-03 15:14:42] [Rank 1] totoal_tokens=10209, outputs='Cartel Rule #1'
[2024-08-03 15:14:42] [Rank 0] totoal_tokens=10412, outputs='Piplup'
oning-text-test.jsonl: 38%|β–ˆβ–ˆβ–ˆβ–Š | 285/751 [01:22<05:22, 1.44it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 38%|β–ˆβ–ˆβ–ˆβ–Š | 286/751 [01:23<04:59, 1.55it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 38%|β–ˆβ–ˆβ–ˆβ–Š | 287/751 [01:23<04:36, 1.68it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 38%|β–ˆβ–ˆβ–ˆβ–Š | 288/751 [01:24<05:20, 1.44it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 38%|β–ˆβ–ˆβ–ˆβ–Š | 289/751 [01:25<05:04, 1.52it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 39%|β–ˆβ–ˆβ–ˆβ–Š | 290/751 [01:25<05:18, 1.45it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 39%|β–ˆβ–ˆβ–ˆβ–Š | 291/751 [01:26<05:07, 1.50it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 39%|β–ˆβ–ˆβ–ˆβ–‰ | 292/751 [01:26<04:46, 1.60it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 39%|β–ˆβ–ˆβ–ˆβ–‰ | 293/751 [01:27<04:26, 1.72it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 39%|β–ˆβ–ˆβ–ˆβ–‰ [2024-08-03 15:14:42] [Rank 2] totoal_tokens=10414, outputs='Alice'
[2024-08-03 15:14:43] [Rank 1] totoal_tokens=10209, outputs='brush his teeth'
[2024-08-03 15:14:43] [Rank 3] totoal_tokens=11051, outputs='The International Education Center station'
[2024-08-03 15:14:43] [Rank 0] totoal_tokens=10491, outputs='She takes a yoga class.'
[2024-08-03 15:14:43] [Rank 2] totoal_tokens=10429, outputs='Pine Tree'
[2024-08-03 15:14:43] [Rank 3] totoal_tokens=11087, outputs='Cactus'
[2024-08-03 15:14:43] [Rank 1] totoal_tokens=10219, outputs='Diet Plan C'
[2024-08-03 15:14:43] [Rank 0] totoal_tokens=10640, outputs="The Hundred Years' War"
[2024-08-03 15:14:44] [Rank 2] totoal_tokens=10446, outputs='Ji Yong'
[2024-08-03 15:14:44] [Rank 3] totoal_tokens=11125, outputs='Jupiter'
[2024-08-03 15:14:44] [Rank 1] totoal_tokens=10331, outputs='on his lap'
[2024-08-03 15:14:44] [Rank 0] totoal_tokens=10648, outputs='Catch the bus'
[2024-08-03 15:14:44] [Rank 2] totoal_tokens=10539, outputs='Running'
[2024-08-03 15:14:44] [Rank 3] totoal_tokens=11127, outputs='Impressionist'
[2024-08-03 15:14:44] [Rank 1] totoal_tokens=10408, outputs='Gala'
[2024-08-03 15:14:45] [Rank 1] totoal_tokens=10565, outputs='Car A'
[2024-08-03 15:14:45] [Rank 0] totoal_tokens=10649, outputs="poppy's place"
[2024-08-03 15:14:45] [Rank 2] totoal_tokens=10563, outputs='Pesticide'
[2024-08-03 15:14:45] [Rank 3] totoal_tokens=11137, outputs='Hippolytus'
[2024-08-03 15:14:46] [Rank 1] totoal_tokens=10590, outputs='Get coffee'
[2024-08-03 15:14:46] [Rank 2] totoal_tokens=10587, outputs='The roof'
[2024-08-03 15:14:46] [Rank 0] totoal_tokens=10661, outputs='on top of the couch'
[2024-08-03 15:14:46] [Rank 3] totoal_tokens=11146, outputs='Patrick Schmitt'
[2024-08-03 15:14:46] [Rank 1] totoal_tokens=10747, outputs='Abbie'
[2024-08-03 15:14:46] [Rank 0] totoal_tokens=10748, outputs='The parade'
[2024-08-03 15:14:47] [Rank 3] totoal_tokens=11201, outputs='Rome'
[2024-08-03 15:14:47] [Rank 2] totoal_tokens=10642, outputs='The office is set up with a desk, computer, and other office supplies.'
[2024-08-03 15:14:47] [Rank 1] totoal_tokens=10782, outputs='The butterfly'
[2024-08-03 15:14:47] [Rank 0] totoal_tokens=10762, outputs='Investment C'
[2024-08-03 15:14:47] [Rank 2] totoal_tokens=10953, outputs='Bob'
[2024-08-03 15:14:47] [Rank 3] totoal_tokens=11209, outputs='Car A'
[2024-08-03 15:14:47] [Rank 1] totoal_tokens=10821, outputs='ice cream'
[2024-08-03 15:14:47] [Rank 0] totoal_tokens=10883, outputs='Editing'
[2024-08-03 15:14:48] [Rank 2] totoal_tokens=11052, outputs='daisy'
[2024-08-03 15:14:48] [Rank 1] totoal_tokens=10912, outputs='Angela Skaff'
[2024-08-03 15:14:48] [Rank 3] totoal_tokens=11252, outputs='Sales team'
[2024-08-03 15:14:48] [Rank 0] totoal_tokens=10886, outputs='Berlin'
[2024-08-03 15:14:48] [Rank 2] totoal_tokens=11097, outputs='Painting the model'
[2024-08-03 15:14:49] [Rank 1] totoal_tokens=11037, outputs='oak'
[2024-08-03 15:14:49] [Rank 3] totoal_tokens=11256, outputs='C'
[2024-08-03 15:14:49] [Rank 0] totoal_tokens=10952, outputs='Coffee'
| 294/751 [01:28<04:26, 1.72it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 39%|β–ˆβ–ˆβ–ˆβ–‰ | 295/751 [01:28<04:32, 1.68it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 39%|β–ˆβ–ˆβ–ˆβ–‰ | 296/751 [01:29<04:35, 1.65it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 40%|β–ˆβ–ˆβ–ˆβ–‰ | 297/751 [01:30<04:53, 1.55it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 40%|β–ˆβ–ˆβ–ˆβ–‰ | 298/751 [01:30<05:09, 1.46it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 40%|β–ˆβ–ˆβ–ˆβ–‰ | 299/751 [01:31<05:12, 1.44it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 40%|β–ˆβ–ˆβ–ˆβ–‰ | 300/751 [01:32<04:59, 1.51it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 40%|β–ˆβ–ˆβ–ˆβ–ˆ | 301/751 [01:32<04:43, 1.59it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 40%|β–ˆβ–ˆβ–ˆβ–ˆ | 302/751 [01:33<04:29, 1.67it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 40%|β–ˆβ–ˆβ–ˆβ–ˆ | 303/751 [01:34<04:59, 1.49it/s] Proce[2024-08-03 15:14:49] [Rank 3] totoal_tokens=11273, outputs='Company X'
[2024-08-03 15:14:49] [Rank 2] totoal_tokens=11106, outputs='The software is first developed and coded by the tech team.'
[2024-08-03 15:14:49] [Rank 1] totoal_tokens=11079, outputs='Lamington'
[2024-08-03 15:14:49] [Rank 0] totoal_tokens=11055, outputs='oak'
[2024-08-03 15:14:50] [Rank 3] totoal_tokens=11444, outputs='pine tree'
[2024-08-03 15:14:50] [Rank 0] totoal_tokens=11111, outputs='Store B'
[2024-08-03 15:14:50] [Rank 1] totoal_tokens=11100, outputs='Banana'
[2024-08-03 15:14:50] [Rank 2] totoal_tokens=11174, outputs='USC Kaufman School of Dance students follow their dreams to Broadway on a journey'
[2024-08-03 15:14:50] [Rank 0] totoal_tokens=11171, outputs='The Sunflower'
[2024-08-03 15:14:50] [Rank 3] totoal_tokens=11464, outputs='Dawn'
[2024-08-03 15:14:50] [Rank 1] totoal_tokens=11104, outputs='apple'
[2024-08-03 15:14:51] [Rank 2] totoal_tokens=11247, outputs='The walls'
[2024-08-03 15:14:51] [Rank 0] totoal_tokens=11450, outputs='orange'
[2024-08-03 15:14:51] [Rank 3] totoal_tokens=11551, outputs='Sales team'
[2024-08-03 15:14:51] [Rank 2] totoal_tokens=11248, outputs='John Simpson'
[2024-08-03 15:14:51] [Rank 1] totoal_tokens=11146, outputs='Cactus'
[2024-08-03 15:14:52] [Rank 0] totoal_tokens=11451, outputs='Store B'
[2024-08-03 15:14:52] [Rank 1] totoal_tokens=11173, outputs='Cactus'
[2024-08-03 15:14:52] [Rank 2] totoal_tokens=11251, outputs='MolΓ©culas'
[2024-08-03 15:14:52] [Rank 3] totoal_tokens=11561, outputs='Diet Plan A'
[2024-08-03 15:14:52] [Rank 1] totoal_tokens=11240, outputs='Salad'
[2024-08-03 15:14:52] [Rank 0] totoal_tokens=11517, outputs='Venture Capital'
[2024-08-03 15:14:53] [Rank 3] totoal_tokens=11643, outputs='Alice'
[2024-08-03 15:14:53] [Rank 2] totoal_tokens=11262, outputs='Computers and other tech equipment are set up.'
[2024-08-03 15:14:53] [Rank 1] totoal_tokens=11331, outputs='Dr. Johnson'
[2024-08-03 15:14:53] [Rank 0] totoal_tokens=11567, outputs='The new installation'
[2024-08-03 15:14:53] [Rank 3] totoal_tokens=11879, outputs='Chocolate cake'
[2024-08-03 15:14:53] [Rank 2] totoal_tokens=11274, outputs='Walls'
[2024-08-03 15:14:54] [Rank 1] totoal_tokens=11345, outputs='banana'
[2024-08-03 15:14:54] [Rank 0] totoal_tokens=11575, outputs='US drones: tools of modern warfare'
[2024-08-03 15:14:54] [Rank 2] totoal_tokens=11326, outputs='Shadrach'
[2024-08-03 15:14:54] [Rank 3] totoal_tokens=11965, outputs='watch the movie'
[2024-08-03 15:14:54] [Rank 1] totoal_tokens=11504, outputs='Junior managers'
[2024-08-03 15:14:54] [Rank 0] totoal_tokens=11589, outputs='X'
ssing InternVL2-2B_reasoning-text-test.jsonl: 40%|β–ˆβ–ˆβ–ˆβ–ˆ | 304/751 [01:34<04:41, 1.59it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 41%|β–ˆβ–ˆβ–ˆβ–ˆ | 305/751 [01:35<04:24, 1.69it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 41%|β–ˆβ–ˆβ–ˆβ–ˆ | 306/751 [01:35<04:19, 1.72it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 41%|β–ˆβ–ˆβ–ˆβ–ˆ | 307/751 [01:36<04:12, 1.76it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 41%|β–ˆβ–ˆβ–ˆβ–ˆ | 308/751 [01:36<04:08, 1.78it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 41%|β–ˆβ–ˆβ–ˆβ–ˆ | 309/751 [01:37<04:16, 1.72it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 41%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 310/751 [01:38<05:06, 1.44it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 41%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 311/751 [01:39<05:10, 1.42it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 42%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 312/751 [01:39<05:08, 1.42it/s] Processing InternVL2-2B_reasoning-text-tes[2024-08-03 15:14:55] [Rank 2] totoal_tokens=11408, outputs='Female'
[2024-08-03 15:14:55] [Rank 3] totoal_tokens=11983, outputs='oak'
[2024-08-03 15:14:55] [Rank 1] totoal_tokens=11534, outputs='The Space Marines'
[2024-08-03 15:14:55] [Rank 0] totoal_tokens=11622, outputs='Akurate'
[2024-08-03 15:14:55] [Rank 3] totoal_tokens=11985, outputs='Banana'
[2024-08-03 15:14:55] [Rank 2] totoal_tokens=11423, outputs='Birds of Prey'
[2024-08-03 15:14:56] [Rank 0] totoal_tokens=11648, outputs='oak'
[2024-08-03 15:14:56] [Rank 1] totoal_tokens=11570, outputs='The last event in the triathlon is running.'
[2024-08-03 15:14:56] [Rank 2] totoal_tokens=11440, outputs='Chocolate cake'
[2024-08-03 15:14:56] [Rank 0] totoal_tokens=11724, outputs='Store X'
[2024-08-03 15:14:57] [Rank 1] totoal_tokens=11651, outputs='Eat breakfast'
[2024-08-03 15:14:57] [Rank 3] totoal_tokens=12001, outputs='Boywonder'
[2024-08-03 15:14:57] [Rank 2] totoal_tokens=11507, outputs='brush his teeth'
[2024-08-03 15:14:57] [Rank 0] totoal_tokens=11805, outputs='Coffee break'
[2024-08-03 15:14:57] [Rank 1] totoal_tokens=11715, outputs='mallow'
[2024-08-03 15:14:57] [Rank 3] totoal_tokens=12019, outputs='Read'
[2024-08-03 15:14:58] [Rank 2] totoal_tokens=11577, outputs='Wash dishes'
[2024-08-03 15:14:58] [Rank 1] totoal_tokens=11955, outputs='Dr. Allen'
[2024-08-03 15:14:58] [Rank 0] totoal_tokens=11835, outputs='Editing'
[2024-08-03 15:14:58] [Rank 3] totoal_tokens=12172, outputs='Cactus'
[2024-08-03 15:14:58] [Rank 2] totoal_tokens=11604, outputs='Earth'
[2024-08-03 15:14:59] [Rank 1] totoal_tokens=12077, outputs='The parade'
[2024-08-03 15:14:59] [Rank 3] totoal_tokens=12200, outputs='The book'
[2024-08-03 15:14:59] [Rank 0] totoal_tokens=11893, outputs='Dr. Johnson'
[2024-08-03 15:14:59] [Rank 2] totoal_tokens=11621, outputs='Open door'
[2024-08-03 15:14:59] [Rank 0] totoal_tokens=11897, outputs='South'
[2024-08-03 15:15:00] [Rank 1] totoal_tokens=12082, outputs='Diet Plan C'
[2024-08-03 15:15:00] [Rank 2] totoal_tokens=11633, outputs='Linn Exaktbox 6'
[2024-08-03 15:15:00] [Rank 3] totoal_tokens=12228, outputs='Cara'
[2024-08-03 15:15:00] [Rank 0] totoal_tokens=11959, outputs='red book'
[2024-08-03 15:15:00] [Rank 1] totoal_tokens=12087, outputs='The firm is considered innovative.'
[2024-08-03 15:15:00] [Rank 2] totoal_tokens=11641, outputs='Alice'
[2024-08-03 15:15:01] [Rank 3] totoal_tokens=12274, outputs='Constitution Memorial Day'
[2024-08-03 15:15:01] [Rank 0] totoal_tokens=12104, outputs='Bird'
t.jsonl: 42%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 313/751 [01:40<04:45, 1.53it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 42%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 314/751 [01:40<04:50, 1.51it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 42%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 315/751 [01:41<04:51, 1.49it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 42%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 316/751 [01:42<04:42, 1.54it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 42%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 317/751 [01:42<04:33, 1.58it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 42%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 318/751 [01:43<05:31, 1.30it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 42%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 319/751 [01:44<05:20, 1.35it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 43%|β–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 320/751 [01:45<05:10, 1.39it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 43%|β–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 321/751 [01:45<04:55, 1.46it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 43%|β–ˆβ–ˆβ–ˆοΏ½[2024-08-03 15:15:01] [Rank 1] totoal_tokens=12110, outputs='Shakur Stevenson'
[2024-08-03 15:15:01] [Rank 2] totoal_tokens=11654, outputs='Alpha'
[2024-08-03 15:15:01] [Rank 3] totoal_tokens=12348, outputs='Region Y'
[2024-08-03 15:15:01] [Rank 0] totoal_tokens=12107, outputs='A food company'
[2024-08-03 15:15:02] [Rank 2] totoal_tokens=11943, outputs='John'
[2024-08-03 15:15:02] [Rank 1] totoal_tokens=12166, outputs='Nancy'
[2024-08-03 15:15:02] [Rank 0] totoal_tokens=12116, outputs='Gamma'
[2024-08-03 15:15:02] [Rank 3] totoal_tokens=12353, outputs='Browns Brasserie'
[2024-08-03 15:15:02] [Rank 2] totoal_tokens=11949, outputs='Walls'
[2024-08-03 15:15:02] [Rank 1] totoal_tokens=12182, outputs='oak'
[2024-08-03 15:15:03] [Rank 0] totoal_tokens=12151, outputs='sunflower'
[2024-08-03 15:15:03] [Rank 3] totoal_tokens=12364, outputs='Northern Thailand'
[2024-08-03 15:15:03] [Rank 2] totoal_tokens=12001, outputs='Spirit Trap'
[2024-08-03 15:15:03] [Rank 1] totoal_tokens=12310, outputs='Independence Day'
[2024-08-03 15:15:03] [Rank 0] totoal_tokens=12254, outputs='Norway'
[2024-08-03 15:15:03] [Rank 3] totoal_tokens=12377, outputs='Peregrine falcon'
[2024-08-03 15:15:04] [Rank 1] totoal_tokens=12341, outputs='Beta'
[2024-08-03 15:15:04] [Rank 0] totoal_tokens=12273, outputs='tall grass'
[2024-08-03 15:15:04] [Rank 2] totoal_tokens=12015, outputs='The product is launched to the public'
[2024-08-03 15:15:04] [Rank 3] totoal_tokens=12396, outputs='Evalena'
[2024-08-03 15:15:04] [Rank 1] totoal_tokens=12349, outputs='Eat breakfast'
[2024-08-03 15:15:05] [Rank 2] totoal_tokens=12071, outputs='Dance'
[2024-08-03 15:15:05] [Rank 0] totoal_tokens=12334, outputs='Junior'
[2024-08-03 15:15:05] [Rank 3] totoal_tokens=12435, outputs='Target'
[2024-08-03 15:15:05] [Rank 1] totoal_tokens=12377, outputs='Strontium'
[2024-08-03 15:15:05] [Rank 0] totoal_tokens=12335, outputs='Blue book'
[2024-08-03 15:15:05] [Rank 3] totoal_tokens=12445, outputs='Charlie'
[2024-08-03 15:15:05] [Rank 2] totoal_tokens=12085, outputs='in the backyard'
[2024-08-03 15:15:06] [Rank 1] totoal_tokens=12379, outputs='Dr. Johnson'
[2024-08-03 15:15:06] [Rank 0] totoal_tokens=12414, outputs='Afiye Derrick'
[2024-08-03 15:15:06] [Rank 2] totoal_tokens=12088, outputs='Aloe Vera'
[2024-08-03 15:15:06] [Rank 3] totoal_tokens=12648, outputs='Wash her clothes'
[2024-08-03 15:15:06] [Rank 1] totoal_tokens=12396, outputs='Chocolate cake'
[2024-08-03 15:15:07] [Rank 2] totoal_tokens=12091, outputs='Cactus'
[2024-08-03 15:15:07] [Rank 0] totoal_tokens=12543, outputs='Burger King'
οΏ½β–Ž | 322/751 [01:46<04:47, 1.49it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 43%|β–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 323/751 [01:47<04:44, 1.50it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 43%|β–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 324/751 [01:47<04:34, 1.56it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 43%|β–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 325/751 [01:48<04:46, 1.49it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 43%|β–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 326/751 [01:49<04:43, 1.50it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 44%|β–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 327/751 [01:49<04:41, 1.51it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 44%|β–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 328/751 [01:50<04:37, 1.52it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 44%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 329/751 [01:51<04:37, 1.52it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 44%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 330/751 [01:51<04:44, 1.48it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 44%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 331/751 [01:52[2024-08-03 15:15:07] [Rank 3] totoal_tokens=12670, outputs='Store B'
[2024-08-03 15:15:07] [Rank 1] totoal_tokens=12410, outputs='Lemon'
[2024-08-03 15:15:07] [Rank 2] totoal_tokens=12156, outputs='Jogging'
[2024-08-03 15:15:07] [Rank 0] totoal_tokens=12580, outputs='Chocolate cake'
[2024-08-03 15:15:08] [Rank 1] totoal_tokens=12422, outputs='The Great American Read'
[2024-08-03 15:15:08] [Rank 3] totoal_tokens=13058, outputs='Joppenbergh Mountain'
[2024-08-03 15:15:08] [Rank 2] totoal_tokens=12254, outputs='The Well'
[2024-08-03 15:15:08] [Rank 0] totoal_tokens=12704, outputs='Open the door'
[2024-08-03 15:15:08] [Rank 1] totoal_tokens=12442, outputs='A manager'
[2024-08-03 15:15:09] [Rank 3] totoal_tokens=13112, outputs='Tully'
[2024-08-03 15:15:09] [Rank 2] totoal_tokens=12304, outputs='dessert'
[2024-08-03 15:15:09] [Rank 1] totoal_tokens=12445, outputs='Eagle'
[2024-08-03 15:15:09] [Rank 0] totoal_tokens=12884, outputs='children'
[2024-08-03 15:15:09] [Rank 2] totoal_tokens=12321, outputs='Charlie'
[2024-08-03 15:15:10] [Rank 3] totoal_tokens=13117, outputs="The European Union's General Data Protection Regulation (GDPR)"
[2024-08-03 15:15:10] [Rank 1] totoal_tokens=12458, outputs='Charles Rabinovich'
[2024-08-03 15:15:10] [Rank 0] totoal_tokens=13126, outputs='Palm'
[2024-08-03 15:15:10] [Rank 2] totoal_tokens=12369, outputs='Region Z'
[2024-08-03 15:15:10] [Rank 3] totoal_tokens=13127, outputs='Mauritanians'
[2024-08-03 15:15:11] [Rank 0] totoal_tokens=13132, outputs='The final performance'
[2024-08-03 15:15:11] [Rank 1] totoal_tokens=12880, outputs='Beatriz Michelena'
[2024-08-03 15:15:11] [Rank 2] totoal_tokens=12381, outputs='backyard'
[2024-08-03 15:15:11] [Rank 3] totoal_tokens=13156, outputs='backyard'
[2024-08-03 15:15:11] [Rank 0] totoal_tokens=13172, outputs='Paint the model'
[2024-08-03 15:15:11] [Rank 1] totoal_tokens=13021, outputs='Editing'
[2024-08-03 15:15:11] [Rank 2] totoal_tokens=12423, outputs='on top of couch'
[2024-08-03 15:15:12] [Rank 3] totoal_tokens=13172, outputs='Dr. Smith'
[2024-08-03 15:15:12] [Rank 0] totoal_tokens=13201, outputs='Company Z'
[2024-08-03 15:15:12] [Rank 1] totoal_tokens=13033, outputs='ice cream'
[2024-08-03 15:15:12] [Rank 2] totoal_tokens=12573, outputs='Play tennis'
[2024-08-03 15:15:12] [Rank 3] totoal_tokens=13175, outputs='Every week'
[2024-08-03 15:15:13] [Rank 1] totoal_tokens=13049, outputs='Store B'
[2024-08-03 15:15:13] [Rank 0] totoal_tokens=13281, outputs='June 2018'
[2024-08-03 15:15:13] [Rank 2] totoal_tokens=12596, outputs='bleach'
[2024-08-03 15:15:13] [Rank 3] totoal_tokens=13307, outputs='The Great Wall'
[2024-08-03 15:15:13] [Rank 1] totoal_tokens=13051, outputs='Watch TV'
[2024-08-03 15:15:13] [Rank 2] totoal_tokens=12771, outputs='Banana'
[2024-08-03 15:15:13] [Rank 0] totoal_tokens=13293, outputs='Coffee break'
[2024-08-03 15:15:14] [Rank 3] totoal_tokens=13314, outputs='oak'
[2024-08-03 15:15:14] [Rank 2] totoal_tokens=12990, outputs='Gamma'
[2024-08-03 15:15:14] [Rank 1] totoal_tokens=13163, outputs='Car B'
[2024-08-03 15:15:14] [Rank 0] totoal_tokens=13302, outputs='Computers and other tech equipment'
<04:51, 1.44it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 44%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 332/751 [01:53<04:57, 1.41it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 44%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 333/751 [01:53<04:53, 1.43it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 44%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 334/751 [01:54<05:22, 1.29it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 45%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 335/751 [01:55<05:20, 1.30it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 45%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 336/751 [01:56<05:07, 1.35it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 45%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 337/751 [01:57<04:57, 1.39it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 45%|β–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 338/751 [01:57<04:49, 1.43it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 45%|β–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 339/751 [01:58<05:02, 1.36it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 45%|β–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 340/751 [01:59<05:08, 1.33it/s] Proces[2024-08-03 15:15:14] [Rank 3] totoal_tokens=13337, outputs='Cactus'
[2024-08-03 15:15:15] [Rank 1] totoal_tokens=13211, outputs='ice cream'
[2024-08-03 15:15:15] [Rank 0] totoal_tokens=13313, outputs='Alice solves the first problem before Bob.'
[2024-08-03 15:15:15] [Rank 3] totoal_tokens=13430, outputs='Car A'
[2024-08-03 15:15:16] [Rank 2] totoal_tokens=13035, outputs='The Lord of the Rings'
[2024-08-03 15:15:16] [Rank 0] totoal_tokens=13314, outputs='Dr. Johnson'
[2024-08-03 15:15:16] [Rank 1] totoal_tokens=13260, outputs='SAGE'
[2024-08-03 15:15:16] [Rank 3] totoal_tokens=13487, outputs='The cook'
[2024-08-03 15:15:16] [Rank 2] totoal_tokens=13041, outputs='Female'
[2024-08-03 15:15:17] [Rank 0] totoal_tokens=13334, outputs='Wash'
[2024-08-03 15:15:17] [Rank 1] totoal_tokens=13289, outputs='pigeon'
[2024-08-03 15:15:17] [Rank 3] totoal_tokens=13578, outputs='The final concert'
[2024-08-03 15:15:17] [Rank 2] totoal_tokens=13054, outputs='Jacob'
[2024-08-03 15:15:17] [Rank 0] totoal_tokens=13353, outputs='Cactus'
[2024-08-03 15:15:18] [Rank 3] totoal_tokens=13661, outputs='Oil painting'
[2024-08-03 15:15:18] [Rank 2] totoal_tokens=13150, outputs='France'
[2024-08-03 15:15:18] [Rank 1] totoal_tokens=13312, outputs='The manager meets the CEO of the company.'
[2024-08-03 15:15:18] [Rank 0] totoal_tokens=13385, outputs='Store Z'
[2024-08-03 15:15:18] [Rank 3] totoal_tokens=13699, outputs='Swimming'
[2024-08-03 15:15:18] [Rank 2] totoal_tokens=13251, outputs='tall grass'
[2024-08-03 15:15:19] [Rank 0] totoal_tokens=13440, outputs='tulipa'
[2024-08-03 15:15:19] [Rank 1] totoal_tokens=13343, outputs='MURDER STRIKES A POSE'
[2024-08-03 15:15:19] [Rank 3] totoal_tokens=13710, outputs='What happens after testing?'
[2024-08-03 15:15:19] [Rank 2] totoal_tokens=13336, outputs='West Germany'
[2024-08-03 15:15:19] [Rank 0] totoal_tokens=13523, outputs='Swimming'
[2024-08-03 15:15:20] [Rank 1] totoal_tokens=13355, outputs='Northern Superchargers'
[2024-08-03 15:15:20] [Rank 3] totoal_tokens=13710, outputs='The test is passed.'
[2024-08-03 15:15:20] [Rank 0] totoal_tokens=13531, outputs='Jupiter'
[2024-08-03 15:15:20] [Rank 2] totoal_tokens=13343, outputs='MURDER STRIKES A POSE'
[2024-08-03 15:15:20] [Rank 1] totoal_tokens=13365, outputs='Cactus'
[2024-08-03 15:15:21] [Rank 3] totoal_tokens=13809, outputs='Queen Victoria'
[2024-08-03 15:15:21] [Rank 2] totoal_tokens=13350, outputs='Earth'
[2024-08-03 15:15:21] [Rank 0] totoal_tokens=13542, outputs='Swimming'
sing InternVL2-2B_reasoning-text-test.jsonl: 45%|β–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 341/751 [02:00<05:09, 1.32it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 46%|β–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 342/751 [02:00<05:24, 1.26it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 46%|β–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 343/751 [02:01<05:21, 1.27it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 46%|β–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 344/751 [02:02<05:01, 1.35it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 46%|β–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 345/751 [02:02<04:45, 1.42it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 46%|β–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 346/751 [02:03<04:51, 1.39it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 46%|β–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 347/751 [02:04<04:58, 1.35it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 46%|β–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 348/751 [02:05<04:49, 1.39it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 46%|β–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 349/751 [02:05<04:46, 1.40it/s] Processing InternVL2-2B_reasoni[2024-08-03 15:15:21] [Rank 1] totoal_tokens=13386, outputs='Cactus'
[2024-08-03 15:15:21] [Rank 3] totoal_tokens=13825, outputs='John'
[2024-08-03 15:15:22] [Rank 1] totoal_tokens=13500, outputs='Gail'
[2024-08-03 15:15:22] [Rank 3] totoal_tokens=13870, outputs='The starter'
[2024-08-03 15:15:22] [Rank 2] totoal_tokens=13404, outputs='Plan B'
[2024-08-03 15:15:22] [Rank 0] totoal_tokens=13558, outputs='Aston Martin DB5'
[2024-08-03 15:15:23] [Rank 2] totoal_tokens=13432, outputs='Yellow'
[2024-08-03 15:15:23] [Rank 3] totoal_tokens=13961, outputs='Y'
[2024-08-03 15:15:23] [Rank 1] totoal_tokens=13504, outputs='tulip'
[2024-08-03 15:15:23] [Rank 0] totoal_tokens=13810, outputs='Mr. President'
[2024-08-03 15:15:24] [Rank 3] totoal_tokens=13980, outputs='Nick Saban'
[2024-08-03 15:15:24] [Rank 1] totoal_tokens=13546, outputs='Aloe Vera'
[2024-08-03 15:15:24] [Rank 2] totoal_tokens=13585, outputs='Astral Weeks'
[2024-08-03 15:15:24] [Rank 0] totoal_tokens=13811, outputs='Courtship'
[2024-08-03 15:15:24] [Rank 3] totoal_tokens=14039, outputs='Niger'
[2024-08-03 15:15:25] [Rank 1] totoal_tokens=13579, outputs='oak'
[2024-08-03 15:15:25] [Rank 0] totoal_tokens=13817, outputs='Banana'
[2024-08-03 15:15:25] [Rank 2] totoal_tokens=13609, outputs='Earth'
[2024-08-03 15:15:25] [Rank 3] totoal_tokens=14062, outputs='Birds'
[2024-08-03 15:15:25] [Rank 0] totoal_tokens=13944, outputs='Coffee'
[2024-08-03 15:15:26] [Rank 1] totoal_tokens=13583, outputs='The Penguin Book'
[2024-08-03 15:15:26] [Rank 2] totoal_tokens=13754, outputs='Mauritanian'
[2024-08-03 15:15:26] [Rank 3] totoal_tokens=14097, outputs='Diet Plan C'
[2024-08-03 15:15:26] [Rank 0] totoal_tokens=13962, outputs='Company Z'
[2024-08-03 15:15:26] [Rank 2] totoal_tokens=14064, outputs='bleach'
[2024-08-03 15:15:27] [Rank 1] totoal_tokens=13625, outputs='Cactus'
[2024-08-03 15:15:27] [Rank 3] totoal_tokens=14101, outputs='The fireworks display'
[2024-08-03 15:15:27] [Rank 0] totoal_tokens=14072, outputs='Banana'
[2024-08-03 15:15:27] [Rank 2] totoal_tokens=14096, outputs='The Venus flytrap'
[2024-08-03 15:15:27] [Rank 3] totoal_tokens=14237, outputs='Ranbir Kapoor'
[2024-08-03 15:15:27] [Rank 1] totoal_tokens=13750, outputs='Mauritanian NGOs and television stations'
[2024-08-03 15:15:28] [Rank 0] totoal_tokens=14092, outputs='Star Wars'
[2024-08-03 15:15:28] [Rank 2] totoal_tokens=14127, outputs='Lindsay Langton of Bottle Logic noted, β€œJosh Emrich does all'
[2024-08-03 15:15:28] [Rank 1] totoal_tokens=13755, outputs='Car A'
[2024-08-03 15:15:28] [Rank 3] totoal_tokens=14242, outputs='Hippo'
[2024-08-03 15:15:28] [Rank 0] totoal_tokens=14106, outputs='Ethiopian goat herder'
ng-text-test.jsonl: 47%|β–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 350/751 [02:06<05:35, 1.20it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 47%|β–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 351/751 [02:08<06:17, 1.06it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 47%|β–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 352/751 [02:08<06:01, 1.10it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 47%|β–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 353/751 [02:09<05:36, 1.18it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 47%|β–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 354/751 [02:10<05:23, 1.23it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 47%|β–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 355/751 [02:11<05:17, 1.25it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 47%|β–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 356/751 [02:11<05:06, 1.29it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 48%|β–ˆβ–ˆβ–ˆβ–ˆβ–Š | 357/751 [02:12<05:02, 1.30it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 48%|β–ˆβ–ˆβ–ˆβ–ˆβ–Š | 358/751 [02:13<04:58, 1.32it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 48%|[2024-08-03 15:15:29] [Rank 2] totoal_tokens=14242, outputs='Eduard'
[2024-08-03 15:15:29] [Rank 1] totoal_tokens=13802, outputs='Jupiter'
[2024-08-03 15:15:29] [Rank 3] totoal_tokens=14273, outputs='Peanuts'
[2024-08-03 15:15:29] [Rank 0] totoal_tokens=14131, outputs='Sony'
[2024-08-03 15:15:30] [Rank 2] totoal_tokens=14252, outputs='Amanda Seyfried'
[2024-08-03 15:15:30] [Rank 1] totoal_tokens=13804, outputs='Dr. Liu'
[2024-08-03 15:15:30] [Rank 0] totoal_tokens=14134, outputs='AIB'
[2024-08-03 15:15:30] [Rank 3] totoal_tokens=14315, outputs='Oak tree planted in the backyard'
[2024-08-03 15:15:30] [Rank 2] totoal_tokens=14254, outputs='The dog'
[2024-08-03 15:15:31] [Rank 1] totoal_tokens=13807, outputs='tulip'
[2024-08-03 15:15:31] [Rank 0] totoal_tokens=14208, outputs='Oil on canvas'
[2024-08-03 15:15:31] [Rank 3] totoal_tokens=14411, outputs='The manager meets the manager.'
[2024-08-03 15:15:31] [Rank 2] totoal_tokens=14281, outputs='Get coffee'
[2024-08-03 15:15:31] [Rank 1] totoal_tokens=13943, outputs='Charlie'
[2024-08-03 15:15:32] [Rank 3] totoal_tokens=14450, outputs='owl'
[2024-08-03 15:15:32] [Rank 0] totoal_tokens=14345, outputs='The end of the day'
[2024-08-03 15:15:32] [Rank 2] totoal_tokens=14349, outputs='Store Z'
[2024-08-03 15:15:32] [Rank 1] totoal_tokens=13954, outputs='fresh fruit'
[2024-08-03 15:15:33] [Rank 3] totoal_tokens=14465, outputs='backyard'
[2024-08-03 15:15:33] [Rank 0] totoal_tokens=14420, outputs='MolaGora'
[2024-08-03 15:15:33] [Rank 1] totoal_tokens=13977, outputs='Get fish'
[2024-08-03 15:15:33] [Rank 2] totoal_tokens=14748, outputs='The Outland Zone'
[2024-08-03 15:15:34] [Rank 3] totoal_tokens=14512, outputs='The Space Between'
[2024-08-03 15:15:34] [Rank 2] totoal_tokens=14785, outputs='Coffee'
[2024-08-03 15:15:34] [Rank 0] totoal_tokens=14447, outputs='moth'
[2024-08-03 15:15:34] [Rank 1] totoal_tokens=14048, outputs='She is a fan of Harry Nilsson'
[2024-08-03 15:15:35] [Rank 1] totoal_tokens=14051, outputs='Dr. Lee'
[2024-08-03 15:15:35] [Rank 0] totoal_tokens=14449, outputs='Circa 1916'
[2024-08-03 15:15:35] [Rank 3] totoal_tokens=14519, outputs='Rome'
[2024-08-03 15:15:35] [Rank 1] totoal_tokens=14141, outputs='Ellen'
[2024-08-03 15:15:35] [Rank 0] totoal_tokens=14512, outputs='The Air Raid'
[2024-08-03 15:15:36] [Rank 3] totoal_tokens=14612, outputs='The product is launched to the public.'
[2024-08-03 15:15:36] [Rank 1] totoal_tokens=14195, outputs='Raqqa'
[2024-08-03 15:15:36] [Rank 0] totoal_tokens=14702, outputs='Route X'
β–ˆβ–ˆβ–ˆβ–ˆβ–Š | 359/751 [02:14<05:06, 1.28it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 48%|β–ˆβ–ˆβ–ˆβ–ˆβ–Š | 360/751 [02:15<05:08, 1.27it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 48%|β–ˆβ–ˆβ–ˆβ–ˆβ–Š | 361/751 [02:15<05:01, 1.29it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 48%|β–ˆβ–ˆβ–ˆβ–ˆβ–Š | 362/751 [02:16<05:14, 1.24it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 48%|β–ˆβ–ˆβ–ˆβ–ˆβ–Š | 363/751 [02:17<05:36, 1.15it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 48%|β–ˆβ–ˆβ–ˆβ–ˆβ–Š | 364/751 [02:18<05:48, 1.11it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 49%|β–ˆβ–ˆβ–ˆβ–ˆβ–Š | 365/751 [02:19<05:54, 1.09it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 49%|β–ˆβ–ˆβ–ˆβ–ˆβ–Š | 366/751 [02:20<05:47, 1.11it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 49%|β–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 367/751 [02:21<05:25, 1.18it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 49%|β–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 368[2024-08-03 15:15:37] [Rank 2] totoal_tokens=14871, outputs='Earth'
[2024-08-03 15:15:37] [Rank 3] totoal_tokens=14683, outputs='Dr. Zhang'
[2024-08-03 15:15:37] [Rank 0] totoal_tokens=14786, outputs='cactus'
[2024-08-03 15:15:37] [Rank 1] totoal_tokens=14240, outputs='The final step in constructing the model is to add the finishing touches, such as'
[2024-08-03 15:15:38] [Rank 2] totoal_tokens=15092, outputs='Dessert'
[2024-08-03 15:15:38] [Rank 3] totoal_tokens=14697, outputs='Impressionist'
[2024-08-03 15:15:38] [Rank 1] totoal_tokens=14515, outputs='Alice'
[2024-08-03 15:15:38] [Rank 2] totoal_tokens=15098, outputs='The festival ends with a fireworks display.'
[2024-08-03 15:15:39] [Rank 3] totoal_tokens=15155, outputs='Cake'
[2024-08-03 15:15:39] [Rank 1] totoal_tokens=14617, outputs='Store B'
[2024-08-03 15:15:39] [Rank 3] totoal_tokens=15187, outputs='Gamma'
[2024-08-03 15:15:40] [Rank 1] totoal_tokens=14694, outputs='Dan Eisenhauer'
[2024-08-03 15:15:40] [Rank 0] totoal_tokens=14877, outputs='Impressionist'
[2024-08-03 15:15:40] [Rank 3] totoal_tokens=15191, outputs='Donald Trump'
[2024-08-03 15:15:40] [Rank 1] totoal_tokens=14778, outputs='Nike'
[2024-08-03 15:15:41] [Rank 0] totoal_tokens=14878, outputs='Northern'
[2024-08-03 15:15:41] [Rank 2] totoal_tokens=15157, outputs='The Z-score'
[2024-08-03 15:15:41] [Rank 1] totoal_tokens=14876, outputs='Superchargers'
[2024-08-03 15:15:41] [Rank 3] totoal_tokens=15258, outputs='Biscuit'
[2024-08-03 15:15:41] [Rank 0] totoal_tokens=15070, outputs='Apple'
[2024-08-03 15:15:42] [Rank 2] totoal_tokens=15200, outputs='Brewster'
[2024-08-03 15:15:42] [Rank 1] totoal_tokens=15160, outputs='oak'
[2024-08-03 15:15:42] [Rank 0] totoal_tokens=15100, outputs='Gerelkhuu'
[2024-08-03 15:15:42] [Rank 3] totoal_tokens=15297, outputs='Sunflower'
[2024-08-03 15:15:43] [Rank 2] totoal_tokens=15217, outputs='Dr. Allen'
[2024-08-03 15:15:43] [Rank 1] totoal_tokens=15168, outputs='Jane'
[2024-08-03 15:15:43] [Rank 0] totoal_tokens=15182, outputs='Coffee'
[2024-08-03 15:15:43] [Rank 2] totoal_tokens=15367, outputs='A food company'
[2024-08-03 15:15:43] [Rank 1] totoal_tokens=15183, outputs='Work'
[2024-08-03 15:15:44] [Rank 3] totoal_tokens=15383, outputs='The plant that needs the most light is the one that is in the sun.'
[2024-08-03 15:15:44] [Rank 2] totoal_tokens=15533, outputs='Earth'
[2024-08-03 15:15:45] [Rank 3] totoal_tokens=15387, outputs='Succulent'
[2024-08-03 15:15:45] [Rank 0] totoal_tokens=15198, outputs='Get your sketchbook, comic book/super hero drawing class and Fashion Illustration'
[2024-08-03 15:15:45] [Rank 1] totoal_tokens=15184, outputs='Charlie'
[2024-08-03 15:15:46] [Rank 3] totoal_tokens=15510, outputs='Region Y'
[2024-08-03 15:15:46] [Rank 2] totoal_tokens=15572, outputs='oak'
[2024-08-03 15:15:46] [Rank 0] totoal_tokens=15210, outputs='Cactus'
[2024-08-03 15:15:46] [Rank 1] totoal_tokens=15193, outputs='The Wind in the Willows'
[2024-08-03 15:15:46] [Rank 3] totoal_tokens=15523, outputs='concert'
[2024-08-03 15:15:47] [Rank 0] totoal_tokens=15265, outputs='The house'
/751 [02:22<05:20, 1.19it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 49%|β–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 369/751 [02:22<05:06, 1.25it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 49%|β–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 370/751 [02:25<09:01, 1.42s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 49%|β–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 371/751 [02:26<07:46, 1.23s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 50%|β–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 372/751 [02:27<06:55, 1.10s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 50%|β–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 373/751 [02:28<06:23, 1.02s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 50%|β–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 374/751 [02:28<05:50, 1.08it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 50%|β–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 375/751 [02:30<07:33, 1.21s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 50%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 376/751 [02:31<06:56, 1.11s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 50%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 377/751 [02:32<06:36, 1.06s[2024-08-03 15:15:47] [Rank 1] totoal_tokens=15197, outputs='Wasp'
[2024-08-03 15:15:47] [Rank 2] totoal_tokens=15580, outputs='Cactus'
[2024-08-03 15:15:47] [Rank 3] totoal_tokens=15537, outputs='Watch TV'
[2024-08-03 15:15:48] [Rank 1] totoal_tokens=15261, outputs='Charlie'
[2024-08-03 15:15:48] [Rank 0] totoal_tokens=15375, outputs='Charlie'
[2024-08-03 15:15:48] [Rank 2] totoal_tokens=15958, outputs='Sugarcane'
[2024-08-03 15:15:48] [Rank 1] totoal_tokens=15450, outputs='Region X'
[2024-08-03 15:15:48] [Rank 3] totoal_tokens=15545, outputs='0'
[2024-08-03 15:15:48] [Rank 0] totoal_tokens=15387, outputs='Cara'
[2024-08-03 15:15:49] [Rank 2] totoal_tokens=16081, outputs='Painting the model'
[2024-08-03 15:15:49] [Rank 3] totoal_tokens=16054, outputs='Jane'
[2024-08-03 15:15:49] [Rank 1] totoal_tokens=15526, outputs='The esmorzar'
[2024-08-03 15:15:49] [Rank 0] totoal_tokens=15534, outputs='The parade'
[2024-08-03 15:15:50] [Rank 2] totoal_tokens=16228, outputs='Get milk from the fridge'
[2024-08-03 15:15:50] [Rank 3] totoal_tokens=16085, outputs='Alice'
[2024-08-03 15:15:50] [Rank 0] totoal_tokens=15550, outputs='Vegas'
[2024-08-03 15:15:50] [Rank 1] totoal_tokens=15547, outputs='Jon Lester'
[2024-08-03 15:15:51] [Rank 2] totoal_tokens=16246, outputs='George Hwang'
[2024-08-03 15:15:51] [Rank 0] totoal_tokens=15577, outputs="The couch is to the left of Tim's wallet."
[2024-08-03 15:15:51] [Rank 1] totoal_tokens=15702, outputs='Spring Forth Farm posted this dual-purpose tip for both trellising and protecting'
[2024-08-03 15:15:52] [Rank 3] totoal_tokens=16247, outputs='Sarah'
[2024-08-03 15:15:52] [Rank 2] totoal_tokens=16265, outputs='The final step in constructing the model is to have a clear understanding of the differences'
[2024-08-03 15:15:52] [Rank 0] totoal_tokens=15632, outputs='The government'
[2024-08-03 15:15:52] [Rank 3] totoal_tokens=16315, outputs='U-M'
[2024-08-03 15:15:53] [Rank 1] totoal_tokens=16026, outputs='Food'
[2024-08-03 15:15:53] [Rank 2] totoal_tokens=16417, outputs='Bob'
[2024-08-03 15:15:53] [Rank 0] totoal_tokens=15770, outputs='Alexandra Waldherr'
[2024-08-03 15:15:53] [Rank 3] totoal_tokens=16343, outputs='ARTEMIS'
[2024-08-03 15:15:53] [Rank 1] totoal_tokens=16039, outputs='Earth'
[2024-08-03 15:15:54] [Rank 0] totoal_tokens=15884, outputs='Belo Sun'
[2024-08-03 15:15:54] [Rank 1] totoal_tokens=16053, outputs='Liam'
[2024-08-03 15:15:54] [Rank 2] totoal_tokens=16696, outputs='The model is constructed using the 3D printer.'
[2024-08-03 15:15:55] [Rank 3] totoal_tokens=16406, outputs='Kevin and I went to a party at Frontier to celebrate the recent engagement of one'
[2024-08-03 15:15:55] [Rank 0] totoal_tokens=15981, outputs='Dr. Carter'
[2024-08-03 15:15:55] [Rank 1] totoal_tokens=16082, outputs='Get out of bed'
[2024-08-03 15:15:55] [Rank 2] totoal_tokens=16703, outputs='Jones'
[2024-08-03 15:15:56] [Rank 3] totoal_tokens=16459, outputs='Tulip'
[2024-08-03 15:15:56] [Rank 0] totoal_tokens=15996, outputs='Mia'
/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 50%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 378/751 [02:33<06:22, 1.02s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 50%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 379/751 [02:34<06:10, 1.00it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 51%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 380/751 [02:35<06:00, 1.03it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 51%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 381/751 [02:36<05:42, 1.08it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 51%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 382/751 [02:37<06:12, 1.01s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 51%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 383/751 [02:38<05:46, 1.06it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 51%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 384/751 [02:38<05:46, 1.06it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 51%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 385/751 [02:39<05:47, 1.05it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 51%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 386/751 [02:40<05:32, 1.10it/s] Processing Inter[2024-08-03 15:15:56] [Rank 2] totoal_tokens=17034, outputs='walls'
[2024-08-03 15:15:56] [Rank 1] totoal_tokens=16088, outputs="I don't have access to the specific details of liam's past, but"
[2024-08-03 15:15:57] [Rank 3] totoal_tokens=16535, outputs='Wooster Square'
[2024-08-03 15:15:57] [Rank 0] totoal_tokens=16089, outputs='The Royal Marsden Cancer Research Institute'
[2024-08-03 15:15:57] [Rank 2] totoal_tokens=17087, outputs='Jamta'
[2024-08-03 15:15:57] [Rank 1] totoal_tokens=16154, outputs='blue book'
[2024-08-03 15:15:58] [Rank 3] totoal_tokens=16556, outputs='Eagle'
[2024-08-03 15:15:58] [Rank 0] totoal_tokens=16118, outputs='Mzansi'
[2024-08-03 15:15:58] [Rank 2] totoal_tokens=17089, outputs='The office'
[2024-08-03 15:15:58] [Rank 1] totoal_tokens=16200, outputs='Sword of the Necromancer'
[2024-08-03 15:15:58] [Rank 3] totoal_tokens=16564, outputs='Amazon'
[2024-08-03 15:15:58] [Rank 0] totoal_tokens=16127, outputs='annually'
[2024-08-03 15:15:59] [Rank 2] totoal_tokens=17138, outputs='Cactus'
[2024-08-03 15:15:59] [Rank 3] totoal_tokens=16580, outputs='Charlie'
[2024-08-03 15:15:59] [Rank 0] totoal_tokens=16130, outputs='American Robin'
[2024-08-03 15:15:59] [Rank 1] totoal_tokens=16378, outputs='Darewise'
[2024-08-03 15:16:00] [Rank 2] totoal_tokens=17140, outputs='Dr. Johnson'
[2024-08-03 15:16:00] [Rank 3] totoal_tokens=17135, outputs='Walls'
[2024-08-03 15:16:00] [Rank 0] totoal_tokens=16183, outputs='Southern'
[2024-08-03 15:16:01] [Rank 1] totoal_tokens=16406, outputs='Gamma'
[2024-08-03 15:16:01] [Rank 2] totoal_tokens=17153, outputs='Salad'
[2024-08-03 15:16:01] [Rank 3] totoal_tokens=17169, outputs='Coffee'
[2024-08-03 15:16:01] [Rank 0] totoal_tokens=16204, outputs='Charlie'
[2024-08-03 15:16:02] [Rank 1] totoal_tokens=16494, outputs='Banana'
[2024-08-03 15:16:02] [Rank 2] totoal_tokens=17161, outputs='Walls'
[2024-08-03 15:16:02] [Rank 3] totoal_tokens=17228, outputs='Bird'
[2024-08-03 15:16:02] [Rank 0] totoal_tokens=16234, outputs='Sparrow'
[2024-08-03 15:16:03] [Rank 1] totoal_tokens=16500, outputs='Open the door'
[2024-08-03 15:16:03] [Rank 2] totoal_tokens=17193, outputs='Marina Kurakin'
[2024-08-03 15:16:03] [Rank 0] totoal_tokens=16302, outputs='South'
[2024-08-03 15:16:03] [Rank 3] totoal_tokens=17295, outputs='The test is passed.'
[2024-08-03 15:16:04] [Rank 2] totoal_tokens=17257, outputs='The results are published in the journal Nature.'
[2024-08-03 15:16:04] [Rank 1] totoal_tokens=16533, outputs='The triathlon is a competition that consists of three events: swimming, cycling'
[2024-08-03 15:16:05] [Rank 3] totoal_tokens=17369, outputs='Get the laptop'
[2024-08-03 15:16:05] [Rank 1] totoal_tokens=16546, outputs='Store B'
[2024-08-03 15:16:05] [Rank 0] totoal_tokens=16426, outputs='The manager meets the sales team first.'
nVL2-2B_reasoning-text-test.jsonl: 52%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 387/751 [02:41<05:32, 1.10it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 52%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 388/751 [02:42<05:40, 1.06it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 52%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 389/751 [02:43<05:23, 1.12it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 52%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 390/751 [02:44<05:12, 1.15it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 52%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 391/751 [02:45<05:05, 1.18it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 52%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 392/751 [02:46<05:39, 1.06it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 52%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 393/751 [02:47<05:22, 1.11it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 52%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 394/751 [02:47<05:28, 1.09it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 53%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 395/751 [02:48<05:15, 1.13it/s] Processing InternVL2-2B[2024-08-03 15:16:06] [Rank 3] totoal_tokens=17415, outputs='Earth'
[2024-08-03 15:16:06] [Rank 2] totoal_tokens=17273, outputs='Cara'
[2024-08-03 15:16:07] [Rank 3] totoal_tokens=17446, outputs='Pigeon'
[2024-08-03 15:16:07] [Rank 0] totoal_tokens=16677, outputs='France'
[2024-08-03 15:16:07] [Rank 2] totoal_tokens=17276, outputs='A'
[2024-08-03 15:16:08] [Rank 0] totoal_tokens=16689, outputs='Chinese sailors'
[2024-08-03 15:16:08] [Rank 3] totoal_tokens=17487, outputs='dessert'
[2024-08-03 15:16:09] [Rank 3] totoal_tokens=17692, outputs='dog'
[2024-08-03 15:16:09] [Rank 2] totoal_tokens=17281, outputs='The walls'
[2024-08-03 15:16:09] [Rank 0] totoal_tokens=16762, outputs='Charlie'
[2024-08-03 15:16:09] [Rank 1] totoal_tokens=16554, outputs='Frank'
[2024-08-03 15:16:10] [Rank 3] totoal_tokens=17750, outputs='Swing'
[2024-08-03 15:16:10] [Rank 2] totoal_tokens=17310, outputs='Go to the beach'
[2024-08-03 15:16:10] [Rank 0] totoal_tokens=17201, outputs='Watch TV'
[2024-08-03 15:16:10] [Rank 1] totoal_tokens=17123, outputs='France'
[2024-08-03 15:16:11] [Rank 0] totoal_tokens=17369, outputs='Bodega'
[2024-08-03 15:16:11] [Rank 3] totoal_tokens=17937, outputs='Invest C'
[2024-08-03 15:16:11] [Rank 2] totoal_tokens=17344, outputs='Mars Pathfinder Mission'
[2024-08-03 15:16:11] [Rank 1] totoal_tokens=17154, outputs='Banana'
[2024-08-03 15:16:12] [Rank 0] totoal_tokens=17377, outputs='Project Alpha'
[2024-08-03 15:16:12] [Rank 1] totoal_tokens=17154, outputs='The lights are turned on'
[2024-08-03 15:16:12] [Rank 3] totoal_tokens=18114, outputs='Paint the model'
[2024-08-03 15:16:13] [Rank 2] totoal_tokens=17355, outputs='Mark Wood'
[2024-08-03 15:16:13] [Rank 0] totoal_tokens=17403, outputs="I'm sorry, but I can't answer that question at the moment. I"
[2024-08-03 15:16:14] [Rank 3] totoal_tokens=18137, outputs='Samsung'
[2024-08-03 15:16:14] [Rank 1] totoal_tokens=17211, outputs='oak'
[2024-08-03 15:16:14] [Rank 2] totoal_tokens=17429, outputs='Diet Plan B'
[2024-08-03 15:16:14] [Rank 0] totoal_tokens=17457, outputs='The manager meets the sales team.'
[2024-08-03 15:16:15] [Rank 1] totoal_tokens=17292, outputs='Dr. Johnson'
[2024-08-03 15:16:15] [Rank 3] totoal_tokens=18183, outputs='Watched movie'
[2024-08-03 15:16:15] [Rank 0] totoal_tokens=17511, outputs='Sue'
_reasoning-text-test.jsonl: 53%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 396/751 [02:51<07:50, 1.33s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 53%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 397/751 [02:52<07:54, 1.34s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 53%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 398/751 [02:53<07:08, 1.21s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 53%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 399/751 [02:54<06:41, 1.14s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 53%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 400/751 [02:55<06:47, 1.16s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 53%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 401/751 [02:56<06:35, 1.13s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 54%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 402/751 [02:57<06:08, 1.05s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 54%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 403/751 [02:58<06:27, 1.11s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 54%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 404/751 [02:59<06:16, 1.09s/it] Processing InternVL2-2B_reason[2024-08-03 15:16:15] [Rank 2] totoal_tokens=17553, outputs='Fikra Forum'
[2024-08-03 15:16:16] [Rank 1] totoal_tokens=17294, outputs='Bush'
[2024-08-03 15:16:16] [Rank 3] totoal_tokens=18316, outputs='Get her backpack'
[2024-08-03 15:16:16] [Rank 0] totoal_tokens=17579, outputs='Car B'
[2024-08-03 15:16:16] [Rank 2] totoal_tokens=17760, outputs='oak'
[2024-08-03 15:16:17] [Rank 1] totoal_tokens=17305, outputs='Kestrel'
[2024-08-03 15:16:17] [Rank 0] totoal_tokens=17624, outputs='The first step is to identify the problem.'
[2024-08-03 15:16:17] [Rank 3] totoal_tokens=18319, outputs='The 10 Most Beautiful Airliners of All Time'
[2024-08-03 15:16:17] [Rank 2] totoal_tokens=18124, outputs='Ladybug'
[2024-08-03 15:16:18] [Rank 0] totoal_tokens=17679, outputs='Female'
[2024-08-03 15:16:18] [Rank 3] totoal_tokens=18359, outputs='Initiative Alpha'
[2024-08-03 15:16:18] [Rank 1] totoal_tokens=17366, outputs='Region X'
[2024-08-03 15:16:19] [Rank 0] totoal_tokens=17685, outputs='Investment C'
[2024-08-03 15:16:19] [Rank 3] totoal_tokens=18368, outputs='Mohammed'
[2024-08-03 15:16:19] [Rank 1] totoal_tokens=17518, outputs='Aloe vera'
[2024-08-03 15:16:20] [Rank 2] totoal_tokens=18246, outputs='oak'
[2024-08-03 15:16:20] [Rank 3] totoal_tokens=18390, outputs='Watch movie'
[2024-08-03 15:16:20] [Rank 1] totoal_tokens=17581, outputs='Aloe Vera'
[2024-08-03 15:16:20] [Rank 0] totoal_tokens=18085, outputs='Oil on canvas'
[2024-08-03 15:16:21] [Rank 1] totoal_tokens=17780, outputs='B'
[2024-08-03 15:16:21] [Rank 3] totoal_tokens=18509, outputs='The International Seminar for Sambo Judges'
[2024-08-03 15:16:21] [Rank 0] totoal_tokens=18098, outputs='The Garimpeiros are evicted from their homes.'
[2024-08-03 15:16:22] [Rank 2] totoal_tokens=18254, outputs='Cactus'
[2024-08-03 15:16:22] [Rank 3] totoal_tokens=18569, outputs='Wayfair'
[2024-08-03 15:16:22] [Rank 1] totoal_tokens=17865, outputs='North America'
[2024-08-03 15:16:22] [Rank 0] totoal_tokens=18229, outputs='Kerala'
[2024-08-03 15:16:23] [Rank 2] totoal_tokens=18288, outputs='Aloe vera'
[2024-08-03 15:16:23] [Rank 1] totoal_tokens=17888, outputs='Writing'
[2024-08-03 15:16:23] [Rank 3] totoal_tokens=18630, outputs='Aloe vera'
[2024-08-03 15:16:23] [Rank 0] totoal_tokens=18336, outputs='Golf'
[2024-08-03 15:16:24] [Rank 1] totoal_tokens=18088, outputs='Taryn'
[2024-08-03 15:16:24] [Rank 2] totoal_tokens=18390, outputs='Gabby Althoff'
[2024-08-03 15:16:24] [Rank 0] totoal_tokens=18360, outputs='Eagle'
ing-text-test.jsonl: 54%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 405/751 [03:00<05:57, 1.03s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 54%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 406/751 [03:01<05:51, 1.02s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 54%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 407/751 [03:02<05:59, 1.04s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 54%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 408/751 [03:03<05:37, 1.02it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 54%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 409/751 [03:04<05:36, 1.02it/s] Processing InternVL2-2B_reasoning-text-test.jsonl: 55%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 410/751 [03:05<06:11, 1.09s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 55%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 411/751 [03:07<06:28, 1.14s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 55%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 412/751 [03:08<06:11, 1.10s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 55%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 413/751 [03:09<06:04, 1.08s/it] Processing InternVL2-2B_reasoning-tex[2024-08-03 15:16:24] [Rank 3] totoal_tokens=18666, outputs='Yorkshire'
[2024-08-03 15:16:25] [Rank 1] totoal_tokens=18102, outputs='Walls'
[2024-08-03 15:16:25] [Rank 2] totoal_tokens=18479, outputs='Diet Plan C'
[2024-08-03 15:16:25] [Rank 3] totoal_tokens=18836, outputs='Zwickau'
[2024-08-03 15:16:25] [Rank 0] totoal_tokens=18374, outputs='Get water bottle from table'
[2024-08-03 15:16:26] [Rank 1] totoal_tokens=18108, outputs='Car B'
[2024-08-03 15:16:26] [Rank 2] totoal_tokens=18550, outputs='Banana'
[2024-08-03 15:16:27] [Rank 3] totoal_tokens=18854, outputs='Investment C'
[2024-08-03 15:16:27] [Rank 1] totoal_tokens=18109, outputs='Jacob Frey'
[2024-08-03 15:16:27] [Rank 0] totoal_tokens=18458, outputs='oak'
[2024-08-03 15:16:27] [Rank 2] totoal_tokens=18778, outputs='Alice'
[2024-08-03 15:16:28] [Rank 3] totoal_tokens=18921, outputs='Alice'
[2024-08-03 15:16:28] [Rank 1] totoal_tokens=18114, outputs='on top of the couch'
[2024-08-03 15:16:28] [Rank 0] totoal_tokens=18468, outputs='Charles Darwin'
[2024-08-03 15:16:28] [Rank 2] totoal_tokens=19029, outputs='The oak tree is planted in the backyard.'
[2024-08-03 15:16:29] [Rank 1] totoal_tokens=18231, outputs='Alice'
[2024-08-03 15:16:29] [Rank 0] totoal_tokens=18828, outputs='Assembly'
[2024-08-03 15:16:29] [Rank 3] totoal_tokens=19109, outputs='sunflower'
[2024-08-03 15:16:30] [Rank 2] totoal_tokens=19032, outputs='Banana'
[2024-08-03 15:16:30] [Rank 1] totoal_tokens=18310, outputs='The finish line'
[2024-08-03 15:16:30] [Rank 3] totoal_tokens=19292, outputs='Go for a run'
[2024-08-03 15:16:30] [Rank 0] totoal_tokens=18838, outputs='The vehicle is then certified to meet the WLTP standard.'
[2024-08-03 15:16:31] [Rank 2] totoal_tokens=19035, outputs='Tinkerbell'
[2024-08-03 15:16:31] [Rank 1] totoal_tokens=18322, outputs='bleach'
[2024-08-03 15:16:31] [Rank 3] totoal_tokens=19321, outputs='Elanor'
[2024-08-03 15:16:32] [Rank 2] totoal_tokens=19060, outputs='Investment C'
[2024-08-03 15:16:32] [Rank 0] totoal_tokens=19075, outputs='the walls'
[2024-08-03 15:16:32] [Rank 1] totoal_tokens=18453, outputs='Store Z'
[2024-08-03 15:16:32] [Rank 3] totoal_tokens=19346, outputs='Audi Q3'
[2024-08-03 15:16:33] [Rank 2] totoal_tokens=19062, outputs='Read the book'
[2024-08-03 15:16:33] [Rank 0] totoal_tokens=19075, outputs='the walls'
[2024-08-03 15:16:33] [Rank 3] totoal_tokens=19395, outputs='backyard'
[2024-08-03 15:16:34] [Rank 0] totoal_tokens=19118, outputs='Eat'
[2024-08-03 15:16:34] [Rank 2] totoal_tokens=19111, outputs='Cactus'
[2024-08-03 15:16:34] [Rank 3] totoal_tokens=19480, outputs='Apple'
[2024-08-03 15:16:35] [Rank 0] totoal_tokens=19268, outputs='Operation Market Garden'
t-test.jsonl: 55%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 414/751 [03:10<05:46, 1.03s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 55%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 415/751 [03:11<05:52, 1.05s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 55%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 416/751 [03:12<06:34, 1.18s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 56%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 417/751 [03:13<06:19, 1.14s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 56%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 418/751 [03:14<06:03, 1.09s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 56%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 419/751 [03:16<06:28, 1.17s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 56%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 420/751 [03:17<06:33, 1.19s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 56%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 421/751 [03:18<06:32, 1.19s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 56%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 422/751 [03:19<06:04, 1.11s/it] Processing InternVL2-2B_reasoning-text-test.[2024-08-03 15:16:35] [Rank 2] totoal_tokens=19132, outputs='end of day'
[2024-08-03 15:16:35] [Rank 1] totoal_tokens=18527, outputs='Go for a hike'
[2024-08-03 15:16:36] [Rank 3] totoal_tokens=19617, outputs='in his pocket'
[2024-08-03 15:16:36] [Rank 1] totoal_tokens=18548, outputs='He wakes up'
[2024-08-03 15:16:36] [Rank 2] totoal_tokens=19149, outputs='The final step is to add the model to the scene.'
[2024-08-03 15:16:37] [Rank 3] totoal_tokens=19699, outputs='people'
[2024-08-03 15:16:37] [Rank 0] totoal_tokens=19269, outputs='The family goes for a hike.'
[2024-08-03 15:16:37] [Rank 1] totoal_tokens=18614, outputs='Southwest'
[2024-08-03 15:16:37] [Rank 2] totoal_tokens=19192, outputs='Siobhan Harper-Nunes'
[2024-08-03 15:16:38] [Rank 0] totoal_tokens=19469, outputs='Butterfly'
[2024-08-03 15:16:38] [Rank 3] totoal_tokens=19707, outputs='GΓ©rard Depardieu'
[2024-08-03 15:16:38] [Rank 1] totoal_tokens=18677, outputs='Linda'
[2024-08-03 15:16:38] [Rank 2] totoal_tokens=19229, outputs='The Fishy'
[2024-08-03 15:16:39] [Rank 0] totoal_tokens=19579, outputs='Lithuania'
[2024-08-03 15:16:39] [Rank 1] totoal_tokens=18841, outputs='Turn on computer'
[2024-08-03 15:16:39] [Rank 3] totoal_tokens=19715, outputs='The painting was acquired by the Metropolitan Museum of Art.'
[2024-08-03 15:16:39] [Rank 2] totoal_tokens=19295, outputs='Cactus'
[2024-08-03 15:16:40] [Rank 0] totoal_tokens=19693, outputs='Car B'
[2024-08-03 15:16:41] [Rank 2] totoal_tokens=19343, outputs='Diet Plan B'
[2024-08-03 15:16:41] [Rank 1] totoal_tokens=19036, outputs='Get his backpack'
[2024-08-03 15:16:41] [Rank 3] totoal_tokens=19840, outputs='Store C'
[2024-08-03 15:16:41] [Rank 0] totoal_tokens=19929, outputs='oak'
[2024-08-03 15:16:42] [Rank 2] totoal_tokens=19380, outputs='The parade'
[2024-08-03 15:16:42] [Rank 3] totoal_tokens=20084, outputs='Watch TV'
[2024-08-03 15:16:43] [Rank 0] totoal_tokens=19960, outputs='George Clooney'
[2024-08-03 15:16:43] [Rank 2] totoal_tokens=19394, outputs='Bradley Beal leads the Wizards to their second win in Chicago with a'
[2024-08-03 15:16:43] [Rank 1] totoal_tokens=19038, outputs='Douglas Adams'
[2024-08-03 15:16:43] [Rank 3] totoal_tokens=20086, outputs='Ray Sullivan'
[2024-08-03 15:16:44] [Rank 0] totoal_tokens=20096, outputs='The Sunflower'
[2024-08-03 15:16:44] [Rank 2] totoal_tokens=19716, outputs='Gamma'
[2024-08-03 15:16:44] [Rank 1] totoal_tokens=19105, outputs='[Image]'
[2024-08-03 15:16:45] [Rank 3] totoal_tokens=20109, outputs='Investment C'
[2024-08-03 15:16:45] [Rank 1] totoal_tokens=19225, outputs='Jim'
[2024-08-03 15:16:45] [Rank 0] totoal_tokens=20176, outputs='watch movie'
[2024-08-03 15:16:46] [Rank 2] totoal_tokens=19836, outputs='Mitt Romney'
[2024-08-03 15:16:46] [Rank 3] totoal_tokens=20195, outputs='the walls'
[2024-08-03 15:16:46] [Rank 1] totoal_tokens=19244, outputs='Oil on canvas'
[2024-08-03 15:16:47] [Rank 0] totoal_tokens=20194, outputs='Investment C'
jsonl: 56%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 423/751 [03:20<05:53, 1.08s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 56%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 424/751 [03:22<07:15, 1.33s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 57%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 425/751 [03:23<06:51, 1.26s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 57%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 426/751 [03:24<06:43, 1.24s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 57%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 427/751 [03:25<06:21, 1.18s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 57%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 428/751 [03:26<06:09, 1.14s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 57%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 429/751 [03:28<07:12, 1.34s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 57%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 430/751 [03:29<06:59, 1.31s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 57%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 431/751 [03:31<07:05, 1.33s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: [2024-08-03 15:16:47] [Rank 3] totoal_tokens=20267, outputs='South Africa'
[2024-08-03 15:16:47] [Rank 2] totoal_tokens=19854, outputs='He was listening to music'
[2024-08-03 15:16:48] [Rank 1] totoal_tokens=19279, outputs='Rob writes: We all celebrate the success of food cooked lovingly and with purpose'
[2024-08-03 15:16:48] [Rank 0] totoal_tokens=20268, outputs='children'
[2024-08-03 15:16:48] [Rank 3] totoal_tokens=20404, outputs='Alaska'
[2024-08-03 15:16:48] [Rank 2] totoal_tokens=19942, outputs='Runs'
[2024-08-03 15:16:49] [Rank 1] totoal_tokens=19294, outputs='Next to it'
[2024-08-03 15:16:49] [Rank 0] totoal_tokens=20319, outputs='bee'
[2024-08-03 15:16:49] [Rank 3] totoal_tokens=20608, outputs='Paris'
[2024-08-03 15:16:50] [Rank 1] totoal_tokens=19383, outputs='Rome'
[2024-08-03 15:16:50] [Rank 2] totoal_tokens=19984, outputs='The final step in constructing the model is to add the model to the website.'
[2024-08-03 15:16:50] [Rank 0] totoal_tokens=20462, outputs='Route X'
[2024-08-03 15:16:51] [Rank 3] totoal_tokens=20621, outputs='Dina'
[2024-08-03 15:16:51] [Rank 1] totoal_tokens=19386, outputs='Shenbo Yu'
[2024-08-03 15:16:51] [Rank 2] totoal_tokens=20202, outputs='The final event in the triathlon is the β€œSprint”.'
[2024-08-03 15:16:51] [Rank 0] totoal_tokens=21042, outputs='frozen yogurt'
[2024-08-03 15:16:52] [Rank 3] totoal_tokens=20626, outputs='South'
[2024-08-03 15:16:52] [Rank 1] totoal_tokens=19440, outputs='Ecocity Builders'
[2024-08-03 15:16:52] [Rank 2] totoal_tokens=20222, outputs='Linda'
[2024-08-03 15:16:53] [Rank 0] totoal_tokens=21061, outputs='South Boston'
[2024-08-03 15:16:53] [Rank 3] totoal_tokens=20793, outputs='Sunset'
[2024-08-03 15:16:53] [Rank 1] totoal_tokens=19476, outputs='Aloe'
[2024-08-03 15:16:54] [Rank 2] totoal_tokens=20244, outputs='H.G. Wells'
[2024-08-03 15:16:54] [Rank 0] totoal_tokens=21135, outputs='TutuApp Alternatives'
[2024-08-03 15:16:54] [Rank 3] totoal_tokens=20855, outputs='Car B'
[2024-08-03 15:16:55] [Rank 1] totoal_tokens=19811, outputs='Laura Loomer'
[2024-08-03 15:16:55] [Rank 0] totoal_tokens=21143, outputs='Chocolate cake'
[2024-08-03 15:16:55] [Rank 2] totoal_tokens=20255, outputs='Tetsuo Kurata'
[2024-08-03 15:16:55] [Rank 3] totoal_tokens=20904, outputs='The manager meets the manager.'
[2024-08-03 15:16:56] [Rank 1] totoal_tokens=20195, outputs='Diet Plan C'
[2024-08-03 15:16:56] [Rank 2] totoal_tokens=20302, outputs='Region C'
[2024-08-03 15:16:56] [Rank 0] totoal_tokens=21502, outputs='Sanitizer'
[2024-08-03 15:16:56] [Rank 3] totoal_tokens=21139, outputs='Banana'
[2024-08-03 15:16:57] [Rank 2] totoal_tokens=20350, outputs='China'
[2024-08-03 15:16:57] [Rank 1] totoal_tokens=20216, outputs='The final event of the festival is the award ceremony.'
[2024-08-03 15:16:57] [Rank 0] totoal_tokens=22251, outputs='oak'
58%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 432/751 [03:32<06:57, 1.31s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 58%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 433/751 [03:33<06:48, 1.29s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 58%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 434/751 [03:34<06:19, 1.20s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 58%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 435/751 [03:36<06:38, 1.26s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 58%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 436/751 [03:37<06:24, 1.22s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 58%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 437/751 [03:38<06:22, 1.22s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 58%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 438/751 [03:39<06:27, 1.24s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 58%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 439/751 [03:40<06:19, 1.21s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 59%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 440/751 [03:42<06:14, 1.21s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 59%|οΏ½[2024-08-03 15:16:58] [Rank 3] totoal_tokens=21160, outputs='Project Alpha'
[2024-08-03 15:16:58] [Rank 1] totoal_tokens=20301, outputs='The butterfly'
[2024-08-03 15:16:58] [Rank 2] totoal_tokens=20403, outputs='dinner'
[2024-08-03 15:16:59] [Rank 3] totoal_tokens=21204, outputs='Mum'
[2024-08-03 15:16:59] [Rank 0] totoal_tokens=22387, outputs='Bird 1'
[2024-08-03 15:17:00] [Rank 2] totoal_tokens=20407, outputs='She gets a tattoo'
[2024-08-03 15:17:00] [Rank 3] totoal_tokens=21204, outputs='Alpha'
[2024-08-03 15:17:00] [Rank 0] totoal_tokens=22405, outputs='Georgia'
[2024-08-03 15:17:00] [Rank 1] totoal_tokens=20304, outputs='Cars A and B'
[2024-08-03 15:17:01] [Rank 2] totoal_tokens=20591, outputs='Coffee'
[2024-08-03 15:17:01] [Rank 3] totoal_tokens=21212, outputs='Sperling'
[2024-08-03 15:17:01] [Rank 0] totoal_tokens=22422, outputs='Cara'
[2024-08-03 15:17:01] [Rank 1] totoal_tokens=20385, outputs='Morgan La Rue'
[2024-08-03 15:17:02] [Rank 2] totoal_tokens=20621, outputs='read the book'
[2024-08-03 15:17:03] [Rank 3] totoal_tokens=21281, outputs='Charlie'
[2024-08-03 15:17:03] [Rank 1] totoal_tokens=20584, outputs='annual'
[2024-08-03 15:17:03] [Rank 0] totoal_tokens=22451, outputs="Merrill's"
[2024-08-03 15:17:03] [Rank 2] totoal_tokens=20631, outputs='The software is ready for the enterprise.'
[2024-08-03 15:17:04] [Rank 1] totoal_tokens=20637, outputs='Investment C'
[2024-08-03 15:17:04] [Rank 2] totoal_tokens=20949, outputs='Earth'
[2024-08-03 15:17:04] [Rank 3] totoal_tokens=21328, outputs='Car A'
[2024-08-03 15:17:05] [Rank 0] totoal_tokens=22792, outputs='Store C'
[2024-08-03 15:17:05] [Rank 1] totoal_tokens=20791, outputs='Charles Darwin'
[2024-08-03 15:17:05] [Rank 2] totoal_tokens=21158, outputs='Jogging'
[2024-08-03 15:17:06] [Rank 3] totoal_tokens=21475, outputs='Peregrine falcon'
[2024-08-03 15:17:06] [Rank 1] totoal_tokens=20887, outputs='Football'
[2024-08-03 15:17:06] [Rank 0] totoal_tokens=23038, outputs='Charlie'
[2024-08-03 15:17:07] [Rank 2] totoal_tokens=21173, outputs='B'
[2024-08-03 15:17:07] [Rank 1] totoal_tokens=21149, outputs='The game ends'
[2024-08-03 15:17:07] [Rank 3] totoal_tokens=21640, outputs='Diet Plan B'
[2024-08-03 15:17:08] [Rank 0] totoal_tokens=23060, outputs='The original sequence consisted of multiple independent clips.'
[2024-08-03 15:17:08] [Rank 1] totoal_tokens=21171, outputs='Store X'
[2024-08-03 15:17:09] [Rank 0] totoal_tokens=23071, outputs='Africa'
[2024-08-03 15:17:09] [Rank 3] totoal_tokens=21654, outputs='The food is prepared by the chef.'
[2024-08-03 15:17:09] [Rank 2] totoal_tokens=21215, outputs='backyard'
[2024-08-03 15:17:10] [Rank 1] totoal_tokens=21182, outputs='Coffee'
[2024-08-03 15:17:10] [Rank 3] totoal_tokens=21701, outputs='Route X'
[2024-08-03 15:17:10] [Rank 0] totoal_tokens=23089, outputs='dog'
οΏ½β–ˆβ–ˆβ–ˆβ–ˆβ–Š | 441/751 [03:43<06:12, 1.20s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 59%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 442/751 [03:44<06:21, 1.23s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 59%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 443/751 [03:45<06:27, 1.26s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 59%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 444/751 [03:47<06:26, 1.26s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 59%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 445/751 [03:48<06:46, 1.33s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 59%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 446/751 [03:50<07:34, 1.49s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 60%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 447/751 [03:51<07:23, 1.46s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 60%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 448/751 [03:53<07:28, 1.48s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 60%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 449/751 [03:54<07:08, 1.42s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 60%|β–ˆβ–ˆβ–ˆ[2024-08-03 15:17:11] [Rank 2] totoal_tokens=21215, outputs='Go to bed'
[2024-08-03 15:17:11] [Rank 1] totoal_tokens=21184, outputs='Store B'
[2024-08-03 15:17:11] [Rank 3] totoal_tokens=22216, outputs='The finish line'
[2024-08-03 15:17:12] [Rank 0] totoal_tokens=23119, outputs='Metro'
[2024-08-03 15:17:12] [Rank 2] totoal_tokens=21308, outputs='The Scottish Housing Market'
[2024-08-03 15:17:12] [Rank 1] totoal_tokens=21231, outputs='Amazon'
[2024-08-03 15:17:13] [Rank 3] totoal_tokens=22234, outputs='Cara'
[2024-08-03 15:17:13] [Rank 0] totoal_tokens=23160, outputs='Read the book'
[2024-08-03 15:17:13] [Rank 2] totoal_tokens=21316, outputs='The Boys Are Back'
[2024-08-03 15:17:14] [Rank 3] totoal_tokens=22423, outputs='The food'
[2024-08-03 15:17:14] [Rank 1] totoal_tokens=21280, outputs='Samsung'
[2024-08-03 15:17:14] [Rank 0] totoal_tokens=23179, outputs='Owl'
[2024-08-03 15:17:14] [Rank 2] totoal_tokens=21322, outputs='Get coffee'
[2024-08-03 15:17:15] [Rank 3] totoal_tokens=22433, outputs='Dr. Smith'
[2024-08-03 15:17:16] [Rank 1] totoal_tokens=21311, outputs="The European Union's General Data Protection Regulation (GDPR) came into effect in"
[2024-08-03 15:17:16] [Rank 2] totoal_tokens=21493, outputs='She is a teacher'
[2024-08-03 15:17:16] [Rank 0] totoal_tokens=23266, outputs='The Ironman'
[2024-08-03 15:17:17] [Rank 1] totoal_tokens=21518, outputs='Get coffee'
[2024-08-03 15:17:18] [Rank 0] totoal_tokens=23319, outputs='Alpha'
[2024-08-03 15:17:18] [Rank 1] totoal_tokens=21519, outputs='The manager meets the manager.'
[2024-08-03 15:17:18] [Rank 3] totoal_tokens=22584, outputs='The food was prepared by the chef.'
[2024-08-03 15:17:19] [Rank 2] totoal_tokens=21614, outputs='Dragonfly'
[2024-08-03 15:17:19] [Rank 0] totoal_tokens=23328, outputs='Aberdeen'
[2024-08-03 15:17:19] [Rank 3] totoal_tokens=22649, outputs='Car A'
[2024-08-03 15:17:20] [Rank 1] totoal_tokens=21586, outputs='Dragonfly'
[2024-08-03 15:17:20] [Rank 2] totoal_tokens=21655, outputs='The car'
[2024-08-03 15:17:21] [Rank 0] totoal_tokens=23354, outputs='The manager meets the CEO.'
[2024-08-03 15:17:21] [Rank 3] totoal_tokens=22727, outputs='ice cream'
[2024-08-03 15:17:21] [Rank 1] totoal_tokens=21587, outputs='Chef'
[2024-08-03 15:17:22] [Rank 2] totoal_tokens=21778, outputs='The Colburn School'
[2024-08-03 15:17:22] [Rank 1] totoal_tokens=22135, outputs='Get coffee'
[2024-08-03 15:17:22] [Rank 3] totoal_tokens=23100, outputs='bleach'
[2024-08-03 15:17:22] [Rank 0] totoal_tokens=23355, outputs='ice cream'
[2024-08-03 15:17:23] [Rank 1] totoal_tokens=22316, outputs='Gordon Bajnai'
[2024-08-03 15:17:23] [Rank 2] totoal_tokens=21986, outputs='The final step in constructing the model is to add the final touches and make any'
[2024-08-03 15:17:23] [Rank 3] totoal_tokens=23125, outputs='Alice'
[2024-08-03 15:17:23] [Rank 0] totoal_tokens=23504, outputs='Basketball'
β–ˆβ–ˆβ–‰ | 450/751 [03:55<06:53, 1.37s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 60%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 451/751 [03:57<07:09, 1.43s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 60%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 452/751 [03:58<06:55, 1.39s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 60%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 453/751 [04:00<06:34, 1.32s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 60%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 454/751 [04:02<07:41, 1.55s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 61%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 455/751 [04:03<07:25, 1.50s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 61%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 456/751 [04:04<07:14, 1.47s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 61%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 457/751 [04:06<07:14, 1.48s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 61%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 458/751 [04:08<07:26, 1.52s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 61%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆοΏ½[2024-08-03 15:17:25] [Rank 1] totoal_tokens=22332, outputs='Wash his hands'
[2024-08-03 15:17:25] [Rank 2] totoal_tokens=22206, outputs='Store C'
[2024-08-03 15:17:25] [Rank 3] totoal_tokens=23127, outputs='The parade'
[2024-08-03 15:17:25] [Rank 0] totoal_tokens=23552, outputs='Store C'
[2024-08-03 15:17:26] [Rank 1] totoal_tokens=22464, outputs='Connect next'
[2024-08-03 15:17:26] [Rank 3] totoal_tokens=23213, outputs='Alice'
[2024-08-03 15:17:26] [Rank 2] totoal_tokens=22254, outputs='Tim'
[2024-08-03 15:17:26] [Rank 0] totoal_tokens=23745, outputs='The selling is working.'
[2024-08-03 15:17:27] [Rank 1] totoal_tokens=22659, outputs='Bill SkarsgΓ₯rd'
[2024-08-03 15:17:28] [Rank 3] totoal_tokens=23286, outputs='CEF No. 3'
[2024-08-03 15:17:28] [Rank 0] totoal_tokens=23851, outputs='Brooklyn'
[2024-08-03 15:17:28] [Rank 2] totoal_tokens=22405, outputs='The plant that grows the most is the one that is most likely to grow.'
[2024-08-03 15:17:29] [Rank 1] totoal_tokens=23031, outputs='Cook'
[2024-08-03 15:17:29] [Rank 3] totoal_tokens=23344, outputs='Editing'
[2024-08-03 15:17:29] [Rank 0] totoal_tokens=24038, outputs='Cactus'
[2024-08-03 15:17:30] [Rank 1] totoal_tokens=23134, outputs='The meeting starts at 3 PM.'
[2024-08-03 15:17:30] [Rank 3] totoal_tokens=23543, outputs='West'
[2024-08-03 15:17:30] [Rank 0] totoal_tokens=24429, outputs='Cacti'
[2024-08-03 15:17:31] [Rank 2] totoal_tokens=22581, outputs='Under the table'
[2024-08-03 15:17:31] [Rank 1] totoal_tokens=23154, outputs='She is a teacher'
[2024-08-03 15:17:31] [Rank 3] totoal_tokens=23739, outputs='Eat breakfast'
[2024-08-03 15:17:32] [Rank 0] totoal_tokens=24595, outputs='Instagram for Kids'
[2024-08-03 15:17:33] [Rank 1] totoal_tokens=23213, outputs='Get coffee'
[2024-08-03 15:17:33] [Rank 3] totoal_tokens=23771, outputs='Jamie'
[2024-08-03 15:17:34] [Rank 2] totoal_tokens=22584, outputs='Charlie'
[2024-08-03 15:17:34] [Rank 0] totoal_tokens=25031, outputs='Coffee'
[2024-08-03 15:17:34] [Rank 1] totoal_tokens=23736, outputs='The final step in constructing the model is to use the model to make predictions about'
[2024-08-03 15:17:34] [Rank 3] totoal_tokens=23958, outputs='The oak tree is planted in the front yard.'
[2024-08-03 15:17:35] [Rank 2] totoal_tokens=22790, outputs='Brush his teeth'
[2024-08-03 15:17:35] [Rank 0] totoal_tokens=25065, outputs='Porsche'
[2024-08-03 15:17:36] [Rank 1] totoal_tokens=23781, outputs='Store C'
[2024-08-03 15:17:36] [Rank 2] totoal_tokens=23083, outputs='Earth'
[2024-08-03 15:17:36] [Rank 3] totoal_tokens=24032, outputs='Auction'
[2024-08-03 15:17:37] [Rank 0] totoal_tokens=25086, outputs='Tetsuo Kurata'
οΏ½οΏ½ | 459/751 [04:09<06:59, 1.44s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 61%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 460/751 [04:10<07:00, 1.45s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 61%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 461/751 [04:12<06:57, 1.44s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 62%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 462/751 [04:13<07:00, 1.45s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 62%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 463/751 [04:14<06:39, 1.39s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 62%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 464/751 [04:16<06:38, 1.39s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 62%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 465/751 [04:17<06:28, 1.36s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 62%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 466/751 [04:19<07:25, 1.56s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 62%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 467/751 [04:20<07:02, 1.49s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 62%|β–ˆβ–ˆοΏ½[2024-08-03 15:17:37] [Rank 1] totoal_tokens=23785, outputs='Lunch'
[2024-08-03 15:17:38] [Rank 2] totoal_tokens=23093, outputs='Cara'
[2024-08-03 15:17:38] [Rank 3] totoal_tokens=24045, outputs='Jun'
[2024-08-03 15:17:39] [Rank 2] totoal_tokens=23484, outputs='Frank'
[2024-08-03 15:17:40] [Rank 1] totoal_tokens=24153, outputs='yellow book'
[2024-08-03 15:17:40] [Rank 0] totoal_tokens=25120, outputs='What happens after testing?'
[2024-08-03 15:17:40] [Rank 3] totoal_tokens=24138, outputs='Diet Plan A'
[2024-08-03 15:17:41] [Rank 2] totoal_tokens=23486, outputs='The oak tree'
[2024-08-03 15:17:42] [Rank 3] totoal_tokens=24189, outputs='Pomellato: Since 1967'
[2024-08-03 15:17:42] [Rank 1] totoal_tokens=24226, outputs='The Double Chill Speakers are great.'
[2024-08-03 15:17:42] [Rank 2] totoal_tokens=23613, outputs='bleach'
[2024-08-03 15:17:43] [Rank 0] totoal_tokens=25188, outputs='Get a book'
[2024-08-03 15:17:43] [Rank 1] totoal_tokens=24258, outputs='Alice'
[2024-08-03 15:17:43] [Rank 3] totoal_tokens=24501, outputs='Bretman Rock'
[2024-08-03 15:17:44] [Rank 0] totoal_tokens=25216, outputs='The final step in constructing the model is to use the model to answer the question'
[2024-08-03 15:17:44] [Rank 2] totoal_tokens=23783, outputs='Cooking'
[2024-08-03 15:17:45] [Rank 1] totoal_tokens=24341, outputs='Car B'
[2024-08-03 15:17:45] [Rank 3] totoal_tokens=24523, outputs='The Great Wall of China'
[2024-08-03 15:17:46] [Rank 0] totoal_tokens=25222, outputs='Watch TV'
[2024-08-03 15:17:46] [Rank 1] totoal_tokens=24385, outputs='A food delivery service'
[2024-08-03 15:17:46] [Rank 3] totoal_tokens=24744, outputs='annual'
[2024-08-03 15:17:46] [Rank 2] totoal_tokens=24133, outputs='Cactus'
[2024-08-03 15:17:47] [Rank 0] totoal_tokens=25312, outputs='Roger Federer'
[2024-08-03 15:17:47] [Rank 1] totoal_tokens=24533, outputs='RΓ©my Martin'
[2024-08-03 15:17:48] [Rank 2] totoal_tokens=24136, outputs='Cactus'
[2024-08-03 15:17:49] [Rank 1] totoal_tokens=24612, outputs='Tiemeyer'
[2024-08-03 15:17:49] [Rank 3] totoal_tokens=24894, outputs='The final step in constructing the model is to assemble the model.'
[2024-08-03 15:17:49] [Rank 2] totoal_tokens=24207, outputs='Dubai'
[2024-08-03 15:17:50] [Rank 0] totoal_tokens=25428, outputs='tulip'
[2024-08-03 15:17:50] [Rank 1] totoal_tokens=24737, outputs='Alice'
[2024-08-03 15:17:51] [Rank 3] totoal_tokens=25052, outputs='Get plant specimens from table'
[2024-08-03 15:17:51] [Rank 2] totoal_tokens=24259, outputs='The 2007 global economic downturn'
[2024-08-03 15:17:52] [Rank 0] totoal_tokens=25630, outputs='Ricoh'
[2024-08-03 15:17:52] [Rank 1] totoal_tokens=24758, outputs='Dr. Allen'
[2024-08-03 15:17:52] [Rank 3] totoal_tokens=25126, outputs='Dr. Phil'
[2024-08-03 15:17:52] [Rank 2] totoal_tokens=24259, outputs='Ancient Corinth'
[2024-08-03 15:17:53] [Rank 0] totoal_tokens=25710, outputs='People'
[2024-08-03 15:17:53] [Rank 1] totoal_tokens=24771, outputs='Get her backpack'
[2024-08-03 15:17:53] [Rank 3] totoal_tokens=25195, outputs='Aloe vera'
[2024-08-03 15:17:54] [Rank 2] totoal_tokens=24462, outputs='Impressionist'
[2024-08-03 15:17:54] [Rank 1] totoal_tokens=24972, outputs='backyard'
[2024-08-03 15:17:55] [Rank 0] totoal_tokens=26021, outputs='New York Public Library'
οΏ½οΏ½β–ˆβ–ˆβ–ˆβ– | 468/751 [04:22<07:14, 1.53s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 62%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 469/751 [04:25<09:18, 1.98s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 63%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 470/751 [04:28<10:36, 2.27s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 63%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 471/751 [04:30<09:44, 2.09s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 63%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 472/751 [04:31<08:38, 1.86s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 63%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 473/751 [04:33<08:15, 1.78s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 63%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 474/751 [04:35<09:20, 2.02s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 63%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 475/751 [04:37<08:52, 1.93s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 63%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 476/751 [04:38<08:20, 1.82s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: [2024-08-03 15:17:55] [Rank 3] totoal_tokens=25195, outputs='Cara plants an oak tree next to the pine tree in the backyard.'
[2024-08-03 15:17:56] [Rank 2] totoal_tokens=24548, outputs='Ragdoll'
[2024-08-03 15:17:56] [Rank 1] totoal_tokens=25049, outputs='Paris'
[2024-08-03 15:17:56] [Rank 0] totoal_tokens=26097, outputs='Get ready to go'
[2024-08-03 15:17:57] [Rank 3] totoal_tokens=25231, outputs='Dr. Zhang'
[2024-08-03 15:17:57] [Rank 1] totoal_tokens=25108, outputs='Get ready for the day'
[2024-08-03 15:17:58] [Rank 2] totoal_tokens=24592, outputs='The couch is behind Tim.'
[2024-08-03 15:17:58] [Rank 0] totoal_tokens=26112, outputs='Alice'
[2024-08-03 15:17:58] [Rank 3] totoal_tokens=25277, outputs='Go jogging'
[2024-08-03 15:17:59] [Rank 1] totoal_tokens=25284, outputs='Lysol'
[2024-08-03 15:17:59] [Rank 2] totoal_tokens=24598, outputs='Instagram for Kids'
[2024-08-03 15:18:00] [Rank 0] totoal_tokens=26130, outputs='The Catcher in the Rye'
[2024-08-03 15:18:00] [Rank 1] totoal_tokens=25289, outputs='Fukushima'
[2024-08-03 15:18:01] [Rank 3] totoal_tokens=25287, outputs='The triathlon started with swimming.'
[2024-08-03 15:18:01] [Rank 2] totoal_tokens=24600, outputs='The final step in constructing the model is painting the model.'
[2024-08-03 15:18:02] [Rank 1] totoal_tokens=25447, outputs='apple'
[2024-08-03 15:18:02] [Rank 0] totoal_tokens=26161, outputs='The Sweeper'
[2024-08-03 15:18:02] [Rank 2] totoal_tokens=24612, outputs='Tiemeyer'
[2024-08-03 15:18:03] [Rank 3] totoal_tokens=25620, outputs='Get watermelon from the other person'
[2024-08-03 15:18:03] [Rank 1] totoal_tokens=25461, outputs='Jonathan Groff'
[2024-08-03 15:18:03] [Rank 0] totoal_tokens=26223, outputs='Cara'
[2024-08-03 15:18:04] [Rank 3] totoal_tokens=25695, outputs='Y'
[2024-08-03 15:18:04] [Rank 2] totoal_tokens=25222, outputs='Cactus'
[2024-08-03 15:18:04] [Rank 1] totoal_tokens=25543, outputs='June 9'
[2024-08-03 15:18:05] [Rank 0] totoal_tokens=26287, outputs='The finish line'
[2024-08-03 15:18:05] [Rank 3] totoal_tokens=25718, outputs='The stock market'
[2024-08-03 15:18:06] [Rank 2] totoal_tokens=25232, outputs='Diet Plan C'
[2024-08-03 15:18:06] [Rank 0] totoal_tokens=26318, outputs='Wasp'
[2024-08-03 15:18:06] [Rank 1] totoal_tokens=25834, outputs='Earth'
[2024-08-03 15:18:07] [Rank 2] totoal_tokens=25383, outputs='Alexandre Mourot'
[2024-08-03 15:18:08] [Rank 0] totoal_tokens=26360, outputs='Adam'
[2024-08-03 15:18:08] [Rank 3] totoal_tokens=26034, outputs='oak'
[2024-08-03 15:18:08] [Rank 1] totoal_tokens=26051, outputs='Go to bed'
[2024-08-03 15:18:09] [Rank 2] totoal_tokens=25481, outputs='Draw the main road as a rectangle.'
[2024-08-03 15:18:09] [Rank 3] totoal_tokens=26096, outputs='Painting'
[2024-08-03 15:18:09] [Rank 0] totoal_tokens=26534, outputs='Gourmet food stores'
64%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 477/751 [04:40<08:14, 1.80s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 64%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 478/751 [04:42<07:43, 1.70s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 64%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 479/751 [04:43<07:12, 1.59s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 64%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 480/751 [04:45<07:35, 1.68s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 64%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 481/751 [04:47<08:17, 1.84s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 64%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 482/751 [04:48<07:35, 1.69s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 64%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 483/751 [04:50<07:17, 1.63s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 64%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 484/751 [04:51<06:56, 1.56s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 65%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 485/751 [04:53<06:48, 1.54s/it] Processing InternVL2-2B_reasoning-text-te[2024-08-03 15:18:10] [Rank 1] totoal_tokens=26101, outputs='matzo'
[2024-08-03 15:18:10] [Rank 2] totoal_tokens=25711, outputs='Get coffee'
[2024-08-03 15:18:11] [Rank 3] totoal_tokens=26131, outputs='Sugarcane'
[2024-08-03 15:18:11] [Rank 1] totoal_tokens=26163, outputs='Apple'
[2024-08-03 15:18:12] [Rank 0] totoal_tokens=26711, outputs='Investment C'
[2024-08-03 15:18:12] [Rank 3] totoal_tokens=26225, outputs='Jean'
[2024-08-03 15:18:12] [Rank 2] totoal_tokens=25824, outputs='Lamborghini Urus'
[2024-08-03 15:18:13] [Rank 1] totoal_tokens=26265, outputs="The couch is behind Tim's wallet."
[2024-08-03 15:18:14] [Rank 3] totoal_tokens=26257, outputs='The Monster'
[2024-08-03 15:18:14] [Rank 2] totoal_tokens=26049, outputs='Grocery stores'
[2024-08-03 15:18:14] [Rank 1] totoal_tokens=26269, outputs='The main course'
[2024-08-03 15:18:15] [Rank 0] totoal_tokens=27057, outputs='Davis'
[2024-08-03 15:18:16] [Rank 3] totoal_tokens=26431, outputs='in the backyard'
[2024-08-03 15:18:16] [Rank 1] totoal_tokens=26340, outputs='The production'
[2024-08-03 15:18:16] [Rank 0] totoal_tokens=27079, outputs='Product A'
[2024-08-03 15:18:16] [Rank 2] totoal_tokens=26049, outputs='Junior'
[2024-08-03 15:18:18] [Rank 1] totoal_tokens=26377, outputs='The MassResistance handout.'
[2024-08-03 15:18:18] [Rank 3] totoal_tokens=26505, outputs='Aung San Suu Kyi'
[2024-08-03 15:18:18] [Rank 0] totoal_tokens=27179, outputs='Drew Stankiewicz'
[2024-08-03 15:18:18] [Rank 2] totoal_tokens=26226, outputs='A store that receives its goods last is called a "last store."'
[2024-08-03 15:18:19] [Rank 3] totoal_tokens=26714, outputs='She is a teacher'
[2024-08-03 15:18:19] [Rank 0] totoal_tokens=27238, outputs='Bob'
[2024-08-03 15:18:19] [Rank 1] totoal_tokens=26483, outputs='mallow'
[2024-08-03 15:18:20] [Rank 2] totoal_tokens=26346, outputs='Larry Correia'
[2024-08-03 15:18:21] [Rank 3] totoal_tokens=27079, outputs='The least valuable item is the jewelry.'
[2024-08-03 15:18:21] [Rank 1] totoal_tokens=26551, outputs='Samsung'
[2024-08-03 15:18:21] [Rank 0] totoal_tokens=27239, outputs='Corn'
[2024-08-03 15:18:21] [Rank 2] totoal_tokens=26350, outputs='Editing'
[2024-08-03 15:18:23] [Rank 1] totoal_tokens=26667, outputs='dessert'
[2024-08-03 15:18:23] [Rank 0] totoal_tokens=27240, outputs='Alice'
[2024-08-03 15:18:23] [Rank 3] totoal_tokens=27131, outputs='Brian Yazzie'
[2024-08-03 15:18:23] [Rank 2] totoal_tokens=26456, outputs='The walls'
[2024-08-03 15:18:24] [Rank 1] totoal_tokens=27097, outputs='Diet Plan B'
[2024-08-03 15:18:24] [Rank 0] totoal_tokens=27288, outputs='Alice'
[2024-08-03 15:18:25] [Rank 3] totoal_tokens=27219, outputs='Store C'
[2024-08-03 15:18:25] [Rank 2] totoal_tokens=26550, outputs='Samsung'
[2024-08-03 15:18:26] [Rank 0] totoal_tokens=27309, outputs='Bob'
st.jsonl: 65%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 486/751 [04:54<06:57, 1.58s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 65%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 487/751 [04:57<08:32, 1.94s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 65%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 488/751 [05:00<09:34, 2.19s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 65%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 489/751 [05:01<08:34, 1.96s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 65%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 490/751 [05:03<08:11, 1.88s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 65%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 491/751 [05:05<07:41, 1.78s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 66%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 492/751 [05:07<07:46, 1.80s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 66%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 493/751 [05:08<07:25, 1.73s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 66%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 494/751 [05:10<07:06, 1.66s/it] Processing InternVL2-2B_reason[2024-08-03 15:18:26] [Rank 1] totoal_tokens=27275, outputs='Earth'
[2024-08-03 15:18:26] [Rank 3] totoal_tokens=27297, outputs='Dragon Age 4'
[2024-08-03 15:18:27] [Rank 2] totoal_tokens=27142, outputs='Rachel Margolis'
[2024-08-03 15:18:28] [Rank 0] totoal_tokens=27436, outputs='Joe'
[2024-08-03 15:18:28] [Rank 1] totoal_tokens=27295, outputs='Parker Smith'
[2024-08-03 15:18:28] [Rank 3] totoal_tokens=27761, outputs='Car A'
[2024-08-03 15:18:28] [Rank 2] totoal_tokens=27204, outputs='Calvin Coolidge'
[2024-08-03 15:18:29] [Rank 1] totoal_tokens=27306, outputs='Next to the couch'
[2024-08-03 15:18:30] [Rank 3] totoal_tokens=27945, outputs='The manager meets the CEO.'
[2024-08-03 15:18:30] [Rank 2] totoal_tokens=27213, outputs='Banana'
[2024-08-03 15:18:30] [Rank 0] totoal_tokens=27755, outputs='Hemp'
[2024-08-03 15:18:31] [Rank 1] totoal_tokens=27410, outputs='Lunch'
[2024-08-03 15:18:31] [Rank 2] totoal_tokens=27272, outputs='Route X'
[2024-08-03 15:18:31] [Rank 0] totoal_tokens=27829, outputs='Cake'
[2024-08-03 15:18:32] [Rank 3] totoal_tokens=28055, outputs='Main event fighters Oleksandr Usyk and Anthony Joshua, who have'
[2024-08-03 15:18:33] [Rank 1] totoal_tokens=27499, outputs='Cara'
[2024-08-03 15:18:33] [Rank 2] totoal_tokens=27279, outputs='The Greatest Trade Ever Seen'
[2024-08-03 15:18:33] [Rank 3] totoal_tokens=28108, outputs='Music'
[2024-08-03 15:18:34] [Rank 0] totoal_tokens=28136, outputs='Sue'
[2024-08-03 15:18:34] [Rank 1] totoal_tokens=27553, outputs='Rue'
[2024-08-03 15:18:35] [Rank 2] totoal_tokens=27280, outputs='Jurassic World'
[2024-08-03 15:18:35] [Rank 3] totoal_tokens=28142, outputs='Ed Bassmaster'
[2024-08-03 15:18:35] [Rank 0] totoal_tokens=28216, outputs='Chef'
[2024-08-03 15:18:36] [Rank 1] totoal_tokens=27663, outputs='The painting was acquired by the Metropolitan Museum of Art.'
[2024-08-03 15:18:36] [Rank 2] totoal_tokens=27401, outputs='Car A'
[2024-08-03 15:18:37] [Rank 3] totoal_tokens=28167, outputs='Asia'
[2024-08-03 15:18:38] [Rank 1] totoal_tokens=27827, outputs='Glass'
[2024-08-03 15:18:38] [Rank 2] totoal_tokens=27442, outputs='Tall grass'
[2024-08-03 15:18:38] [Rank 3] totoal_tokens=28244, outputs='Earth'
[2024-08-03 15:18:38] [Rank 0] totoal_tokens=28217, outputs='The test is passed.'
[2024-08-03 15:18:40] [Rank 1] totoal_tokens=28102, outputs='A store that receives its goods last is the store that receives its goods last.'
[2024-08-03 15:18:40] [Rank 2] totoal_tokens=27453, outputs='The Girl on the Train'
[2024-08-03 15:18:40] [Rank 0] totoal_tokens=28221, outputs='Coca-Cola'
[2024-08-03 15:18:40] [Rank 3] totoal_tokens=28258, outputs='Homo habilis'
[2024-08-03 15:18:41] [Rank 1] totoal_tokens=28137, outputs='Diet Plan C'
[2024-08-03 15:18:41] [Rank 2] totoal_tokens=27556, outputs='Cara'
[2024-08-03 15:18:42] [Rank 0] totoal_tokens=28242, outputs='The office is set up with all the necessary equipment and tools.'
[2024-08-03 15:18:42] [Rank 3] totoal_tokens=28492, outputs='Jung'
[2024-08-03 15:18:43] [Rank 2] totoal_tokens=27938, outputs='Ant'
[2024-08-03 15:18:43] [Rank 1] totoal_tokens=28156, outputs='Peregrine falcon'
[2024-08-03 15:18:43] [Rank 0] totoal_tokens=28335, outputs='Cake'
ing-text-test.jsonl: 66%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 495/751 [05:11<07:02, 1.65s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 66%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 496/751 [05:13<07:03, 1.66s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 66%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 497/751 [05:15<07:52, 1.86s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 66%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 498/751 [05:17<07:18, 1.73s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 66%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 499/751 [05:19<07:53, 1.88s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 67%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 500/751 [05:20<07:25, 1.78s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 67%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 501/751 [05:24<09:02, 2.17s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 67%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 502/751 [05:25<08:13, 1.98s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 67%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 503/751 [05:27<08:00, 1.94s/it] Processing InternVL[2024-08-03 15:18:44] [Rank 3] totoal_tokens=28613, outputs='Investment C'
[2024-08-03 15:18:44] [Rank 2] totoal_tokens=28052, outputs='Diet plan C'
[2024-08-03 15:18:44] [Rank 1] totoal_tokens=28250, outputs='Diet plan A'
[2024-08-03 15:18:45] [Rank 0] totoal_tokens=28359, outputs='Hike'
[2024-08-03 15:18:45] [Rank 3] totoal_tokens=29171, outputs='Eagle'
[2024-08-03 15:18:47] [Rank 0] totoal_tokens=28537, outputs='The festival concludes with a concert.'
[2024-08-03 15:18:47] [Rank 1] totoal_tokens=28250, outputs='The final event of the festival is the award ceremony.'
[2024-08-03 15:18:47] [Rank 3] totoal_tokens=29270, outputs='Bird 1'
[2024-08-03 15:18:47] [Rank 2] totoal_tokens=28054, outputs='The 2022 Golf Outing'
[2024-08-03 15:18:48] [Rank 0] totoal_tokens=28550, outputs='RPN'
[2024-08-03 15:18:49] [Rank 3] totoal_tokens=29483, outputs='Run'
[2024-08-03 15:18:49] [Rank 2] totoal_tokens=28120, outputs='The finish line'
[2024-08-03 15:18:49] [Rank 1] totoal_tokens=28251, outputs='Whole Foods'
[2024-08-03 15:18:50] [Rank 0] totoal_tokens=28595, outputs='SmashCon'
[2024-08-03 15:18:50] [Rank 2] totoal_tokens=28257, outputs='apple'
[2024-08-03 15:18:51] [Rank 1] totoal_tokens=28287, outputs='Carl Ryden'
[2024-08-03 15:18:52] [Rank 2] totoal_tokens=28267, outputs='Aloe Vera'
[2024-08-03 15:18:52] [Rank 3] totoal_tokens=29596, outputs='Vodafone'
[2024-08-03 15:18:52] [Rank 0] totoal_tokens=28666, outputs='Alexander Graham Bell'
[2024-08-03 15:18:53] [Rank 1] totoal_tokens=28358, outputs='Watch TV'
[2024-08-03 15:18:53] [Rank 2] totoal_tokens=28279, outputs='Pumpkin'
[2024-08-03 15:18:53] [Rank 3] totoal_tokens=30074, outputs='Pine'
[2024-08-03 15:18:54] [Rank 0] totoal_tokens=29042, outputs='Gertrude Koch'
[2024-08-03 15:18:55] [Rank 2] totoal_tokens=28279, outputs='Mikie Okamine'
[2024-08-03 15:18:55] [Rank 1] totoal_tokens=29109, outputs='Hieu was released from prison in 2013.'
[2024-08-03 15:18:55] [Rank 3] totoal_tokens=30109, outputs='Dragonfly'
[2024-08-03 15:18:56] [Rank 2] totoal_tokens=28301, outputs='ice cream'
[2024-08-03 15:18:56] [Rank 0] totoal_tokens=29200, outputs='oak'
[2024-08-03 15:18:56] [Rank 1] totoal_tokens=29160, outputs='C'
[2024-08-03 15:18:57] [Rank 3] totoal_tokens=30400, outputs='The test is passed.'
[2024-08-03 15:18:58] [Rank 1] totoal_tokens=29203, outputs='Tulips'
[2024-08-03 15:18:58] [Rank 2] totoal_tokens=28341, outputs='The festival ends with a concert.'
[2024-08-03 15:18:58] [Rank 0] totoal_tokens=29349, outputs='Pigeon'
[2024-08-03 15:18:59] [Rank 3] totoal_tokens=30906, outputs='Sergey Shoygu'
[2024-08-03 15:19:00] [Rank 2] totoal_tokens=28390, outputs='on top of table'
[2024-08-03 15:19:00] [Rank 1] totoal_tokens=29395, outputs='Get his backpack'
[2024-08-03 15:19:00] [Rank 0] totoal_tokens=29605, outputs='Get his backpack'
2-2B_reasoning-text-test.jsonl: 67%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 504/751 [05:28<07:26, 1.81s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 67%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 505/751 [05:30<07:05, 1.73s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 67%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 506/751 [05:32<07:30, 1.84s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 68%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 507/751 [05:34<07:05, 1.74s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 68%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 508/751 [05:35<06:51, 1.70s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 68%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 509/751 [05:38<07:47, 1.93s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 68%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 510/751 [05:39<07:39, 1.91s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 68%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 511/751 [05:42<07:56, 1.99s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 68%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 512/751 [05:44<07:44, 1.94s/it] Processi[2024-08-03 15:19:00] [Rank 3] totoal_tokens=31035, outputs='The player is taken off the field and is suspended for a game.'
[2024-08-03 15:19:01] [Rank 2] totoal_tokens=29018, outputs='South'
[2024-08-03 15:19:02] [Rank 1] totoal_tokens=29541, outputs='Mauritanian'
[2024-08-03 15:19:02] [Rank 0] totoal_tokens=30196, outputs='Work'
[2024-08-03 15:19:03] [Rank 2] totoal_tokens=29077, outputs='A'
[2024-08-03 15:19:03] [Rank 3] totoal_tokens=31044, outputs='Gaytheist'
[2024-08-03 15:19:03] [Rank 0] totoal_tokens=30215, outputs='Pink flamingo'
[2024-08-03 15:19:03] [Rank 1] totoal_tokens=29727, outputs='June'
[2024-08-03 15:19:04] [Rank 2] totoal_tokens=29082, outputs='New York'
[2024-08-03 15:19:05] [Rank 3] totoal_tokens=31121, outputs='Linda Steele'
[2024-08-03 15:19:05] [Rank 1] totoal_tokens=29814, outputs='Coyote'
[2024-08-03 15:19:06] [Rank 2] totoal_tokens=29155, outputs='Banana'
[2024-08-03 15:19:07] [Rank 0] totoal_tokens=30423, outputs='Jordan Evans'
[2024-08-03 15:19:07] [Rank 2] totoal_tokens=29314, outputs='The results are reported to the public'
[2024-08-03 15:19:09] [Rank 1] totoal_tokens=30103, outputs='oak'
[2024-08-03 15:19:09] [Rank 0] totoal_tokens=30462, outputs='Get her backpack'
[2024-08-03 15:19:09] [Rank 3] totoal_tokens=31152, outputs='backyard'
[2024-08-03 15:19:11] [Rank 2] totoal_tokens=29388, outputs='Dr. Aduwo'
[2024-08-03 15:19:11] [Rank 0] totoal_tokens=30540, outputs='He was reading'
[2024-08-03 15:19:11] [Rank 3] totoal_tokens=31217, outputs='Oil'
[2024-08-03 15:19:11] [Rank 1] totoal_tokens=30373, outputs='Earth'
[2024-08-03 15:19:12] [Rank 2] totoal_tokens=29613, outputs='Diet plan B'
[2024-08-03 15:19:13] [Rank 3] totoal_tokens=31302, outputs='The Great Barrier Reef'
[2024-08-03 15:19:13] [Rank 0] totoal_tokens=30710, outputs='Wiggles at left'
[2024-08-03 15:19:13] [Rank 1] totoal_tokens=30615, outputs='Megadeth'
[2024-08-03 15:19:14] [Rank 2] totoal_tokens=29627, outputs='children'
[2024-08-03 15:19:15] [Rank 3] totoal_tokens=31335, outputs='The manager meets the team.'
[2024-08-03 15:19:15] [Rank 1] totoal_tokens=30970, outputs='Dave'
[2024-08-03 15:19:15] [Rank 0] totoal_tokens=30888, outputs='Pelican'
[2024-08-03 15:19:16] [Rank 2] totoal_tokens=30059, outputs='World Series'
[2024-08-03 15:19:17] [Rank 1] totoal_tokens=31057, outputs='Next to the coffee table'
[2024-08-03 15:19:17] [Rank 3] totoal_tokens=31491, outputs='annual'
[2024-08-03 15:19:17] [Rank 0] totoal_tokens=31031, outputs='A store that receives its goods last is the grocery store.'
ng InternVL2-2B_reasoning-text-test.jsonl: 68%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 513/751 [05:45<07:29, 1.89s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 68%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 514/751 [05:47<07:20, 1.86s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 69%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 515/751 [05:49<07:01, 1.78s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 69%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 516/751 [05:52<08:53, 2.27s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 69%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 517/751 [05:54<08:36, 2.21s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 69%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 518/751 [05:56<08:08, 2.10s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 69%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 519/751 [05:58<08:15, 2.13s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 69%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 520/751 [06:01<08:31, 2.22s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 69%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 521/751 [06:03<08:22, 2.19s/i[2024-08-03 15:19:18] [Rank 2] totoal_tokens=30308, outputs='I'
[2024-08-03 15:19:19] [Rank 3] totoal_tokens=31845, outputs='Investment C'
[2024-08-03 15:19:19] [Rank 0] totoal_tokens=31101, outputs='on the coffee table'
[2024-08-03 15:19:19] [Rank 2] totoal_tokens=30793, outputs='The manager meets the manager.'
[2024-08-03 15:19:19] [Rank 1] totoal_tokens=31092, outputs='Dawn'
[2024-08-03 15:19:21] [Rank 3] totoal_tokens=32031, outputs='Jesus'
[2024-08-03 15:19:21] [Rank 1] totoal_tokens=31093, outputs='Hippocrates'
[2024-08-03 15:19:22] [Rank 0] totoal_tokens=31145, outputs='Ladybug'
[2024-08-03 15:19:22] [Rank 2] totoal_tokens=31231, outputs='Pandora Papers'
[2024-08-03 15:19:22] [Rank 3] totoal_tokens=32066, outputs='Dr. Marc Pietryzkowski'
[2024-08-03 15:19:23] [Rank 1] totoal_tokens=31241, outputs='Honey Badger'
[2024-08-03 15:19:24] [Rank 0] totoal_tokens=31164, outputs='KR'
[2024-08-03 15:19:24] [Rank 2] totoal_tokens=31263, outputs='GTA'
[2024-08-03 15:19:24] [Rank 3] totoal_tokens=32111, outputs='The office is set up in the basement of the house.'
[2024-08-03 15:19:26] [Rank 1] totoal_tokens=31329, outputs='Get her backpack'
[2024-08-03 15:19:26] [Rank 0] totoal_tokens=31267, outputs='Pink Puffin'
[2024-08-03 15:19:27] [Rank 2] totoal_tokens=31265, outputs='Northern India'
[2024-08-03 15:19:27] [Rank 3] totoal_tokens=32207, outputs='Amazon'
[2024-08-03 15:19:28] [Rank 0] totoal_tokens=31307, outputs='Get coffee'
[2024-08-03 15:19:28] [Rank 1] totoal_tokens=32045, outputs='Apple'
[2024-08-03 15:19:29] [Rank 3] totoal_tokens=32477, outputs='The Ford Fiesta'
[2024-08-03 15:19:30] [Rank 0] totoal_tokens=31410, outputs='Car A'
[2024-08-03 15:19:30] [Rank 2] totoal_tokens=31286, outputs='Samsung'
[2024-08-03 15:19:31] [Rank 1] totoal_tokens=32051, outputs='The investment with the highest annual return is the stock of the company.'
[2024-08-03 15:19:31] [Rank 3] totoal_tokens=32677, outputs='The development of the playbook'
[2024-08-03 15:19:31] [Rank 0] totoal_tokens=31841, outputs='Cactus'
[2024-08-03 15:19:32] [Rank 2] totoal_tokens=32078, outputs='Diet Plan C'
[2024-08-03 15:19:33] [Rank 1] totoal_tokens=32064, outputs='Jen Harley'
[2024-08-03 15:19:33] [Rank 0] totoal_tokens=32152, outputs='Earth'
[2024-08-03 15:19:33] [Rank 3] totoal_tokens=33129, outputs='The Sound of Music'
[2024-08-03 15:19:34] [Rank 2] totoal_tokens=32374, outputs='TomTom'
[2024-08-03 15:19:35] [Rank 1] totoal_tokens=32206, outputs='at the end of the day'
[2024-08-03 15:19:35] [Rank 0] totoal_tokens=32204, outputs='Get ready for work'
t] Processing InternVL2-2B_reasoning-text-test.jsonl: 70%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 522/751 [06:05<08:00, 2.10s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 70%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 523/751 [06:07<08:13, 2.16s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 70%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 524/751 [06:09<07:51, 2.08s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 70%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 525/751 [06:11<08:24, 2.23s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 70%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 526/751 [06:13<07:54, 2.11s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 70%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 527/751 [06:15<07:29, 2.01s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 70%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 528/751 [06:17<07:14, 1.95s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 70%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 529/751 [06:19<07:01, 1.90s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 71%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 530/751 [06:21<07:0[2024-08-03 15:19:35] [Rank 3] totoal_tokens=33286, outputs='The fish is fed first.'
[2024-08-03 15:19:36] [Rank 2] totoal_tokens=33053, outputs='Get ready for work'
[2024-08-03 15:19:37] [Rank 1] totoal_tokens=32631, outputs='The concert'
[2024-08-03 15:19:37] [Rank 0] totoal_tokens=32290, outputs='The Hobbit'
[2024-08-03 15:19:38] [Rank 3] totoal_tokens=33293, outputs='The plant that grows the slowest is the Giant Panda.'
[2024-08-03 15:19:38] [Rank 2] totoal_tokens=33084, outputs='At the end of the day'
[2024-08-03 15:19:39] [Rank 1] totoal_tokens=33080, outputs='Get dressed'
[2024-08-03 15:19:40] [Rank 0] totoal_tokens=32554, outputs='Get ready for work'
[2024-08-03 15:19:40] [Rank 3] totoal_tokens=33343, outputs='Apple'
[2024-08-03 15:19:40] [Rank 2] totoal_tokens=33099, outputs='Pontiac'
[2024-08-03 15:19:42] [Rank 1] totoal_tokens=33166, outputs='The final venue was the Kursaal-Dunkerque Wine and Beer Festival'
[2024-08-03 15:19:42] [Rank 0] totoal_tokens=32644, outputs='Female'
[2024-08-03 15:19:43] [Rank 2] totoal_tokens=33196, outputs='Ladybug'
[2024-08-03 15:19:44] [Rank 3] totoal_tokens=34206, outputs='on top of the couch'
[2024-08-03 15:19:44] [Rank 1] totoal_tokens=33209, outputs='Banana'
[2024-08-03 15:19:45] [Rank 2] totoal_tokens=33398, outputs='The finish line'
[2024-08-03 15:19:45] [Rank 0] totoal_tokens=33162, outputs='Dr. Johnson'
[2024-08-03 15:19:46] [Rank 3] totoal_tokens=34502, outputs='Route X'
[2024-08-03 15:19:46] [Rank 1] totoal_tokens=33277, outputs='Toyota'
[2024-08-03 15:19:47] [Rank 0] totoal_tokens=33196, outputs='Surrealism'
[2024-08-03 15:19:47] [Rank 2] totoal_tokens=34101, outputs='ice cream'
[2024-08-03 15:19:48] [Rank 3] totoal_tokens=34536, outputs='Bella da Ball'
[2024-08-03 15:19:49] [Rank 1] totoal_tokens=33375, outputs='The baguette is now being churned out by 24-hour automatic'
[2024-08-03 15:19:49] [Rank 0] totoal_tokens=34068, outputs='March 10'
[2024-08-03 15:19:49] [Rank 2] totoal_tokens=34636, outputs='Bob'
[2024-08-03 15:19:51] [Rank 3] totoal_tokens=34853, outputs='Whooping cough'
[2024-08-03 15:19:51] [Rank 2] totoal_tokens=35094, outputs='Satan'
[2024-08-03 15:19:51] [Rank 0] totoal_tokens=34127, outputs='Willow'
[2024-08-03 15:19:53] [Rank 3] totoal_tokens=35093, outputs='Pokey'
[2024-08-03 15:19:53] [Rank 1] totoal_tokens=34055, outputs='The phone rings'
[2024-08-03 15:19:54] [Rank 0] totoal_tokens=34289, outputs='PayPal'
[2024-08-03 15:19:54] [Rank 2] totoal_tokens=35100, outputs='Oil painting'
[2024-08-03 15:19:55] [Rank 3] totoal_tokens=35586, outputs='The Beatles'
[2024-08-03 15:19:55] [Rank 1] totoal_tokens=34078, outputs='South'
[2024-08-03 15:19:56] [Rank 2] totoal_tokens=35202, outputs='Joseph Young'
[2024-08-03 15:19:56] [Rank 0] totoal_tokens=34665, outputs='Coffee'
5, 1.93s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 71%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 531/751 [06:23<07:16, 1.98s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 71%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 532/751 [06:25<07:37, 2.09s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 71%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 533/751 [06:27<07:44, 2.13s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 71%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 534/751 [06:30<08:32, 2.36s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 71%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 535/751 [06:32<08:13, 2.28s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 71%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 536/751 [06:34<07:55, 2.21s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 72%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 537/751 [06:37<08:07, 2.28s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 72%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 538/751 [06:39<08:09, 2.30s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 72%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | [2024-08-03 15:19:57] [Rank 3] totoal_tokens=36257, outputs='Birds'
[2024-08-03 15:19:57] [Rank 1] totoal_tokens=34085, outputs='Dan Levy'
[2024-08-03 15:19:58] [Rank 2] totoal_tokens=35233, outputs='Puget Sound'
[2024-08-03 15:19:58] [Rank 0] totoal_tokens=35033, outputs='Honda'
[2024-08-03 15:19:59] [Rank 1] totoal_tokens=34758, outputs='A food company'
[2024-08-03 15:19:59] [Rank 3] totoal_tokens=36309, outputs='The final step in constructing the model is to use the model to answer the question'
[2024-08-03 15:20:00] [Rank 2] totoal_tokens=35283, outputs='Paint the model'
[2024-08-03 15:20:01] [Rank 0] totoal_tokens=35092, outputs='Daisy'
[2024-08-03 15:20:01] [Rank 1] totoal_tokens=35082, outputs='Albert Einstein'
[2024-08-03 15:20:02] [Rank 3] totoal_tokens=36314, outputs='The festival concludes with a communal meal.'
[2024-08-03 15:20:03] [Rank 0] totoal_tokens=35190, outputs='Get ice cream'
[2024-08-03 15:20:03] [Rank 2] totoal_tokens=35443, outputs='Cactus'
[2024-08-03 15:20:03] [Rank 1] totoal_tokens=35094, outputs='The final event in the triathlon is the 1,500m swim'
[2024-08-03 15:20:04] [Rank 3] totoal_tokens=36326, outputs='Oscar Pistorius'
[2024-08-03 15:20:05] [Rank 1] totoal_tokens=35221, outputs='annually'
[2024-08-03 15:20:05] [Rank 0] totoal_tokens=35196, outputs='The Guttenberg Press Has Landed'
[2024-08-03 15:20:05] [Rank 2] totoal_tokens=35905, outputs='Ezra Pound'
[2024-08-03 15:20:06] [Rank 3] totoal_tokens=36330, outputs='The Howard Stern Show'
[2024-08-03 15:20:07] [Rank 0] totoal_tokens=35378, outputs='John'
[2024-08-03 15:20:07] [Rank 1] totoal_tokens=35250, outputs='Jeff Hoops'
[2024-08-03 15:20:08] [Rank 2] totoal_tokens=36159, outputs='Rolex'
[2024-08-03 15:20:09] [Rank 3] totoal_tokens=36344, outputs='The last event in the triathlon was the 2018 Olympic Games.'
[2024-08-03 15:20:10] [Rank 0] totoal_tokens=35597, outputs='The Beatles'
[2024-08-03 15:20:10] [Rank 1] totoal_tokens=35565, outputs='Wasp'
[2024-08-03 15:20:10] [Rank 2] totoal_tokens=36311, outputs='African'
[2024-08-03 15:20:11] [Rank 3] totoal_tokens=36579, outputs='Watch TV'
[2024-08-03 15:20:12] [Rank 0] totoal_tokens=35711, outputs='bleach'
[2024-08-03 15:20:12] [Rank 1] totoal_tokens=35676, outputs='Kim Kardashian'
[2024-08-03 15:20:13] [Rank 3] totoal_tokens=36596, outputs='Cactus'
[2024-08-03 15:20:13] [Rank 2] totoal_tokens=36624, outputs='Larry Vela'
[2024-08-03 15:20:14] [Rank 0] totoal_tokens=35855, outputs='Lendlease'
[2024-08-03 15:20:14] [Rank 1] totoal_tokens=35708, outputs='The test is passed.'
[2024-08-03 15:20:15] [Rank 3] totoal_tokens=36651, outputs='A'
[2024-08-03 15:20:15] [Rank 2] totoal_tokens=36923, outputs='mallow'
[2024-08-03 15:20:16] [Rank 0] totoal_tokens=35876, outputs='The cook'
539/751 [06:42<08:19, 2.36s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 72%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 540/751 [06:44<07:51, 2.23s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 72%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 541/751 [06:46<08:08, 2.33s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 72%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 542/751 [06:48<07:55, 2.28s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 72%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 543/751 [06:51<08:04, 2.33s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 72%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 544/751 [06:53<07:44, 2.24s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 73%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 545/751 [06:55<07:37, 2.22s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 73%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 546/751 [06:57<07:20, 2.15s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 73%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 547/751 [06:59<07:21, 2.16s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 73%[2024-08-03 15:20:16] [Rank 1] totoal_tokens=35806, outputs='Oil on canvas'
[2024-08-03 15:20:17] [Rank 2] totoal_tokens=36958, outputs='Open the door'
[2024-08-03 15:20:17] [Rank 3] totoal_tokens=36713, outputs='Mia'
[2024-08-03 15:20:18] [Rank 0] totoal_tokens=36096, outputs='Jiyin'
[2024-08-03 15:20:18] [Rank 1] totoal_tokens=36169, outputs='Smokers'
[2024-08-03 15:20:20] [Rank 3] totoal_tokens=36745, outputs='Despina Stokou'
[2024-08-03 15:20:20] [Rank 2] totoal_tokens=37006, outputs='Investment C'
[2024-08-03 15:20:20] [Rank 0] totoal_tokens=36148, outputs='Route X'
[2024-08-03 15:20:21] [Rank 1] totoal_tokens=36175, outputs='The manager meets the manager.'
[2024-08-03 15:20:22] [Rank 3] totoal_tokens=36870, outputs='the workers'
[2024-08-03 15:20:22] [Rank 2] totoal_tokens=37095, outputs='The Death and Life of Great American Cities'
[2024-08-03 15:20:22] [Rank 0] totoal_tokens=36264, outputs='Get coffee'
[2024-08-03 15:20:23] [Rank 1] totoal_tokens=36321, outputs='Lysol'
[2024-08-03 15:20:24] [Rank 3] totoal_tokens=37011, outputs='Hannah Leonard'
[2024-08-03 15:20:24] [Rank 2] totoal_tokens=37166, outputs='The final step in constructing the model is to analyze the results and make any necessary'
[2024-08-03 15:20:25] [Rank 0] totoal_tokens=36292, outputs='The couch is behind Tim.'
[2024-08-03 15:20:25] [Rank 1] totoal_tokens=36470, outputs='The final step in constructing the model is to test the model.'
[2024-08-03 15:20:27] [Rank 0] totoal_tokens=36416, outputs='Coffee'
[2024-08-03 15:20:27] [Rank 2] totoal_tokens=37458, outputs='The Hobbit'
[2024-08-03 15:20:28] [Rank 1] totoal_tokens=36470, outputs='A rather unusual Chinaman'
[2024-08-03 15:20:29] [Rank 0] totoal_tokens=36511, outputs='Coca-Cola'
[2024-08-03 15:20:29] [Rank 2] totoal_tokens=37460, outputs='Earth'
[2024-08-03 15:20:30] [Rank 1] totoal_tokens=36480, outputs='pine tree'
[2024-08-03 15:20:30] [Rank 3] totoal_tokens=37326, outputs='Katherine Neville'
[2024-08-03 15:20:31] [Rank 0] totoal_tokens=36582, outputs='Dragonfly'
[2024-08-03 15:20:32] [Rank 2] totoal_tokens=37540, outputs='Northern Europe'
[2024-08-03 15:20:33] [Rank 1] totoal_tokens=36589, outputs='Layman Brewing'
[2024-08-03 15:20:33] [Rank 3] totoal_tokens=37443, outputs='London'
[2024-08-03 15:20:33] [Rank 0] totoal_tokens=36638, outputs='X'
[2024-08-03 15:20:34] [Rank 2] totoal_tokens=37656, outputs='dog'
[2024-08-03 15:20:35] [Rank 1] totoal_tokens=36743, outputs='Despina Stokou'
[2024-08-03 15:20:35] [Rank 3] totoal_tokens=37493, outputs='Diving'
[2024-08-03 15:20:36] [Rank 0] totoal_tokens=36770, outputs='Huntress'
|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 548/751 [07:01<07:14, 2.14s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 73%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 549/751 [07:03<07:12, 2.14s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 73%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 550/751 [07:05<07:10, 2.14s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 73%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 551/751 [07:08<07:06, 2.13s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 74%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 552/751 [07:10<07:12, 2.17s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 74%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 553/751 [07:12<07:16, 2.20s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 74%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 554/751 [07:15<07:26, 2.27s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 74%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 555/751 [07:17<07:08, 2.19s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 74%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 556/751 [07:19<07:09, 2.20s/it] Processing InternVL2-2B_re[2024-08-03 15:20:37] [Rank 1] totoal_tokens=36934, outputs='milk'
[2024-08-03 15:20:37] [Rank 2] totoal_tokens=37844, outputs='Bike'
[2024-08-03 15:20:37] [Rank 3] totoal_tokens=37587, outputs='Marie Curie'
[2024-08-03 15:20:38] [Rank 0] totoal_tokens=37148, outputs='Jane Addams'
[2024-08-03 15:20:39] [Rank 1] totoal_tokens=37059, outputs='on top of table'
[2024-08-03 15:20:39] [Rank 2] totoal_tokens=38108, outputs='Go to bed'
[2024-08-03 15:20:40] [Rank 3] totoal_tokens=37632, outputs='work'
[2024-08-03 15:20:40] [Rank 0] totoal_tokens=37174, outputs='Gavin'
[2024-08-03 15:20:41] [Rank 1] totoal_tokens=37148, outputs='John the Baptist'
[2024-08-03 15:20:42] [Rank 2] totoal_tokens=38153, outputs='Journeys in Middle Earth'
[2024-08-03 15:20:42] [Rank 3] totoal_tokens=37674, outputs='Go to bed'
[2024-08-03 15:20:42] [Rank 0] totoal_tokens=37865, outputs='Route X'
[2024-08-03 15:20:43] [Rank 1] totoal_tokens=37174, outputs='Laboratory Alliance'
[2024-08-03 15:20:44] [Rank 2] totoal_tokens=38280, outputs='Rika'
[2024-08-03 15:20:44] [Rank 3] totoal_tokens=37691, outputs='Corn'
[2024-08-03 15:20:45] [Rank 0] totoal_tokens=37922, outputs='ice cream'
[2024-08-03 15:20:46] [Rank 1] totoal_tokens=37222, outputs='Cake'
[2024-08-03 15:20:46] [Rank 2] totoal_tokens=38662, outputs='Get coffee'
[2024-08-03 15:20:47] [Rank 0] totoal_tokens=37983, outputs='Abigail'
[2024-08-03 15:20:47] [Rank 3] totoal_tokens=37710, outputs='Liam'
[2024-08-03 15:20:48] [Rank 1] totoal_tokens=37327, outputs='Bush'
[2024-08-03 15:20:50] [Rank 0] totoal_tokens=38068, outputs='Cactus'
[2024-08-03 15:20:50] [Rank 2] totoal_tokens=38680, outputs='oak'
[2024-08-03 15:20:50] [Rank 3] totoal_tokens=37795, outputs='mallow'
[2024-08-03 15:20:51] [Rank 1] totoal_tokens=37337, outputs='Pandora'
[2024-08-03 15:20:52] [Rank 0] totoal_tokens=38080, outputs='John Burris'
[2024-08-03 15:20:52] [Rank 2] totoal_tokens=38745, outputs='Impressionist'
[2024-08-03 15:20:52] [Rank 3] totoal_tokens=38197, outputs='Gabriel'
[2024-08-03 15:20:53] [Rank 1] totoal_tokens=37437, outputs='Get coffee'
[2024-08-03 15:20:55] [Rank 0] totoal_tokens=38199, outputs='The Iliad'
asoning-text-test.jsonl: 74%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 557/751 [07:21<07:07, 2.20s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 74%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 558/751 [07:23<07:00, 2.18s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 74%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 559/751 [07:26<07:18, 2.28s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 75%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 560/751 [07:28<07:04, 2.22s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 75%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 561/751 [07:30<07:02, 2.23s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 75%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 562/751 [07:33<07:28, 2.38s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 75%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 563/751 [07:35<07:28, 2.38s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 75%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 564/751 [07:38<07:39, 2.46s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 75%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 565/751 [07:40<07:30, 2.42s/i[2024-08-03 15:20:55] [Rank 2] totoal_tokens=38840, outputs='oak'
[2024-08-03 15:20:55] [Rank 3] totoal_tokens=38220, outputs='The last event in the triathlon is the Olympic Games.'
[2024-08-03 15:20:56] [Rank 1] totoal_tokens=37441, outputs='Amazon'
[2024-08-03 15:20:57] [Rank 0] totoal_tokens=38258, outputs='Austin'
[2024-08-03 15:20:58] [Rank 3] totoal_tokens=38231, outputs='The Sundance Film Festival'
[2024-08-03 15:20:58] [Rank 2] totoal_tokens=38857, outputs='Chamois'
[2024-08-03 15:20:58] [Rank 1] totoal_tokens=37490, outputs='Fitz Tepper'
[2024-08-03 15:20:59] [Rank 0] totoal_tokens=38363, outputs='The festival ends with a fireworks display.'
[2024-08-03 15:21:00] [Rank 2] totoal_tokens=39062, outputs='A man'
[2024-08-03 15:21:00] [Rank 3] totoal_tokens=38266, outputs='The final step in constructing the model is the completion of the model.'
[2024-08-03 15:21:01] [Rank 1] totoal_tokens=37532, outputs='France'
[2024-08-03 15:21:02] [Rank 0] totoal_tokens=38585, outputs='The SM843 Pro Data Series SSD is docked in Napeague.'
[2024-08-03 15:21:03] [Rank 3] totoal_tokens=38268, outputs='ice cream'
[2024-08-03 15:21:04] [Rank 2] totoal_tokens=39255, outputs='A store that receives its goods last is the grocery store.'
[2024-08-03 15:21:04] [Rank 1] totoal_tokens=37616, outputs='The Ziada family home in the Bureij refugee camp in Gaza was'
[2024-08-03 15:21:05] [Rank 0] totoal_tokens=38702, outputs='Ice cream'
[2024-08-03 15:21:05] [Rank 3] totoal_tokens=38382, outputs='Dragonfly'
[2024-08-03 15:21:06] [Rank 1] totoal_tokens=37869, outputs='Cactus'
[2024-08-03 15:21:06] [Rank 2] totoal_tokens=39285, outputs='Turn on computer'
[2024-08-03 15:21:07] [Rank 0] totoal_tokens=38772, outputs='The ZenWiFi AX AX6600 Whole-Home Tri-band Mesh Wi'
[2024-08-03 15:21:07] [Rank 3] totoal_tokens=38405, outputs='The OA'
[2024-08-03 15:21:09] [Rank 1] totoal_tokens=38049, outputs='The Yellow Warbler'
[2024-08-03 15:21:09] [Rank 2] totoal_tokens=39291, outputs='The festival ends with a concert.'
[2024-08-03 15:21:10] [Rank 3] totoal_tokens=38524, outputs='The Grim 13: Short Stories'
[2024-08-03 15:21:10] [Rank 0] totoal_tokens=38931, outputs='Get the baby'
[2024-08-03 15:21:11] [Rank 1] totoal_tokens=38428, outputs='Samsung'
[2024-08-03 15:21:11] [Rank 2] totoal_tokens=39349, outputs='Corn'
[2024-08-03 15:21:12] [Rank 3] totoal_tokens=38653, outputs='Sugarcane'
[2024-08-03 15:21:12] [Rank 0] totoal_tokens=39041, outputs='Get ready for work'
[2024-08-03 15:21:14] [Rank 1] totoal_tokens=38503, outputs='Dessert'
[2024-08-03 15:21:14] [Rank 2] totoal_tokens=39637, outputs='Peaches'
[2024-08-03 15:21:14] [Rank 0] totoal_tokens=39050, outputs='Watch TV'
[2024-08-03 15:21:15] [Rank 3] totoal_tokens=38686, outputs='Oregon'
[2024-08-03 15:21:16] [Rank 1] totoal_tokens=38565, outputs='John Ferari'
[2024-08-03 15:21:16] [Rank 2] totoal_tokens=39717, outputs='Pink Amazon'
[2024-08-03 15:21:17] [Rank 0] totoal_tokens=39079, outputs='Product B'
t] Processing InternVL2-2B_reasoning-text-test.jsonl: 75%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 566/751 [07:42<07:11, 2.33s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 75%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 567/751 [07:45<07:11, 2.34s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 76%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 568/751 [07:48<07:49, 2.57s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 76%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 569/751 [07:50<07:37, 2.51s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 76%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 570/751 [07:53<07:36, 2.52s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 76%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 571/751 [07:55<07:52, 2.62s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 76%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 572/751 [07:58<07:24, 2.48s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 76%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 573/751 [08:00<07:02, 2.37s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 76%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 5[2024-08-03 15:21:17] [Rank 3] totoal_tokens=39065, outputs='The festival ends with a concert.'
[2024-08-03 15:21:18] [Rank 1] totoal_tokens=38638, outputs='Fiona Bruce'
[2024-08-03 15:21:19] [Rank 0] totoal_tokens=39251, outputs='Cheetos'
[2024-08-03 15:21:19] [Rank 2] totoal_tokens=39813, outputs='Laurie'
[2024-08-03 15:21:20] [Rank 3] totoal_tokens=39359, outputs='The parade'
[2024-08-03 15:21:21] [Rank 1] totoal_tokens=38754, outputs='Dr. Johnson'
[2024-08-03 15:21:21] [Rank 0] totoal_tokens=39786, outputs='Oil painting'
[2024-08-03 15:21:22] [Rank 3] totoal_tokens=39507, outputs='The Animal Crossing: New Horizons 2.0'
[2024-08-03 15:21:23] [Rank 2] totoal_tokens=40145, outputs='next to'
[2024-08-03 15:21:24] [Rank 1] totoal_tokens=38848, outputs='Lionsgate'
[2024-08-03 15:21:24] [Rank 0] totoal_tokens=40074, outputs='Diet Plan B'
[2024-08-03 15:21:25] [Rank 3] totoal_tokens=40125, outputs='left'
[2024-08-03 15:21:26] [Rank 2] totoal_tokens=40160, outputs='Castle'
[2024-08-03 15:21:27] [Rank 1] totoal_tokens=39521, outputs='The car that finishes the race first is the one that has the highest item level'
[2024-08-03 15:21:27] [Rank 3] totoal_tokens=40157, outputs='David Grotto'
[2024-08-03 15:21:27] [Rank 0] totoal_tokens=40271, outputs='Demon'
[2024-08-03 15:21:28] [Rank 2] totoal_tokens=40165, outputs='Wine'
[2024-08-03 15:21:29] [Rank 0] totoal_tokens=40604, outputs='Pride'
[2024-08-03 15:21:29] [Rank 1] totoal_tokens=39654, outputs='LGBTI'
[2024-08-03 15:21:29] [Rank 3] totoal_tokens=40175, outputs='Rubble'
[2024-08-03 15:21:31] [Rank 2] totoal_tokens=40190, outputs='Park West Gallery'
[2024-08-03 15:21:32] [Rank 0] totoal_tokens=40623, outputs='Amy Harris'
[2024-08-03 15:21:32] [Rank 3] totoal_tokens=40212, outputs='Palm'
[2024-08-03 15:21:32] [Rank 1] totoal_tokens=40237, outputs='chocolate cream pie'
[2024-08-03 15:21:33] [Rank 2] totoal_tokens=40191, outputs='Get his diploma'
[2024-08-03 15:21:34] [Rank 3] totoal_tokens=40241, outputs='apple'
[2024-08-03 15:21:34] [Rank 0] totoal_tokens=40771, outputs='The Hunger Games'
[2024-08-03 15:21:35] [Rank 1] totoal_tokens=40349, outputs='Banana'
[2024-08-03 15:21:35] [Rank 2] totoal_tokens=40281, outputs='The product is launched to the public'
[2024-08-03 15:21:37] [Rank 0] totoal_tokens=40773, outputs='Dragonfly'
[2024-08-03 15:21:37] [Rank 3] totoal_tokens=41385, outputs='Lysol'
[2024-08-03 15:21:38] [Rank 2] totoal_tokens=40286, outputs='Route X'
[2024-08-03 15:21:38] [Rank 1] totoal_tokens=40374, outputs='Aeroplanes'
[2024-08-03 15:21:40] [Rank 0] totoal_tokens=41325, outputs='Stern'
74/751 [08:02<06:50, 2.32s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 77%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 575/751 [08:04<06:39, 2.27s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 77%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 576/751 [08:07<06:54, 2.37s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 77%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 577/751 [08:09<07:16, 2.51s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 77%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 578/751 [08:12<07:34, 2.63s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 77%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 579/751 [08:15<07:14, 2.52s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 77%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 580/751 [08:17<07:01, 2.46s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 77%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 581/751 [08:19<06:55, 2.44s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 77%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 582/751 [08:22<06:57, 2.47s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 78%|[2024-08-03 15:21:40] [Rank 3] totoal_tokens=41436, outputs='Lysol'
[2024-08-03 15:21:40] [Rank 2] totoal_tokens=40337, outputs='A'
[2024-08-03 15:21:41] [Rank 1] totoal_tokens=40416, outputs='Marie Curie'
[2024-08-03 15:21:42] [Rank 0] totoal_tokens=41621, outputs='Paint the model'
[2024-08-03 15:21:42] [Rank 3] totoal_tokens=41454, outputs='Dawn'
[2024-08-03 15:21:43] [Rank 2] totoal_tokens=40378, outputs='Pink Puffin'
[2024-08-03 15:21:43] [Rank 1] totoal_tokens=40428, outputs='Region C'
[2024-08-03 15:21:45] [Rank 0] totoal_tokens=41760, outputs='Cake'
[2024-08-03 15:21:45] [Rank 2] totoal_tokens=40539, outputs='Apter mags'
[2024-08-03 15:21:45] [Rank 1] totoal_tokens=40872, outputs='Pigeon'
[2024-08-03 15:21:46] [Rank 3] totoal_tokens=41532, outputs='A food company'
[2024-08-03 15:21:47] [Rank 0] totoal_tokens=41878, outputs='Get ready for work'
[2024-08-03 15:21:48] [Rank 2] totoal_tokens=40546, outputs='West'
[2024-08-03 15:21:48] [Rank 1] totoal_tokens=41411, outputs='Bush'
[2024-08-03 15:21:49] [Rank 3] totoal_tokens=41645, outputs='polar bear'
[2024-08-03 15:21:50] [Rank 0] totoal_tokens=42279, outputs='dog'
[2024-08-03 15:21:51] [Rank 2] totoal_tokens=40774, outputs='Hemlock Grove'
[2024-08-03 15:21:51] [Rank 1] totoal_tokens=41412, outputs='The car is tested and deemed to be in good working order.'
[2024-08-03 15:21:52] [Rank 3] totoal_tokens=41686, outputs='Facebook'
[2024-08-03 15:21:52] [Rank 0] totoal_tokens=42311, outputs='The coffee is de-pulped, fermented and washed at the centralized wet'
[2024-08-03 15:21:53] [Rank 2] totoal_tokens=40877, outputs='Pine'
[2024-08-03 15:21:54] [Rank 1] totoal_tokens=41610, outputs='Scott Whitlock'
[2024-08-03 15:21:55] [Rank 0] totoal_tokens=42322, outputs='Earth'
[2024-08-03 15:21:55] [Rank 2] totoal_tokens=41141, outputs='The Umbrella Academy'
[2024-08-03 15:21:56] [Rank 3] totoal_tokens=42123, outputs='apple'
[2024-08-03 15:21:57] [Rank 1] totoal_tokens=42246, outputs='Alex Hennech'
[2024-08-03 15:21:58] [Rank 0] totoal_tokens=42415, outputs='I did not cheat on 9ice'
[2024-08-03 15:21:58] [Rank 2] totoal_tokens=41465, outputs='The 3rd'
[2024-08-03 15:21:59] [Rank 3] totoal_tokens=42258, outputs='Popsicles'
[2024-08-03 15:21:59] [Rank 1] totoal_tokens=42248, outputs='Lantus'
[2024-08-03 15:22:00] [Rank 0] totoal_tokens=42522, outputs='Toyota'
[2024-08-03 15:22:02] [Rank 3] totoal_tokens=42449, outputs='Pelican'
[2024-08-03 15:22:02] [Rank 2] totoal_tokens=41929, outputs='Route X'
[2024-08-03 15:22:03] [Rank 1] totoal_tokens=42282, outputs='Workout'
[2024-08-03 15:22:03] [Rank 0] totoal_tokens=42537, outputs='Go to bed'
β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 583/751 [08:25<07:28, 2.67s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 78%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 584/751 [08:28<07:18, 2.62s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 78%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 585/751 [08:30<07:08, 2.58s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 78%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 586/751 [08:32<07:00, 2.55s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 78%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 587/751 [08:35<06:51, 2.51s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 78%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 588/751 [08:38<07:00, 2.58s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 78%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 589/751 [08:40<07:01, 2.60s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 79%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 590/751 [08:43<07:05, 2.64s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 79%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 591/751 [08:46<06:56, 2.60s/it] Processing InternVL2-2B_rea[2024-08-03 15:22:05] [Rank 2] totoal_tokens=42145, outputs='visit the museum'
[2024-08-03 15:22:05] [Rank 3] totoal_tokens=42568, outputs='The festival concludes with a parade.'
[2024-08-03 15:22:05] [Rank 0] totoal_tokens=42970, outputs='June 9'
[2024-08-03 15:22:05] [Rank 1] totoal_tokens=42541, outputs='Oil painting'
[2024-08-03 15:22:07] [Rank 3] totoal_tokens=42851, outputs='The ballast is put in'
[2024-08-03 15:22:07] [Rank 2] totoal_tokens=42254, outputs='The final step in constructing the model is to assemble the model.'
[2024-08-03 15:22:08] [Rank 1] totoal_tokens=42740, outputs='Dan'
[2024-08-03 15:22:08] [Rank 0] totoal_tokens=43330, outputs='A food delivery service'
[2024-08-03 15:22:10] [Rank 2] totoal_tokens=42806, outputs='The closing ceremony'
[2024-08-03 15:22:10] [Rank 3] totoal_tokens=42968, outputs='Lego'
[2024-08-03 15:22:11] [Rank 0] totoal_tokens=43391, outputs='Car A'
[2024-08-03 15:22:11] [Rank 1] totoal_tokens=42839, outputs='Elisabeth Haich'
[2024-08-03 15:22:13] [Rank 3] totoal_tokens=43184, outputs='Corn'
[2024-08-03 15:22:13] [Rank 2] totoal_tokens=42998, outputs='The week was superb fun and the lake was better than wished for. Having Matt'
[2024-08-03 15:22:13] [Rank 0] totoal_tokens=43570, outputs='The oak tree is planted in the backyard.'
[2024-08-03 15:22:14] [Rank 1] totoal_tokens=43045, outputs='Get coffee'
[2024-08-03 15:22:16] [Rank 2] totoal_tokens=43130, outputs='Wasp'
[2024-08-03 15:22:16] [Rank 3] totoal_tokens=43498, outputs='Jupiter'
[2024-08-03 15:22:17] [Rank 0] totoal_tokens=44175, outputs='Lee'
[2024-08-03 15:22:17] [Rank 1] totoal_tokens=44255, outputs='The Girl with the Dragon Tattoo'
[2024-08-03 15:22:18] [Rank 2] totoal_tokens=43176, outputs='Belfast'
[2024-08-03 15:22:19] [Rank 0] totoal_tokens=44312, outputs='Jerry Eubanks'
[2024-08-03 15:22:19] [Rank 3] totoal_tokens=44363, outputs='Dawn'
[2024-08-03 15:22:20] [Rank 1] totoal_tokens=44577, outputs='Al Sadd'
[2024-08-03 15:22:21] [Rank 2] totoal_tokens=43194, outputs='The product is launched to the public.'
[2024-08-03 15:22:22] [Rank 0] totoal_tokens=44319, outputs='Driving'
[2024-08-03 15:22:22] [Rank 3] totoal_tokens=44448, outputs='Jared Leto'
[2024-08-03 15:22:23] [Rank 1] totoal_tokens=45119, outputs='Mediterranean'
[2024-08-03 15:22:25] [Rank 2] totoal_tokens=43445, outputs='Lee So-hee'
[2024-08-03 15:22:25] [Rank 0] totoal_tokens=44914, outputs='Aloe Vera'
soning-text-test.jsonl: 79%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 592/751 [08:48<06:46, 2.56s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 79%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 593/751 [08:51<06:43, 2.56s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 79%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 594/751 [08:53<06:48, 2.60s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 79%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 595/751 [08:56<06:44, 2.59s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 79%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 596/751 [08:59<06:52, 2.66s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 79%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 597/751 [09:02<07:16, 2.83s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 80%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 598/751 [09:04<07:03, 2.77s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 80%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 599/751 [09:07<06:57, 2.75s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 80%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 600/751 [09:10<07:18, 2.91s/it[2024-08-03 15:22:26] [Rank 1] totoal_tokens=45190, outputs='The final touches are added by the PowerShell ISE console.'
[2024-08-03 15:22:27] [Rank 3] totoal_tokens=44562, outputs='Go to bed'
[2024-08-03 15:22:28] [Rank 2] totoal_tokens=43506, outputs='Karen'
[2024-08-03 15:22:29] [Rank 0] totoal_tokens=44916, outputs='Methane'
[2024-08-03 15:22:29] [Rank 1] totoal_tokens=45260, outputs="And It's Still Alight"
[2024-08-03 15:22:29] [Rank 3] totoal_tokens=44736, outputs='August 3rd'
[2024-08-03 15:22:31] [Rank 2] totoal_tokens=44053, outputs='Chef'
[2024-08-03 15:22:32] [Rank 0] totoal_tokens=45445, outputs='Pine'
[2024-08-03 15:22:32] [Rank 3] totoal_tokens=45572, outputs='Rio 2016'
[2024-08-03 15:22:32] [Rank 1] totoal_tokens=45327, outputs='The foundation is constructed by the following steps:\n\n1. The foundation is constructed by'
[2024-08-03 15:22:34] [Rank 2] totoal_tokens=44259, outputs='The final touches are added by the user.'
[2024-08-03 15:22:35] [Rank 0] totoal_tokens=45657, outputs='Earth'
[2024-08-03 15:22:35] [Rank 1] totoal_tokens=45410, outputs='vegetarian'
[2024-08-03 15:22:35] [Rank 3] totoal_tokens=46185, outputs='Northern America'
[2024-08-03 15:22:37] [Rank 2] totoal_tokens=44328, outputs='Turn on the iCloud Music Library'
[2024-08-03 15:22:38] [Rank 3] totoal_tokens=46315, outputs='She is a teacher'
[2024-08-03 15:22:38] [Rank 0] totoal_tokens=45659, outputs='Get a job'
[2024-08-03 15:22:39] [Rank 1] totoal_tokens=45512, outputs='Quebec'
[2024-08-03 15:22:40] [Rank 2] totoal_tokens=44376, outputs='Mediterranean'
[2024-08-03 15:22:40] [Rank 3] totoal_tokens=46321, outputs='Kevin'
[2024-08-03 15:22:41] [Rank 0] totoal_tokens=45665, outputs='The 2019 Cannes Film Festival'
[2024-08-03 15:22:41] [Rank 1] totoal_tokens=45634, outputs='Food'
[2024-08-03 15:22:42] [Rank 2] totoal_tokens=44409, outputs='Snow'
[2024-08-03 15:22:43] [Rank 3] totoal_tokens=46367, outputs='Nathaniel Rateliff'
[2024-08-03 15:22:44] [Rank 1] totoal_tokens=45745, outputs='Obi'
[2024-08-03 15:22:45] [Rank 0] totoal_tokens=45876, outputs='Facebook'
[2024-08-03 15:22:46] [Rank 3] totoal_tokens=46460, outputs='Doc'
[2024-08-03 15:22:47] [Rank 2] totoal_tokens=44409, outputs='The Leaning Tower of Pisa'
[2024-08-03 15:22:47] [Rank 1] totoal_tokens=46198, outputs='Kate Huntington'
[2024-08-03 15:22:47] [Rank 0] totoal_tokens=46484, outputs='Castle Leslie'
[2024-08-03 15:22:49] [Rank 2] totoal_tokens=45138, outputs='The game'
[2024-08-03 15:22:50] [Rank 3] totoal_tokens=46581, outputs='Awards are given out at the end of the day.'
[2024-08-03 15:22:50] [Rank 1] totoal_tokens=46479, outputs='Company Y'
[2024-08-03 15:22:51] [Rank 0] totoal_tokens=47461, outputs='Get homework done'
[2024-08-03 15:22:52] [Rank 2] totoal_tokens=45494, outputs='Boeing'
[2024-08-03 15:22:52] [Rank 3] totoal_tokens=47176, outputs='Get a bottle'
[2024-08-03 15:22:53] [Rank 1] totoal_tokens=46621, outputs='The festival concludes with a fireworks display.'
[2024-08-03 15:22:54] [Rank 0] totoal_tokens=47745, outputs='The President of the United States'
] Processing InternVL2-2B_reasoning-text-test.jsonl: 80%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 601/751 [09:14<08:00, 3.20s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 80%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 602/751 [09:17<07:32, 3.04s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 80%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 603/751 [09:20<07:22, 2.99s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 80%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 604/751 [09:23<07:39, 3.13s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 81%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 605/751 [09:26<07:31, 3.09s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 81%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 606/751 [09:30<07:49, 3.23s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 81%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 607/751 [09:33<07:27, 3.11s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 81%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 608/751 [09:36<07:37, 3.20s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 81%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 60[2024-08-03 15:22:55] [Rank 2] totoal_tokens=45586, outputs='The fighters will fight in the King Abdullah Sports City Arena.'
[2024-08-03 15:22:55] [Rank 3] totoal_tokens=47245, outputs='backyard'
[2024-08-03 15:22:57] [Rank 1] totoal_tokens=47119, outputs='Robert Louis Stevenson'
[2024-08-03 15:22:58] [Rank 2] totoal_tokens=45946, outputs='Corn'
[2024-08-03 15:22:58] [Rank 0] totoal_tokens=47823, outputs='Investment C'
[2024-08-03 15:22:58] [Rank 3] totoal_tokens=47791, outputs='Succulent'
[2024-08-03 15:23:00] [Rank 1] totoal_tokens=47727, outputs='Mitch'
[2024-08-03 15:23:01] [Rank 0] totoal_tokens=47835, outputs='June 27'
[2024-08-03 15:23:01] [Rank 2] totoal_tokens=45998, outputs='Oak'
[2024-08-03 15:23:01] [Rank 3] totoal_tokens=47793, outputs='Get coffee'
[2024-08-03 15:23:03] [Rank 1] totoal_tokens=47843, outputs='Gosha Rubchinskiy'
[2024-08-03 15:23:04] [Rank 0] totoal_tokens=48015, outputs='Pomegranate'
[2024-08-03 15:23:04] [Rank 2] totoal_tokens=46284, outputs='The aliens are gone.'
[2024-08-03 15:23:04] [Rank 3] totoal_tokens=48180, outputs='Wasp'
[2024-08-03 15:23:06] [Rank 1] totoal_tokens=48020, outputs='Ka-kheti'
[2024-08-03 15:23:07] [Rank 0] totoal_tokens=48486, outputs='The Atlantic'
[2024-08-03 15:23:07] [Rank 2] totoal_tokens=46538, outputs='on the left'
[2024-08-03 15:23:07] [Rank 3] totoal_tokens=48488, outputs='Banana'
[2024-08-03 15:23:09] [Rank 1] totoal_tokens=48716, outputs='Spartan Sprint'
[2024-08-03 15:23:10] [Rank 0] totoal_tokens=48884, outputs='Mzansi'
[2024-08-03 15:23:10] [Rank 2] totoal_tokens=46598, outputs='Roger Bannister'
[2024-08-03 15:23:11] [Rank 3] totoal_tokens=48622, outputs='Wash his face'
[2024-08-03 15:23:12] [Rank 1] totoal_tokens=49304, outputs='Bob'
[2024-08-03 15:23:13] [Rank 2] totoal_tokens=46846, outputs='Landon Liboiron'
[2024-08-03 15:23:13] [Rank 0] totoal_tokens=48994, outputs='Larvae'
[2024-08-03 15:23:14] [Rank 3] totoal_tokens=48923, outputs='Pink'
[2024-08-03 15:23:16] [Rank 1] totoal_tokens=49533, outputs='The children also got to try lots of enjoy exotic fruit'
[2024-08-03 15:23:16] [Rank 2] totoal_tokens=47596, outputs='Oil on canvas'
[2024-08-03 15:23:17] [Rank 0] totoal_tokens=49315, outputs='Genies'
[2024-08-03 15:23:17] [Rank 3] totoal_tokens=48929, outputs='Jerry'
[2024-08-03 15:23:19] [Rank 2] totoal_tokens=48174, outputs='Honda City ZX Exi'
[2024-08-03 15:23:20] [Rank 1] totoal_tokens=49992, outputs='X'
[2024-08-03 15:23:20] [Rank 3] totoal_tokens=49365, outputs='Peregrine falcon'
[2024-08-03 15:23:21] [Rank 0] totoal_tokens=49383, outputs='annual'
[2024-08-03 15:23:22] [Rank 2] totoal_tokens=48365, outputs='Get coffee'
[2024-08-03 15:23:23] [Rank 1] totoal_tokens=50101, outputs='Lindsey Anderson'
[2024-08-03 15:23:24] [Rank 0] totoal_tokens=50392, outputs='South'
9/751 [09:39<07:20, 3.10s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 81%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 610/751 [09:43<08:04, 3.43s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 81%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 611/751 [09:46<07:42, 3.31s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 81%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 612/751 [09:49<07:25, 3.21s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 82%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 613/751 [09:52<07:17, 3.17s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 82%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 614/751 [09:55<07:13, 3.17s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 82%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 615/751 [09:59<07:12, 3.18s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 82%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 616/751 [10:02<07:14, 3.22s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 82%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 617/751 [10:06<07:49, 3.51s/it] Processing InternVL2-2B_reasoning-text-test[2024-08-03 15:23:26] [Rank 3] totoal_tokens=49458, outputs='Store B'
[2024-08-03 15:23:27] [Rank 1] totoal_tokens=50684, outputs='September 13'
[2024-08-03 15:23:27] [Rank 2] totoal_tokens=48624, outputs='Russell'
[2024-08-03 15:23:27] [Rank 0] totoal_tokens=50448, outputs='Exaktbox 6'
[2024-08-03 15:23:29] [Rank 3] totoal_tokens=49472, outputs='Southern'
[2024-08-03 15:23:30] [Rank 1] totoal_tokens=50702, outputs='Bird'
[2024-08-03 15:23:30] [Rank 2] totoal_tokens=48934, outputs='Marie Curie'
[2024-08-03 15:23:31] [Rank 0] totoal_tokens=50759, outputs='Bruce Willis'
[2024-08-03 15:23:32] [Rank 3] totoal_tokens=50047, outputs='Get the equipment ready'
[2024-08-03 15:23:33] [Rank 1] totoal_tokens=50716, outputs='Earth'
[2024-08-03 15:23:33] [Rank 2] totoal_tokens=48979, outputs='The National Writers Series'
[2024-08-03 15:23:34] [Rank 0] totoal_tokens=50769, outputs='The final event is the award ceremony.'
[2024-08-03 15:23:35] [Rank 3] totoal_tokens=50100, outputs='Wasp'
[2024-08-03 15:23:36] [Rank 1] totoal_tokens=50799, outputs='Samsung'
[2024-08-03 15:23:36] [Rank 2] totoal_tokens=48997, outputs='Gaia Foods'
[2024-08-03 15:23:38] [Rank 0] totoal_tokens=50948, outputs='Driving'
[2024-08-03 15:23:39] [Rank 3] totoal_tokens=50157, outputs='Samsung'
[2024-08-03 15:23:39] [Rank 2] totoal_tokens=49009, outputs='Gouache'
[2024-08-03 15:23:41] [Rank 0] totoal_tokens=51075, outputs='Car B'
[2024-08-03 15:23:42] [Rank 1] totoal_tokens=50909, outputs='The Yellow Book'
[2024-08-03 15:23:42] [Rank 3] totoal_tokens=50374, outputs='PavΓ©'
[2024-08-03 15:23:43] [Rank 2] totoal_tokens=49084, outputs='Xiaomi'
[2024-08-03 15:23:45] [Rank 1] totoal_tokens=50946, outputs='the house'
[2024-08-03 15:23:45] [Rank 0] totoal_tokens=51484, outputs='The production'
[2024-08-03 15:23:46] [Rank 3] totoal_tokens=50685, outputs='I am not sure'
[2024-08-03 15:23:46] [Rank 2] totoal_tokens=49198, outputs='Play video game'
[2024-08-03 15:23:49] [Rank 1] totoal_tokens=51684, outputs='Cactus'
[2024-08-03 15:23:49] [Rank 3] totoal_tokens=51005, outputs='bleach'
[2024-08-03 15:23:49] [Rank 2] totoal_tokens=49521, outputs='Serie A'
[2024-08-03 15:23:50] [Rank 0] totoal_tokens=51847, outputs='Southern'
[2024-08-03 15:23:52] [Rank 1] totoal_tokens=51828, outputs='Lavender'
[2024-08-03 15:23:52] [Rank 2] totoal_tokens=50169, outputs='Chancelor Bennett'
[2024-08-03 15:23:53] [Rank 3] totoal_tokens=51120, outputs='Viv'
[2024-08-03 15:23:53] [Rank 0] totoal_tokens=52395, outputs='Butch Thompson'
.jsonl: 82%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 618/751 [10:09<07:27, 3.37s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 82%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 619/751 [10:12<07:14, 3.29s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 83%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 620/751 [10:17<07:49, 3.58s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 83%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 621/751 [10:20<07:30, 3.46s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 83%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 622/751 [10:23<07:24, 3.44s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 83%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 623/751 [10:27<07:22, 3.46s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 83%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 624/751 [10:31<07:45, 3.67s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 83%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 625/751 [10:35<07:59, 3.80s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 83%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 626/751 [10:38<07:42, 3.70s/[2024-08-03 15:23:55] [Rank 1] totoal_tokens=51861, outputs='Peach of Time'
[2024-08-03 15:23:56] [Rank 2] totoal_tokens=50622, outputs='Sarah Zapata'
[2024-08-03 15:23:56] [Rank 0] totoal_tokens=52604, outputs='Esther Choi'
[2024-08-03 15:23:57] [Rank 3] totoal_tokens=51197, outputs='Where does Iowa stand in the national popular vote debate?'
[2024-08-03 15:23:58] [Rank 1] totoal_tokens=51945, outputs='South'
[2024-08-03 15:23:59] [Rank 2] totoal_tokens=50760, outputs='The couch'
[2024-08-03 15:24:00] [Rank 0] totoal_tokens=52608, outputs='vegetarian'
[2024-08-03 15:24:01] [Rank 3] totoal_tokens=51241, outputs='The Simpsons'
[2024-08-03 15:24:03] [Rank 1] totoal_tokens=52322, outputs='Left'
[2024-08-03 15:24:03] [Rank 0] totoal_tokens=52881, outputs='Miranda Lambert'
[2024-08-03 15:24:03] [Rank 2] totoal_tokens=50761, outputs='The Mom 100 Cookbook: 100 Recipes Every Mom Needs in Her Back'
[2024-08-03 15:24:04] [Rank 3] totoal_tokens=51401, outputs='Coffee'
[2024-08-03 15:24:06] [Rank 1] totoal_tokens=52478, outputs='go to bed'
[2024-08-03 15:24:07] [Rank 2] totoal_tokens=51484, outputs='Get ready for the day'
[2024-08-03 15:24:07] [Rank 3] totoal_tokens=51480, outputs='apple'
[2024-08-03 15:24:07] [Rank 0] totoal_tokens=53670, outputs='Ferguson'
[2024-08-03 15:24:09] [Rank 1] totoal_tokens=52593, outputs='South'
[2024-08-03 15:24:10] [Rank 2] totoal_tokens=51498, outputs='Luigi'
[2024-08-03 15:24:11] [Rank 3] totoal_tokens=51564, outputs='Pelican'
[2024-08-03 15:24:11] [Rank 0] totoal_tokens=53856, outputs='Boodle Fight Manila'
[2024-08-03 15:24:13] [Rank 1] totoal_tokens=52598, outputs='Panchromatic'
[2024-08-03 15:24:13] [Rank 2] totoal_tokens=51748, outputs='The Best Vegan Restaurant in the World'
[2024-08-03 15:24:15] [Rank 3] totoal_tokens=51671, outputs='The last event in the triathlon is the 2018 Australian Open.'
[2024-08-03 15:24:16] [Rank 1] totoal_tokens=52695, outputs='on the right'
[2024-08-03 15:24:16] [Rank 0] totoal_tokens=54056, outputs='The final step in constructing the model is to run the simulation.'
[2024-08-03 15:24:16] [Rank 2] totoal_tokens=51915, outputs='Sanguinian'
[2024-08-03 15:24:19] [Rank 3] totoal_tokens=51684, outputs='Norton'
[2024-08-03 15:24:19] [Rank 1] totoal_tokens=52833, outputs='A store'
[2024-08-03 15:24:20] [Rank 0] totoal_tokens=54140, outputs='Bushy'
[2024-08-03 15:24:20] [Rank 2] totoal_tokens=51961, outputs='Valentina'
[2024-08-03 15:24:23] [Rank 3] totoal_tokens=53464, outputs='The route that is the quickest is the one that is the most direct'
[2024-08-03 15:24:23] [Rank 2] totoal_tokens=52220, outputs='oak'
[2024-08-03 15:24:23] [Rank 0] totoal_tokens=54362, outputs='oak'
[2024-08-03 15:24:24] [Rank 1] totoal_tokens=52862, outputs='The marriage'
[2024-08-03 15:24:26] [Rank 3] totoal_tokens=53509, outputs='The Beatles'
[2024-08-03 15:24:27] [Rank 2] totoal_tokens=52239, outputs="Macy's"
[2024-08-03 15:24:27] [Rank 1] totoal_tokens=52863, outputs='Rome'
[2024-08-03 15:24:28] [Rank 0] totoal_tokens=54665, outputs='Go to the beach'
it] Processing InternVL2-2B_reasoning-text-test.jsonl: 83%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 627/751 [10:42<07:22, 3.57s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 84%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 628/751 [10:45<07:05, 3.46s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 84%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 629/751 [10:48<06:55, 3.41s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 84%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 630/751 [10:53<07:29, 3.72s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 84%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 631/751 [10:57<07:36, 3.80s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 84%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 632/751 [11:01<08:10, 4.12s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 84%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 633/751 [11:05<07:48, 3.97s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 84%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 634/751 [11:09<07:32, 3.86s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 85%|β–ˆβ–ˆβ–ˆβ–ˆ[2024-08-03 15:24:30] [Rank 2] totoal_tokens=52271, outputs='apple'
[2024-08-03 15:24:30] [Rank 3] totoal_tokens=53776, outputs='Brazil'
[2024-08-03 15:24:31] [Rank 1] totoal_tokens=53294, outputs='Guy'
[2024-08-03 15:24:31] [Rank 0] totoal_tokens=55084, outputs='mosquito'
[2024-08-03 15:24:34] [Rank 2] totoal_tokens=52446, outputs='Peter'
[2024-08-03 15:24:35] [Rank 0] totoal_tokens=55199, outputs='on the right'
[2024-08-03 15:24:35] [Rank 3] totoal_tokens=54182, outputs='Wasp'
[2024-08-03 15:24:36] [Rank 1] totoal_tokens=53770, outputs='Dessert'
[2024-08-03 15:24:38] [Rank 2] totoal_tokens=52447, outputs='The first'
[2024-08-03 15:24:39] [Rank 3] totoal_tokens=54331, outputs='Armitage'
[2024-08-03 15:24:39] [Rank 0] totoal_tokens=55537, outputs='annual'
[2024-08-03 15:24:40] [Rank 1] totoal_tokens=53873, outputs='Pelican'
[2024-08-03 15:24:42] [Rank 3] totoal_tokens=54337, outputs='Route 1'
[2024-08-03 15:24:42] [Rank 2] totoal_tokens=52450, outputs='James Vculek'
[2024-08-03 15:24:43] [Rank 0] totoal_tokens=55708, outputs='South'
[2024-08-03 15:24:44] [Rank 1] totoal_tokens=54122, outputs='The Qur’an'
[2024-08-03 15:24:46] [Rank 2] totoal_tokens=52458, outputs='The Color of Pomegranates'
[2024-08-03 15:24:46] [Rank 3] totoal_tokens=54390, outputs='Diagnosed with juvenile diabetes at the age of 21'
[2024-08-03 15:24:46] [Rank 0] totoal_tokens=55831, outputs='France'
[2024-08-03 15:24:48] [Rank 1] totoal_tokens=54134, outputs='West'
[2024-08-03 15:24:50] [Rank 3] totoal_tokens=54393, outputs='The test is passed.'
[2024-08-03 15:24:50] [Rank 0] totoal_tokens=55869, outputs='Store B'
[2024-08-03 15:24:51] [Rank 2] totoal_tokens=52567, outputs='The final step in constructing the model is to use the model to make predictions about'
[2024-08-03 15:24:52] [Rank 1] totoal_tokens=54154, outputs='Bush'
[2024-08-03 15:24:54] [Rank 0] totoal_tokens=55874, outputs='The festival ends with a closing ceremony.'
[2024-08-03 15:24:54] [Rank 3] totoal_tokens=54652, outputs="I don't know"
[2024-08-03 15:24:55] [Rank 1] totoal_tokens=54161, outputs='The 2020 election'
[2024-08-03 15:24:56] [Rank 2] totoal_tokens=53625, outputs='The β€œTalking” Dog of Tiktok'
[2024-08-03 15:24:57] [Rank 3] totoal_tokens=54854, outputs='AIDA'
[2024-08-03 15:24:58] [Rank 0] totoal_tokens=56063, outputs='Celastrus paniculatus'
[2024-08-03 15:24:59] [Rank 1] totoal_tokens=54476, outputs='Route Y'
[2024-08-03 15:25:00] [Rank 2] totoal_tokens=54052, outputs='Toyota'
[2024-08-03 15:25:01] [Rank 3] totoal_tokens=54866, outputs='The AIDA cruise ship was hit by a fire on board.'
[2024-08-03 15:25:01] [Rank 0] totoal_tokens=56082, outputs='Nicolas Marchesi'
β–ˆβ–ˆβ–ˆβ–ˆβ– | 635/751 [11:13<07:53, 4.08s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 85%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 636/751 [11:17<07:29, 3.91s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 85%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 637/751 [11:20<07:10, 3.77s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 85%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 638/751 [11:24<07:06, 3.78s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 85%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 639/751 [11:28<07:08, 3.83s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 85%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 640/751 [11:31<06:51, 3.71s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 85%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 641/751 [11:35<06:50, 3.73s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 85%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 642/751 [11:39<06:46, 3.73s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 86%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 643/751 [11:43<07:01, 3.91s/it] Processing InternVL2-[2024-08-03 15:25:02] [Rank 1] totoal_tokens=54957, outputs='Sacramento'
[2024-08-03 15:25:03] [Rank 2] totoal_tokens=54213, outputs='Low Carb'
[2024-08-03 15:25:05] [Rank 3] totoal_tokens=55331, outputs='Cake'
[2024-08-03 15:25:06] [Rank 0] totoal_tokens=56141, outputs='Wych Cult'
[2024-08-03 15:25:07] [Rank 2] totoal_tokens=54391, outputs='Dusan Tadic'
[2024-08-03 15:25:07] [Rank 1] totoal_tokens=55019, outputs='Aloe Vera'
[2024-08-03 15:25:09] [Rank 3] totoal_tokens=55352, outputs='ice cream'
[2024-08-03 15:25:10] [Rank 0] totoal_tokens=56199, outputs='Investment A'
[2024-08-03 15:25:10] [Rank 2] totoal_tokens=54469, outputs='Rice'
[2024-08-03 15:25:10] [Rank 1] totoal_tokens=55236, outputs='Go to the library'
[2024-08-03 15:25:13] [Rank 3] totoal_tokens=55803, outputs='Go to church'
[2024-08-03 15:25:13] [Rank 0] totoal_tokens=56252, outputs='Bob'
[2024-08-03 15:25:14] [Rank 1] totoal_tokens=55250, outputs='New York'
[2024-08-03 15:25:14] [Rank 2] totoal_tokens=54744, outputs='Mauritania'
[2024-08-03 15:25:17] [Rank 3] totoal_tokens=55974, outputs='Oak'
[2024-08-03 15:25:18] [Rank 2] totoal_tokens=54776, outputs='Bird'
[2024-08-03 15:25:18] [Rank 0] totoal_tokens=56353, outputs='Earth'
[2024-08-03 15:25:18] [Rank 1] totoal_tokens=55252, outputs='Mount Vernon'
[2024-08-03 15:25:22] [Rank 1] totoal_tokens=55532, outputs='Get ready for work'
[2024-08-03 15:25:22] [Rank 3] totoal_tokens=56429, outputs='Zach Glare'
[2024-08-03 15:25:23] [Rank 2] totoal_tokens=54839, outputs='Income Tax'
[2024-08-03 15:25:24] [Rank 0] totoal_tokens=56380, outputs='Dawn'
[2024-08-03 15:25:25] [Rank 3] totoal_tokens=56534, outputs='Lynx'
[2024-08-03 15:25:26] [Rank 1] totoal_tokens=56250, outputs='Cook the noodles on the griddle.'
[2024-08-03 15:25:27] [Rank 2] totoal_tokens=55240, outputs='Catalina'
[2024-08-03 15:25:27] [Rank 0] totoal_tokens=56450, outputs='Red Bull'
[2024-08-03 15:25:29] [Rank 3] totoal_tokens=56792, outputs='The French'
[2024-08-03 15:25:30] [Rank 2] totoal_tokens=55508, outputs='The Astronomer'
[2024-08-03 15:25:31] [Rank 0] totoal_tokens=56713, outputs='Guitar'
[2024-08-03 15:25:31] [Rank 1] totoal_tokens=56329, outputs='banana'
[2024-08-03 15:25:33] [Rank 3] totoal_tokens=56839, outputs='Village Sante'
[2024-08-03 15:25:33] [Rank 2] totoal_tokens=55846, outputs='Andrew Tran'
[2024-08-03 15:25:35] [Rank 0] totoal_tokens=56872, outputs='Coffee'
2B_reasoning-text-test.jsonl: 86%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 644/751 [11:47<06:48, 3.81s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 86%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 645/751 [11:51<07:08, 4.05s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 86%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 646/751 [11:55<06:48, 3.89s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 86%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 647/751 [11:58<06:33, 3.78s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 86%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 648/751 [12:03<07:04, 4.12s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 86%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 649/751 [12:09<07:45, 4.57s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 87%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 650/751 [12:12<07:08, 4.24s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 87%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 651/751 [12:16<06:58, 4.19s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 87%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 652/751[2024-08-03 15:25:35] [Rank 1] totoal_tokens=56612, outputs='The couch is behind the table.'
[2024-08-03 15:25:36] [Rank 3] totoal_tokens=56877, outputs='Northern Europe'
[2024-08-03 15:25:37] [Rank 2] totoal_tokens=56220, outputs='Ye'
[2024-08-03 15:25:39] [Rank 0] totoal_tokens=56875, outputs='The Elva is a convertible roadster.'
[2024-08-03 15:25:39] [Rank 1] totoal_tokens=56754, outputs='Banana'
[2024-08-03 15:25:40] [Rank 3] totoal_tokens=56973, outputs='Paddleboard'
[2024-08-03 15:25:42] [Rank 2] totoal_tokens=56256, outputs='Nintendo 3DS'
[2024-08-03 15:25:42] [Rank 0] totoal_tokens=56989, outputs='The Matrix'
[2024-08-03 15:25:43] [Rank 1] totoal_tokens=56807, outputs='Investment C'
[2024-08-03 15:25:44] [Rank 3] totoal_tokens=57209, outputs='Robert Jackson Federal Courthouse'
[2024-08-03 15:25:46] [Rank 2] totoal_tokens=56323, outputs='Taliaferro'
[2024-08-03 15:25:46] [Rank 1] totoal_tokens=56879, outputs='Maria Butina'
[2024-08-03 15:25:46] [Rank 0] totoal_tokens=57033, outputs='Northern Europe'
[2024-08-03 15:25:48] [Rank 3] totoal_tokens=57214, outputs='food'
[2024-08-03 15:25:50] [Rank 2] totoal_tokens=56556, outputs="I don't have any information about what Liam did before going to bed."
[2024-08-03 15:25:50] [Rank 1] totoal_tokens=56972, outputs='Craig Althof'
[2024-08-03 15:25:51] [Rank 0] totoal_tokens=57123, outputs='Funkhouser'
[2024-08-03 15:25:53] [Rank 3] totoal_tokens=57602, outputs='Chef'
[2024-08-03 15:25:54] [Rank 0] totoal_tokens=57478, outputs='Banana'
[2024-08-03 15:25:54] [Rank 1] totoal_tokens=57256, outputs='Mercado'
[2024-08-03 15:25:55] [Rank 2] totoal_tokens=56691, outputs='Earth'
[2024-08-03 15:25:58] [Rank 3] totoal_tokens=57620, outputs='Wooster Square'
[2024-08-03 15:25:58] [Rank 0] totoal_tokens=57592, outputs='Coffee'
[2024-08-03 15:25:59] [Rank 1] totoal_tokens=57274, outputs='Monsanto'
[2024-08-03 15:26:00] [Rank 2] totoal_tokens=56704, outputs='Grocery stores'
[2024-08-03 15:26:01] [Rank 3] totoal_tokens=57904, outputs='Chris Thomas'
[2024-08-03 15:26:02] [Rank 0] totoal_tokens=57909, outputs='Dr. Johnson'
[2024-08-03 15:26:03] [Rank 1] totoal_tokens=57321, outputs='The student is not allowed to leave for school.'
[2024-08-03 15:26:04] [Rank 2] totoal_tokens=56747, outputs='annual'
[2024-08-03 15:26:05] [Rank 3] totoal_tokens=58159, outputs='Dawn'
[2024-08-03 15:26:06] [Rank 0] totoal_tokens=59078, outputs='Open'
[2024-08-03 15:26:07] [Rank 1] totoal_tokens=57644, outputs='Sean Bonniwell'
[2024-08-03 15:26:09] [Rank 2] totoal_tokens=56882, outputs='Kim Zakka'
[2024-08-03 15:26:09] [Rank 0] totoal_tokens=59226, outputs='GaN HEMTs'
[12:20<06:40, 4.04s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 87%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 653/751 [12:24<06:32, 4.00s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 87%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 654/751 [12:28<06:13, 3.86s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 87%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 655/751 [12:32<06:16, 3.92s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 87%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 656/751 [12:36<06:22, 4.03s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 87%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 657/751 [12:40<06:05, 3.89s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 88%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 658/751 [12:43<05:55, 3.82s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 88%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 659/751 [12:47<05:51, 3.82s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 88%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 660/751 [12:51<05:54, 3.89s/it] Processing InternVL2-2B_reasoning-text-test.js[2024-08-03 15:26:10] [Rank 3] totoal_tokens=58395, outputs='The room is filled with people'
[2024-08-03 15:26:10] [Rank 1] totoal_tokens=57715, outputs='Bob'
[2024-08-03 15:26:12] [Rank 2] totoal_tokens=57089, outputs='COPERT Micro'
[2024-08-03 15:26:14] [Rank 0] totoal_tokens=59273, outputs='Get coffee'
[2024-08-03 15:26:14] [Rank 3] totoal_tokens=58511, outputs='The Expats'
[2024-08-03 15:26:14] [Rank 1] totoal_tokens=57917, outputs='Felt Soul Media'
[2024-08-03 15:26:18] [Rank 2] totoal_tokens=57416, outputs='Get ready for work'
[2024-08-03 15:26:18] [Rank 0] totoal_tokens=59306, outputs='The car that finishes the race first is the winner.'
[2024-08-03 15:26:18] [Rank 3] totoal_tokens=58528, outputs='Amazon'
[2024-08-03 15:26:19] [Rank 1] totoal_tokens=57980, outputs='Northern Hemisphere'
[2024-08-03 15:26:22] [Rank 0] totoal_tokens=59592, outputs='Jean'
[2024-08-03 15:26:22] [Rank 2] totoal_tokens=57499, outputs='ice cream'
[2024-08-03 15:26:22] [Rank 3] totoal_tokens=58637, outputs='Cook'
[2024-08-03 15:26:23] [Rank 1] totoal_tokens=58187, outputs='Porsche'
[2024-08-03 15:26:26] [Rank 0] totoal_tokens=59608, outputs='Cuckoo'
[2024-08-03 15:26:27] [Rank 3] totoal_tokens=59009, outputs='The 13th Five-Year Plan'
[2024-08-03 15:26:27] [Rank 1] totoal_tokens=58417, outputs='Budget'
[2024-08-03 15:26:27] [Rank 2] totoal_tokens=58593, outputs='Dee'
[2024-08-03 15:26:31] [Rank 1] totoal_tokens=58439, outputs='Cardi B'
[2024-08-03 15:26:32] [Rank 0] totoal_tokens=59703, outputs='France'
[2024-08-03 15:26:32] [Rank 2] totoal_tokens=58756, outputs='Investment C'
[2024-08-03 15:26:32] [Rank 3] totoal_tokens=59461, outputs='Giovanni Maria Farina'
[2024-08-03 15:26:35] [Rank 1] totoal_tokens=58441, outputs='Andrew and Pam Wennell got married in October in front of just their two registr'
[2024-08-03 15:26:36] [Rank 0] totoal_tokens=59935, outputs='Agriculture'
[2024-08-03 15:26:36] [Rank 2] totoal_tokens=58906, outputs='Palm'
[2024-08-03 15:26:36] [Rank 3] totoal_tokens=59555, outputs='Earth'
[2024-08-03 15:26:39] [Rank 1] totoal_tokens=58953, outputs='The Dude'
[2024-08-03 15:26:40] [Rank 3] totoal_tokens=59702, outputs='apple'
[2024-08-03 15:26:40] [Rank 2] totoal_tokens=59038, outputs='Ian'
[2024-08-03 15:26:41] [Rank 0] totoal_tokens=59992, outputs='Apple'
[2024-08-03 15:26:45] [Rank 3] totoal_tokens=59739, outputs='The dragonfly'
[2024-08-03 15:26:45] [Rank 1] totoal_tokens=59097, outputs='The Atlantic Ocean'
[2024-08-03 15:26:45] [Rank 0] totoal_tokens=60007, outputs='Saban'
onl: 88%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 661/751 [12:55<05:45, 3.83s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 88%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 662/751 [12:59<05:59, 4.04s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 88%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 663/751 [13:04<06:02, 4.12s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 88%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 664/751 [13:07<05:51, 4.04s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 89%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 665/751 [13:12<05:52, 4.10s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 89%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 666/751 [13:17<06:25, 4.54s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 89%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 667/751 [13:21<06:05, 4.35s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 89%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 668/751 [13:26<06:10, 4.46s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 89%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 669/751 [13:30<06:00, 4.39s/it][2024-08-03 15:26:45] [Rank 2] totoal_tokens=59043, outputs='She takes a yoga class.'
[2024-08-03 15:26:49] [Rank 3] totoal_tokens=59777, outputs='The Great Depression'
[2024-08-03 15:26:49] [Rank 1] totoal_tokens=59612, outputs='James Young'
[2024-08-03 15:26:49] [Rank 0] totoal_tokens=60187, outputs='Pecan'
[2024-08-03 15:26:49] [Rank 2] totoal_tokens=59807, outputs='Earth'
[2024-08-03 15:26:54] [Rank 3] totoal_tokens=59915, outputs='North America'
[2024-08-03 15:26:54] [Rank 2] totoal_tokens=59896, outputs='Fight'
[2024-08-03 15:26:54] [Rank 0] totoal_tokens=60412, outputs='William Shakespeare'
[2024-08-03 15:26:54] [Rank 1] totoal_tokens=59642, outputs='Mako Komuro'
[2024-08-03 15:26:58] [Rank 3] totoal_tokens=60201, outputs='December 31'
[2024-08-03 15:26:58] [Rank 2] totoal_tokens=59904, outputs='The Tate'
[2024-08-03 15:26:59] [Rank 0] totoal_tokens=60681, outputs='banana'
[2024-08-03 15:27:00] [Rank 1] totoal_tokens=60087, outputs='Alma Moreno'
[2024-08-03 15:27:02] [Rank 3] totoal_tokens=60226, outputs='Mansourasaurus'
[2024-08-03 15:27:03] [Rank 0] totoal_tokens=61093, outputs='Suzuki'
[2024-08-03 15:27:04] [Rank 2] totoal_tokens=60408, outputs='Get the bus to school'
[2024-08-03 15:27:04] [Rank 1] totoal_tokens=60206, outputs='Eddie Izzard'
[2024-08-03 15:27:06] [Rank 3] totoal_tokens=60616, outputs='Patrick Phelan'
[2024-08-03 15:27:07] [Rank 0] totoal_tokens=61219, outputs='Ashley Graham'
[2024-08-03 15:27:09] [Rank 2] totoal_tokens=60472, outputs='The Daughter Of Dawn'
[2024-08-03 15:27:10] [Rank 3] totoal_tokens=60623, outputs='She is on an Antarctic expedition'
[2024-08-03 15:27:10] [Rank 1] totoal_tokens=60729, outputs='Rosmarinus'
[2024-08-03 15:27:12] [Rank 0] totoal_tokens=61255, outputs='Tommy Lapid'
[2024-08-03 15:27:12] [Rank 2] totoal_tokens=60914, outputs='Dance'
[2024-08-03 15:27:15] [Rank 1] totoal_tokens=60769, outputs='AmaWaterways'
[2024-08-03 15:27:16] [Rank 3] totoal_tokens=60719, outputs='J.K. Rowling'
[2024-08-03 15:27:16] [Rank 2] totoal_tokens=61257, outputs='Get married'
[2024-08-03 15:27:17] [Rank 0] totoal_tokens=62000, outputs='Peregrine falcon'
[2024-08-03 15:27:19] [Rank 1] totoal_tokens=60840, outputs='Antti Ilvessuo'
[2024-08-03 15:27:20] [Rank 3] totoal_tokens=60907, outputs='Paris'
[2024-08-03 15:27:21] [Rank 2] totoal_tokens=61418, outputs='David Gooding'
[2024-08-03 15:27:21] [Rank 0] totoal_tokens=62102, outputs='Reed Exhibitions'
[2024-08-03 15:27:23] [Rank 1] totoal_tokens=60931, outputs='David Bowie'
[2024-08-03 15:27:24] [Rank 3] totoal_tokens=61673, outputs='NASA'
[2024-08-03 15:27:25] [Rank 2] totoal_tokens=61418, outputs='Stanbrook Abbey'
[2024-08-03 15:27:25] [Rank 0] totoal_tokens=62219, outputs='Jouman Abdu'
Processing InternVL2-2B_reasoning-text-test.jsonl: 89%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 670/751 [13:35<06:02, 4.47s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 89%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 671/751 [13:40<06:08, 4.60s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 89%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 672/751 [13:45<06:14, 4.74s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 90%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 673/751 [13:49<05:51, 4.51s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 90%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 674/751 [13:53<05:36, 4.37s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 90%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 675/751 [13:57<05:26, 4.29s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 90%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 676/751 [14:02<05:38, 4.51s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 90%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 677/751 [14:06<05:24, 4.39s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 90%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ[2024-08-03 15:27:27] [Rank 1] totoal_tokens=61009, outputs='Fighting Bear Antiques'
[2024-08-03 15:27:28] [Rank 3] totoal_tokens=61853, outputs='The beer is brewed'
[2024-08-03 15:27:31] [Rank 2] totoal_tokens=61519, outputs='The Relens'
[2024-08-03 15:27:31] [Rank 0] totoal_tokens=63278, outputs='Drink water'
[2024-08-03 15:27:31] [Rank 1] totoal_tokens=61219, outputs='Mark Greene'
[2024-08-03 15:27:33] [Rank 3] totoal_tokens=61892, outputs='Konami'
[2024-08-03 15:27:35] [Rank 0] totoal_tokens=64013, outputs='Answer: The West Coast'
[2024-08-03 15:27:35] [Rank 2] totoal_tokens=61640, outputs='Patrick Schwarzenegger'
[2024-08-03 15:27:35] [Rank 1] totoal_tokens=62224, outputs="I'm a fan of the show."
[2024-08-03 15:27:37] [Rank 3] totoal_tokens=62221, outputs='Gary Hooper'
[2024-08-03 15:27:39] [Rank 2] totoal_tokens=61999, outputs='Cara'
[2024-08-03 15:27:40] [Rank 1] totoal_tokens=62318, outputs='Audi'
[2024-08-03 15:27:40] [Rank 0] totoal_tokens=64086, outputs='Pink'
[2024-08-03 15:27:41] [Rank 3] totoal_tokens=62455, outputs='Samsung'
[2024-08-03 15:27:44] [Rank 2] totoal_tokens=62254, outputs='Low sodium, low salt diet'
[2024-08-03 15:27:45] [Rank 0] totoal_tokens=64140, outputs='The Broken Destiny'
[2024-08-03 15:27:45] [Rank 1] totoal_tokens=62490, outputs='North'
[2024-08-03 15:27:46] [Rank 3] totoal_tokens=62496, outputs='Coffee'
[2024-08-03 15:27:48] [Rank 2] totoal_tokens=62286, outputs='China'
[2024-08-03 15:27:49] [Rank 0] totoal_tokens=64142, outputs='Arsenal'
[2024-08-03 15:27:49] [Rank 1] totoal_tokens=62750, outputs='The roof'
[2024-08-03 15:27:51] [Rank 3] totoal_tokens=62596, outputs='Kathy'
[2024-08-03 15:27:53] [Rank 0] totoal_tokens=64147, outputs='The Walking Dead'
[2024-08-03 15:27:55] [Rank 1] totoal_tokens=63204, outputs='Jim Anker'
[2024-08-03 15:27:55] [Rank 2] totoal_tokens=62757, outputs='The festival concludes with a fireworks display.'
[2024-08-03 15:27:55] [Rank 3] totoal_tokens=62622, outputs='South'
[2024-08-03 15:27:57] [Rank 0] totoal_tokens=64308, outputs='the people'
[2024-08-03 15:27:59] [Rank 2] totoal_tokens=62850, outputs='Cactus'
[2024-08-03 15:28:00] [Rank 1] totoal_tokens=63255, outputs='Diana Vreeland'
[2024-08-03 15:28:01] [Rank 3] totoal_tokens=62739, outputs='annually'
[2024-08-03 15:28:03] [Rank 0] totoal_tokens=64431, outputs='The House That Jack Built'
[2024-08-03 15:28:04] [Rank 2] totoal_tokens=63270, outputs='Sergey Kryachkov'
[2024-08-03 15:28:05] [Rank 1] totoal_tokens=63682, outputs='Gustave Eiffel'
[2024-08-03 15:28:06] [Rank 3] totoal_tokens=62989, outputs='The foundation is a place where the building is built'
[2024-08-03 15:28:08] [Rank 0] totoal_tokens=64470, outputs='The movie is released.'
β–ˆβ–ˆβ–ˆβ–ˆ | 678/751 [14:11<05:27, 4.49s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 90%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 679/751 [14:16<05:38, 4.71s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 91%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 680/751 [14:20<05:24, 4.57s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 91%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 681/751 [14:25<05:34, 4.77s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 91%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 682/751 [14:30<05:22, 4.68s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 91%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 683/751 [14:34<05:08, 4.53s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 91%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 684/751 [14:39<05:06, 4.57s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 91%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 685/751 [14:43<04:51, 4.41s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 91%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 686/751 [14:49<05:12, 4.81s/it] Processing InternVL2-2[2024-08-03 15:28:09] [Rank 2] totoal_tokens=63352, outputs='Rubus idaeus'
[2024-08-03 15:28:10] [Rank 1] totoal_tokens=63870, outputs='The production of the book'
[2024-08-03 15:28:12] [Rank 3] totoal_tokens=63433, outputs='The 85th Euroconstruct Conference was held in the Finlandia Hall, H'
[2024-08-03 15:28:14] [Rank 0] totoal_tokens=64471, outputs='Beech'
[2024-08-03 15:28:14] [Rank 2] totoal_tokens=63864, outputs='The Lee family has lived in and been a part of Virginia history since long before'
[2024-08-03 15:28:14] [Rank 1] totoal_tokens=63880, outputs='The Dark Tower'
[2024-08-03 15:28:16] [Rank 3] totoal_tokens=63491, outputs='Brie Larson'
[2024-08-03 15:28:18] [Rank 0] totoal_tokens=64899, outputs="I don't know"
[2024-08-03 15:28:18] [Rank 2] totoal_tokens=64040, outputs='Hayes Carll'
[2024-08-03 15:28:18] [Rank 1] totoal_tokens=63913, outputs='Banana'
[2024-08-03 15:28:21] [Rank 3] totoal_tokens=63711, outputs='Cara'
[2024-08-03 15:28:23] [Rank 1] totoal_tokens=64106, outputs='Nicholas Mak'
[2024-08-03 15:28:23] [Rank 0] totoal_tokens=64929, outputs='Dawn'
[2024-08-03 15:28:24] [Rank 2] totoal_tokens=64183, outputs='The festival concludes with the award ceremony.'
[2024-08-03 15:28:26] [Rank 3] totoal_tokens=63735, outputs='She is a writer'
[2024-08-03 15:28:27] [Rank 1] totoal_tokens=64149, outputs='Store C'
[2024-08-03 15:28:27] [Rank 0] totoal_tokens=64949, outputs='Rinne Groff'
[2024-08-03 15:28:28] [Rank 2] totoal_tokens=64527, outputs='Junaid Hussain'
[2024-08-03 15:28:31] [Rank 3] totoal_tokens=63778, outputs='Dawn'
[2024-08-03 15:28:31] [Rank 1] totoal_tokens=64328, outputs='Northern'
[2024-08-03 15:28:32] [Rank 0] totoal_tokens=64986, outputs='The Walking Dead'
[2024-08-03 15:28:34] [Rank 2] totoal_tokens=64616, outputs='The festival concludes with a concert.'
[2024-08-03 15:28:35] [Rank 3] totoal_tokens=63882, outputs='George Milpurrurru Dawidi'
[2024-08-03 15:28:36] [Rank 1] totoal_tokens=64527, outputs='Nancy Andrews'
[2024-08-03 15:28:37] [Rank 0] totoal_tokens=65223, outputs='The game is played on the computer.'
[2024-08-03 15:28:38] [Rank 2] totoal_tokens=64725, outputs='The left'
[2024-08-03 15:28:39] [Rank 3] totoal_tokens=63976, outputs='Soda'
[2024-08-03 15:28:41] [Rank 1] totoal_tokens=64532, outputs='Annual'
[2024-08-03 15:28:41] [Rank 0] totoal_tokens=65401, outputs='Golf'
[2024-08-03 15:28:42] [Rank 2] totoal_tokens=64729, outputs='Fire Emblem: Fates'
[2024-08-03 15:28:43] [Rank 3] totoal_tokens=64444, outputs='Apple Maps'
[2024-08-03 15:28:45] [Rank 1] totoal_tokens=64744, outputs='The Romanian online market posted, in S1, an increase of 7'
[2024-08-03 15:28:46] [Rank 0] totoal_tokens=65480, outputs='Airplane'
B_reasoning-text-test.jsonl: 91%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 687/751 [14:54<05:15, 4.94s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 92%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 688/751 [14:59<05:14, 5.00s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 92%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 689/751 [15:03<04:56, 4.78s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 92%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 690/751 [15:08<04:54, 4.82s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 92%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 691/751 [15:13<04:42, 4.71s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 92%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 692/751 [15:18<04:47, 4.88s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 92%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 693/751 [15:22<04:34, 4.74s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 92%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 694/751 [15:26<04:21, 4.59s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 93%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆοΏ½[2024-08-03 15:28:47] [Rank 2] totoal_tokens=64729, outputs='Northern'
[2024-08-03 15:28:48] [Rank 3] totoal_tokens=64467, outputs='Oil painting'
[2024-08-03 15:28:50] [Rank 0] totoal_tokens=65729, outputs='Litmus'
[2024-08-03 15:28:51] [Rank 1] totoal_tokens=64936, outputs='Eddie Izzard'
[2024-08-03 15:28:51] [Rank 2] totoal_tokens=64834, outputs='The festival concludes the 2015 Winter Olympics.'
[2024-08-03 15:28:52] [Rank 3] totoal_tokens=64550, outputs='the government'
[2024-08-03 15:28:55] [Rank 0] totoal_tokens=65847, outputs='Scotts Miracle-Gro'
[2024-08-03 15:28:55] [Rank 1] totoal_tokens=65466, outputs='Store C'
[2024-08-03 15:28:56] [Rank 2] totoal_tokens=64902, outputs='Get ready for work'
[2024-08-03 15:28:57] [Rank 3] totoal_tokens=65553, outputs='Aloe Vera'
[2024-08-03 15:29:00] [Rank 1] totoal_tokens=65839, outputs='Pelican'
[2024-08-03 15:29:01] [Rank 2] totoal_tokens=64902, outputs='The fastest route is the one that is closest to the start line.'
[2024-08-03 15:29:01] [Rank 0] totoal_tokens=65881, outputs='Eagle'
[2024-08-03 15:29:01] [Rank 3] totoal_tokens=65795, outputs='Get ready for work'
[2024-08-03 15:29:05] [Rank 2] totoal_tokens=64946, outputs='Eddie Izzard'
[2024-08-03 15:29:06] [Rank 0] totoal_tokens=65945, outputs='Germain HΓ΄tels'
[2024-08-03 15:29:06] [Rank 1] totoal_tokens=66163, outputs='June 2020'
[2024-08-03 15:29:06] [Rank 3] totoal_tokens=65808, outputs='Eat more vegetables'
[2024-08-03 15:29:10] [Rank 2] totoal_tokens=64970, outputs='Serge Ibaka'
[2024-08-03 15:29:10] [Rank 0] totoal_tokens=66162, outputs='Cactus'
[2024-08-03 15:29:10] [Rank 1] totoal_tokens=66236, outputs='Google Play Store'
[2024-08-03 15:29:11] [Rank 3] totoal_tokens=65916, outputs='Kevin McCarthy'
[2024-08-03 15:29:15] [Rank 0] totoal_tokens=66314, outputs='Ginna Brelsford'
[2024-08-03 15:29:15] [Rank 2] totoal_tokens=65411, outputs='Alex Aventuria'
[2024-08-03 15:29:16] [Rank 3] totoal_tokens=65974, outputs='South'
[2024-08-03 15:29:16] [Rank 1] totoal_tokens=66618, outputs='Ladybug'
[2024-08-03 15:29:20] [Rank 3] totoal_tokens=65978, outputs='Fred Smith'
[2024-08-03 15:29:21] [Rank 0] totoal_tokens=67115, outputs='Dawn'
[2024-08-03 15:29:21] [Rank 2] totoal_tokens=65647, outputs='Barnes & Noble'
[2024-08-03 15:29:22] [Rank 1] totoal_tokens=67602, outputs='ZK8880'
[2024-08-03 15:29:26] [Rank 2] totoal_tokens=65966, outputs="The German's decision"
[2024-08-03 15:29:26] [Rank 1] totoal_tokens=67707, outputs='ice cream'
[2024-08-03 15:29:26] [Rank 3] totoal_tokens=66279, outputs='Diet Plan C'
[2024-08-03 15:29:26] [Rank 0] totoal_tokens=67953, outputs='The ice cream is served after the salad.'
[2024-08-03 15:29:31] [Rank 2] totoal_tokens=66214, outputs='Amazon'
[2024-08-03 15:29:32] [Rank 0] totoal_tokens=68157, outputs='Pigeon'
οΏ½οΏ½β–ˆβ–Ž| 695/751 [15:31<04:22, 4.69s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 93%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž| 696/751 [15:36<04:13, 4.61s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 93%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž| 697/751 [15:40<04:08, 4.60s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 93%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž| 698/751 [15:46<04:24, 4.99s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 93%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž| 699/751 [15:51<04:12, 4.86s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 93%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž| 700/751 [15:55<04:04, 4.79s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 93%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž| 701/751 [16:00<03:57, 4.74s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 93%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž| 702/751 [16:06<04:13, 5.16s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 94%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž| 703/751 [16:12<04:13, 5.28s/it] Processing In[2024-08-03 15:29:32] [Rank 3] totoal_tokens=66483, outputs='Time blocking'
[2024-08-03 15:29:33] [Rank 1] totoal_tokens=67754, outputs='in his pocket'
[2024-08-03 15:29:36] [Rank 2] totoal_tokens=66233, outputs='The Mandalorian'
[2024-08-03 15:29:37] [Rank 0] totoal_tokens=68466, outputs='She is a freelance writer'
[2024-08-03 15:29:38] [Rank 3] totoal_tokens=66685, outputs='She is a teacher'
[2024-08-03 15:29:38] [Rank 1] totoal_tokens=67900, outputs='The answer is "Morningside PlayCare."'
[2024-08-03 15:29:41] [Rank 2] totoal_tokens=66632, outputs='The room is set up with a desk, chair, and computer.'
[2024-08-03 15:29:43] [Rank 1] totoal_tokens=67913, outputs="I don't know"
[2024-08-03 15:29:43] [Rank 3] totoal_tokens=66947, outputs='in the backyard'
[2024-08-03 15:29:43] [Rank 0] totoal_tokens=69798, outputs='Lucy'
[2024-08-03 15:29:46] [Rank 2] totoal_tokens=66686, outputs='sunflower'
[2024-08-03 15:29:48] [Rank 3] totoal_tokens=67085, outputs='The model is a 3D computer-generated image of the scene.'
[2024-08-03 15:29:49] [Rank 1] totoal_tokens=67948, outputs='South Africa'
[2024-08-03 15:29:49] [Rank 0] totoal_tokens=69877, outputs='The model is a 3D computer-generated image.'
[2024-08-03 15:29:50] [Rank 2] totoal_tokens=67596, outputs='West'
[2024-08-03 15:29:53] [Rank 3] totoal_tokens=67952, outputs='The 2017 official estimate reported a population of 1,029,556'
[2024-08-03 15:29:55] [Rank 1] totoal_tokens=67961, outputs='Goyal'
[2024-08-03 15:29:55] [Rank 2] totoal_tokens=67900, outputs='Bird'
[2024-08-03 15:29:55] [Rank 0] totoal_tokens=69954, outputs='Made In Chelsea'
[2024-08-03 15:29:59] [Rank 3] totoal_tokens=68179, outputs='The ShareTek Story'
[2024-08-03 15:30:00] [Rank 1] totoal_tokens=68499, outputs='Dr. Saeed I. Latif'
[2024-08-03 15:30:01] [Rank 2] totoal_tokens=68582, outputs='Apple pie'
[2024-08-03 15:30:01] [Rank 0] totoal_tokens=70284, outputs='Mary Baum'
[2024-08-03 15:30:03] [Rank 3] totoal_tokens=68338, outputs='Car B'
[2024-08-03 15:30:05] [Rank 1] totoal_tokens=68507, outputs="Morrison's Rogue River Lodge"
[2024-08-03 15:30:06] [Rank 2] totoal_tokens=68755, outputs='Mike Tyson'
[2024-08-03 15:30:07] [Rank 0] totoal_tokens=70403, outputs='Grocery stores'
[2024-08-03 15:30:09] [Rank 3] totoal_tokens=68576, outputs='The 20th century'
[2024-08-03 15:30:10] [Rank 1] totoal_tokens=68781, outputs='Oil painting'
[2024-08-03 15:30:11] [Rank 2] totoal_tokens=68756, outputs='Jared Leto'
[2024-08-03 15:30:13] [Rank 0] totoal_tokens=70476, outputs='Mack'
[2024-08-03 15:30:13] [Rank 3] totoal_tokens=68743, outputs='Mobile phones'
[2024-08-03 15:30:16] [Rank 1] totoal_tokens=68888, outputs='sunflower'
[2024-08-03 15:30:16] [Rank 2] totoal_tokens=68838, outputs='The festival ends with a closing ceremony.'
[2024-08-03 15:30:19] [Rank 0] totoal_tokens=70481, outputs='oak'
ternVL2-2B_reasoning-text-test.jsonl: 94%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž| 704/751 [16:17<04:05, 5.22s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 94%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 705/751 [16:22<04:01, 5.24s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 94%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 706/751 [16:29<04:13, 5.63s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 94%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 707/751 [16:34<04:08, 5.65s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 94%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 708/751 [16:40<04:07, 5.76s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 94%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 709/751 [16:47<04:07, 5.90s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 95%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 710/751 [16:53<04:01, 5.90s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 95%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 711/751 [16:58<03:54, 5.85s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 95%|β–ˆβ–ˆβ–ˆβ–ˆοΏ½[2024-08-03 15:30:19] [Rank 3] totoal_tokens=69201, outputs='The plane was the coolest'
[2024-08-03 15:30:21] [Rank 1] totoal_tokens=69193, outputs='ice cream'
[2024-08-03 15:30:24] [Rank 2] totoal_tokens=68873, outputs='Bakir Izetbegović'
[2024-08-03 15:30:24] [Rank 0] totoal_tokens=70519, outputs='Aromatase'
[2024-08-03 15:30:25] [Rank 3] totoal_tokens=69297, outputs='God'
[2024-08-03 15:30:27] [Rank 1] totoal_tokens=69611, outputs='Kiin Kiin'
[2024-08-03 15:30:29] [Rank 0] totoal_tokens=70707, outputs='Go to the museum'
[2024-08-03 15:30:29] [Rank 2] totoal_tokens=68991, outputs='I will update this post in response to the reports.'
[2024-08-03 15:30:30] [Rank 3] totoal_tokens=69601, outputs='Marketing'
[2024-08-03 15:30:32] [Rank 1] totoal_tokens=69675, outputs='Route X'
[2024-08-03 15:30:35] [Rank 3] totoal_tokens=70571, outputs='Route X'
[2024-08-03 15:30:35] [Rank 0] totoal_tokens=70940, outputs='Fans'
[2024-08-03 15:30:36] [Rank 2] totoal_tokens=69505, outputs='Mei Mei'
[2024-08-03 15:30:39] [Rank 1] totoal_tokens=70432, outputs='The Oak Tree'
[2024-08-03 15:30:41] [Rank 2] totoal_tokens=69743, outputs='Route X'
[2024-08-03 15:30:42] [Rank 0] totoal_tokens=70949, outputs='The stock market'
[2024-08-03 15:30:42] [Rank 3] totoal_tokens=71208, outputs='Houston, Texas based Kinetic Motorcycles was founded in 2013 by'
[2024-08-03 15:30:44] [Rank 1] totoal_tokens=70515, outputs='The yellow book'
[2024-08-03 15:30:46] [Rank 2] totoal_tokens=70168, outputs='Sri Lanka'
[2024-08-03 15:30:47] [Rank 0] totoal_tokens=71042, outputs='The couch is in the corner.'
[2024-08-03 15:30:48] [Rank 3] totoal_tokens=71597, outputs='Kim Eun-hee'
[2024-08-03 15:30:50] [Rank 1] totoal_tokens=70755, outputs='The company is working on a thinner, lighter laptop that will have slimmer'
[2024-08-03 15:30:51] [Rank 2] totoal_tokens=70294, outputs='Linda'
[2024-08-03 15:30:52] [Rank 0] totoal_tokens=71146, outputs='Kevin Stewart'
[2024-08-03 15:30:52] [Rank 3] totoal_tokens=71677, outputs='Dawn'
[2024-08-03 15:30:55] [Rank 1] totoal_tokens=71146, outputs='JΓΌrgen Klopp'
[2024-08-03 15:30:56] [Rank 2] totoal_tokens=70337, outputs='Paul'
[2024-08-03 15:30:59] [Rank 0] totoal_tokens=71234, outputs='Alexey Gavrishev'
[2024-08-03 15:30:59] [Rank 3] totoal_tokens=71718, outputs='Thunder Valley'
[2024-08-03 15:31:01] [Rank 1] totoal_tokens=71205, outputs='March'
[2024-08-03 15:31:02] [Rank 2] totoal_tokens=70495, outputs='Grace Graupe-Pillard'
[2024-08-03 15:31:04] [Rank 3] totoal_tokens=71897, outputs='BUTCH'
[2024-08-03 15:31:04] [Rank 0] totoal_tokens=71535, outputs='Topshop'
[2024-08-03 15:31:06] [Rank 1] totoal_tokens=71207, outputs='Thomas Jefferson'
[2024-08-03 15:31:07] [Rank 2] totoal_tokens=70505, outputs='Store C'
[2024-08-03 15:31:09] [Rank 0] totoal_tokens=71576, outputs='The festival concludes with a concert.'
οΏ½οΏ½β–ˆβ–ˆβ–ˆβ–ˆβ–| 712/751 [17:05<03:53, 5.99s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 95%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 713/751 [17:09<03:34, 5.64s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 95%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ| 714/751 [17:14<03:21, 5.44s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 95%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ| 715/751 [17:21<03:26, 5.74s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 95%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ| 716/751 [17:27<03:25, 5.88s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 95%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ| 717/751 [17:32<03:09, 5.59s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 96%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ| 718/751 [17:37<03:01, 5.51s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 96%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ| 719/751 [17:44<03:07, 5.84s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 96%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ| 720/751 [17:49<02:58, 5.76s/it] Proc[2024-08-03 15:31:09] [Rank 3] totoal_tokens=72038, outputs='Jordan'
[2024-08-03 15:31:11] [Rank 1] totoal_tokens=71364, outputs='Jack'
[2024-08-03 15:31:13] [Rank 2] totoal_tokens=71193, outputs='Bila Tserkva'
[2024-08-03 15:31:15] [Rank 3] totoal_tokens=72114, outputs='The Force Awakens'
[2024-08-03 15:31:16] [Rank 0] totoal_tokens=71997, outputs='The company that launches its product last is not specified in the article.'
[2024-08-03 15:31:16] [Rank 1] totoal_tokens=71380, outputs='Bush'
[2024-08-03 15:31:18] [Rank 2] totoal_tokens=71378, outputs='Bush'
[2024-08-03 15:31:21] [Rank 3] totoal_tokens=72515, outputs='2014'
[2024-08-03 15:31:22] [Rank 0] totoal_tokens=72290, outputs='Gerard Schiffman'
[2024-08-03 15:31:22] [Rank 1] totoal_tokens=71534, outputs='The tree grows the slowest.'
[2024-08-03 15:31:23] [Rank 2] totoal_tokens=71529, outputs='Saturn'
[2024-08-03 15:31:27] [Rank 0] totoal_tokens=72424, outputs='Jenna'
[2024-08-03 15:31:27] [Rank 3] totoal_tokens=72591, outputs='June 10'
[2024-08-03 15:31:27] [Rank 1] totoal_tokens=71553, outputs='Northern Europe'
[2024-08-03 15:31:28] [Rank 2] totoal_tokens=71562, outputs='Bird'
[2024-08-03 15:31:32] [Rank 0] totoal_tokens=72540, outputs='She is a nurse'
[2024-08-03 15:31:32] [Rank 3] totoal_tokens=72979, outputs='Soy'
[2024-08-03 15:31:32] [Rank 1] totoal_tokens=71765, outputs='She is a self-taught freelance documentary photographer.'
[2024-08-03 15:31:34] [Rank 2] totoal_tokens=71578, outputs='Maria'
[2024-08-03 15:31:37] [Rank 3] totoal_tokens=73364, outputs='Nord Stream 2'
[2024-08-03 15:31:38] [Rank 1] totoal_tokens=72429, outputs='Jupiter'
[2024-08-03 15:31:38] [Rank 0] totoal_tokens=72601, outputs='Dawn'
[2024-08-03 15:31:40] [Rank 2] totoal_tokens=71944, outputs='He wakes up'
[2024-08-03 15:31:43] [Rank 0] totoal_tokens=72678, outputs='The Great Gatsby'
[2024-08-03 15:31:44] [Rank 3] totoal_tokens=73448, outputs='No'
[2024-08-03 15:31:44] [Rank 1] totoal_tokens=72609, outputs='The oak tree is planted in the ground.'
[2024-08-03 15:31:45] [Rank 2] totoal_tokens=72053, outputs='Johnson'
[2024-08-03 15:31:48] [Rank 0] totoal_tokens=73329, outputs='Water'
[2024-08-03 15:31:49] [Rank 3] totoal_tokens=73462, outputs='The couch'
[2024-08-03 15:31:50] [Rank 1] totoal_tokens=72688, outputs='The 2019 festival'
[2024-08-03 15:31:51] [Rank 2] totoal_tokens=72472, outputs='Lisa Wilcox'
[2024-08-03 15:31:54] [Rank 0] totoal_tokens=73562, outputs='NXT'
essing InternVL2-2B_reasoning-text-test.jsonl: 96%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ| 721/751 [17:55<02:47, 5.57s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 96%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ| 722/751 [18:01<02:47, 5.79s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 96%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹| 723/751 [18:07<02:44, 5.88s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 96%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹| 724/751 [18:12<02:31, 5.62s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 97%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹| 725/751 [18:17<02:23, 5.50s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 97%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹| 726/751 [18:23<02:20, 5.63s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 97%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹| 727/751 [18:29<02:13, 5.57s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 97%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹| 728/751 [18:34<02:05, 5.46s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 97%|β–ˆοΏ½[2024-08-03 15:31:54] [Rank 3] totoal_tokens=74915, outputs='Oak tree'
[2024-08-03 15:31:55] [Rank 1] totoal_tokens=72799, outputs='Mari Kurokawa'
[2024-08-03 15:31:57] [Rank 2] totoal_tokens=73016, outputs='The Reverend Shawn Amos & The Brother'
[2024-08-03 15:32:01] [Rank 1] totoal_tokens=73348, outputs='Gameloft'
[2024-08-03 15:32:01] [Rank 3] totoal_tokens=74993, outputs='Mr. Shekhar Suman'
[2024-08-03 15:32:02] [Rank 0] totoal_tokens=73641, outputs='Rest'
[2024-08-03 15:32:04] [Rank 2] totoal_tokens=73085, outputs='June 7, 2017'
[2024-08-03 15:32:07] [Rank 1] totoal_tokens=73486, outputs='The 2015 European Rally Championship'
[2024-08-03 15:32:07] [Rank 3] totoal_tokens=75056, outputs='The 2019 ARLC'
[2024-08-03 15:32:08] [Rank 0] totoal_tokens=73903, outputs='South'
[2024-08-03 15:32:09] [Rank 2] totoal_tokens=73211, outputs='I’m not sure'
[2024-08-03 15:32:12] [Rank 1] totoal_tokens=73594, outputs='pigeon'
[2024-08-03 15:32:13] [Rank 0] totoal_tokens=74624, outputs='Doug'
[2024-08-03 15:32:14] [Rank 2] totoal_tokens=73232, outputs='The couch is on the floor.'
[2024-08-03 15:32:15] [Rank 3] totoal_tokens=75285, outputs='Route X'
[2024-08-03 15:32:17] [Rank 1] totoal_tokens=73948, outputs='Dawn'
[2024-08-03 15:32:19] [Rank 0] totoal_tokens=74863, outputs='The mother shop is located in the mall.'
[2024-08-03 15:32:19] [Rank 2] totoal_tokens=73468, outputs='Magene Noir'
[2024-08-03 15:32:21] [Rank 3] totoal_tokens=75693, outputs='John Mendenhall'
[2024-08-03 15:32:23] [Rank 1] totoal_tokens=74885, outputs='Bardstown Bourbon Co.'
[2024-08-03 15:32:24] [Rank 0] totoal_tokens=74895, outputs='Police Academy'
[2024-08-03 15:32:25] [Rank 2] totoal_tokens=73577, outputs='Growing'
[2024-08-03 15:32:26] [Rank 3] totoal_tokens=75829, outputs='Pizza'
[2024-08-03 15:32:29] [Rank 1] totoal_tokens=75285, outputs='The American Meteorological Society'
[2024-08-03 15:32:30] [Rank 0] totoal_tokens=75064, outputs="I'm sorry, I can't help you with that."
[2024-08-03 15:32:30] [Rank 2] totoal_tokens=73824, outputs='AIDA'
[2024-08-03 15:32:32] [Rank 3] totoal_tokens=75900, outputs='Billy Casper'
[2024-08-03 15:32:35] [Rank 1] totoal_tokens=75401, outputs='Liam Belle Elion'
[2024-08-03 15:32:36] [Rank 2] totoal_tokens=74404, outputs='The interview'
[2024-08-03 15:32:37] [Rank 0] totoal_tokens=75192, outputs="I'm sorry, I can't help you with that."
[2024-08-03 15:32:39] [Rank 3] totoal_tokens=75923, outputs='The 2021 @HerculesTires CAA'
[2024-08-03 15:32:41] [Rank 1] totoal_tokens=75582, outputs='John Mills'
[2024-08-03 15:32:42] [Rank 0] totoal_tokens=75683, outputs='Danny Leong'
οΏ½οΏ½β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹| 729/751 [18:39<01:58, 5.40s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 97%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹| 730/751 [18:47<02:12, 6.29s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 97%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹| 731/751 [18:53<02:03, 6.19s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 97%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹| 732/751 [18:59<01:52, 5.92s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 98%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š| 733/751 [19:04<01:43, 5.76s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 98%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š| 734/751 [19:09<01:36, 5.67s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 98%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š| 735/751 [19:15<01:30, 5.64s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 98%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š| 736/751 [19:22<01:31, 6.09s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 98%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š| 737/751 [19:28<01:22, 5.90s[2024-08-03 15:32:43] [Rank 2] totoal_tokens=74419, outputs='Denon'
[2024-08-03 15:32:45] [Rank 3] totoal_tokens=76179, outputs='Guy'
[2024-08-03 15:32:47] [Rank 1] totoal_tokens=75870, outputs='Deborah Logan'
[2024-08-03 15:32:48] [Rank 0] totoal_tokens=75944, outputs='Corn'
[2024-08-03 15:32:49] [Rank 2] totoal_tokens=74682, outputs='Graham'
[2024-08-03 15:32:50] [Rank 3] totoal_tokens=76217, outputs='Mohammad Shami'
[2024-08-03 15:32:53] [Rank 0] totoal_tokens=75982, outputs='Nell Carter'
[2024-08-03 15:32:53] [Rank 1] totoal_tokens=76883, outputs='Mark'
[2024-08-03 15:32:55] [Rank 2] totoal_tokens=74855, outputs='The Memoirs of Joseph Grimaldi'
[2024-08-03 15:32:56] [Rank 3] totoal_tokens=76473, outputs='France'
[2024-08-03 15:32:59] [Rank 1] totoal_tokens=76946, outputs='Take a picture'
[2024-08-03 15:32:59] [Rank 0] totoal_tokens=76623, outputs='Ninja'
[2024-08-03 15:33:02] [Rank 3] totoal_tokens=76943, outputs='Go to bed'
[2024-08-03 15:33:03] [Rank 2] totoal_tokens=75302, outputs='She goes to the mall'
[2024-08-03 15:33:04] [Rank 0] totoal_tokens=76895, outputs='Samsung'
[2024-08-03 15:33:05] [Rank 1] totoal_tokens=77050, outputs='North America'
[2024-08-03 15:33:08] [Rank 3] totoal_tokens=77058, outputs='Jadon Sancho'
[2024-08-03 15:33:09] [Rank 2] totoal_tokens=75640, outputs='I'
[2024-08-03 15:33:10] [Rank 0] totoal_tokens=76949, outputs='The student goes to bed.'
[2024-08-03 15:33:10] [Rank 1] totoal_tokens=77104, outputs='Earth'
[2024-08-03 15:33:14] [Rank 3] totoal_tokens=77264, outputs='Terry'
[2024-08-03 15:33:14] [Rank 2] totoal_tokens=75694, outputs='Bhoomi'
[2024-08-03 15:33:16] [Rank 0] totoal_tokens=77044, outputs='The owl'
[2024-08-03 15:33:17] [Rank 1] totoal_tokens=77368, outputs='Pumpkin'
[2024-08-03 15:33:19] [Rank 3] totoal_tokens=77459, outputs='Europe'
[2024-08-03 15:33:20] [Rank 2] totoal_tokens=75805, outputs='August 2021'
[2024-08-03 15:33:23] [Rank 0] totoal_tokens=77130, outputs='The moon'
[2024-08-03 15:33:24] [Rank 1] totoal_tokens=77391, outputs='Steve Smith'
[2024-08-03 15:33:26] [Rank 3] totoal_tokens=77929, outputs='I am in the Krakow, Poland, airport'
[2024-08-03 15:33:28] [Rank 2] totoal_tokens=76990, outputs='Durham'
[2024-08-03 15:33:29] [Rank 0] totoal_tokens=77805, outputs='The Plaza'
[2024-08-03 15:33:29] [Rank 1] totoal_tokens=77737, outputs='Jill Scott'
[2024-08-03 15:33:32] [Rank 3] totoal_tokens=78290, outputs='Dennis'
[2024-08-03 15:33:34] [Rank 0] totoal_tokens=77837, outputs='Go to bed'
/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 98%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š| 738/751 [19:33<01:14, 5.73s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 98%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š| 739/751 [19:38<01:07, 5.60s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 99%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š| 740/751 [19:44<01:03, 5.74s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 99%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š| 741/751 [19:50<00:55, 5.58s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 99%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 742/751 [19:55<00:50, 5.57s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 99%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 743/751 [20:01<00:46, 5.79s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 99%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 744/751 [20:08<00:42, 6.11s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 99%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 745/751 [20:14<00:35, 5.95s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: [2024-08-03 15:33:34] [Rank 2] totoal_tokens=77166, outputs='The Lone Ranger'
[2024-08-03 15:33:36] [Rank 1] totoal_tokens=77902, outputs='The Vitruvian Man'
[2024-08-03 15:33:38] [Rank 3] totoal_tokens=78997, outputs='F1'
[2024-08-03 15:33:40] [Rank 2] totoal_tokens=77464, outputs='Europe'
[2024-08-03 15:33:40] [Rank 0] totoal_tokens=77881, outputs='yallapapi'
[2024-08-03 15:33:42] [Rank 1] totoal_tokens=77934, outputs='The cake'
[2024-08-03 15:33:45] [Rank 3] totoal_tokens=79267, outputs='The mother shop is a shop that sells clothing.'
[2024-08-03 15:33:45] [Rank 2] totoal_tokens=77672, outputs='Candy'
[2024-08-03 15:33:47] [Rank 0] totoal_tokens=78696, outputs='Baylor College of Medicine'
[2024-08-03 15:33:47] [Rank 1] totoal_tokens=78436, outputs='The student goes to class.'
[2024-08-03 15:33:52] [Rank 3] totoal_tokens=79326, outputs='La Fin du Monde'
[2024-08-03 15:33:53] [Rank 2] totoal_tokens=77792, outputs="I'm not sure they'll beat City; if I'm honest, I think"
[2024-08-03 15:33:53] [Rank 0] totoal_tokens=79132, outputs='Sony'
[2024-08-03 15:33:53] [Rank 1] totoal_tokens=78524, outputs='I’m yet to download the workflow. If I am to use that idea'
[2024-08-03 15:33:58] [Rank 3] totoal_tokens=79363, outputs='SONAR 8'
[2024-08-03 15:33:58] [Rank 2] totoal_tokens=77984, outputs='Gardens'
[2024-08-03 15:34:00] [Rank 0] totoal_tokens=79675, outputs="I'm not sure"
[2024-08-03 15:34:00] [Rank 1] totoal_tokens=78528, outputs='North America'
[2024-08-03 15:34:05] [Rank 2] totoal_tokens=78992, outputs='Paintings'
[2024-08-03 15:34:05] [Rank 3] totoal_tokens=79411, outputs='The Sudanese government'
[2024-08-03 15:34:05] Rank 3 Finish
[2024-08-03 15:34:06] [Rank 0] totoal_tokens=79826, outputs="I'm not sure"
99%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 746/751 [20:20<00:29, 5.89s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 99%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 747/751 [20:26<00:23, 5.95s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 748/751 [20:32<00:18, 6.13s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 749/751 [20:38<00:12, 6.10s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰| 750/751 [20:45<00:06, 6.33s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 751/751 [20:51<00:00, 6.25s/it] Processing InternVL2-2B_reasoning-text-test.jsonl: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 751/751 [20:51<00:00, 1.67s/it]
[2024-08-03 15:34:06] Rank 0 Finish
[2024-08-03 15:34:07] [Rank 1] totoal_tokens=79601, outputs='Growing'
[2024-08-03 15:34:11] [Rank 2] totoal_tokens=79026, outputs='January 1st'
[2024-08-03 15:34:15] [Rank 1] totoal_tokens=79796, outputs='Liam was a fan of the band'
[2024-08-03 15:34:19] [Rank 2] totoal_tokens=79447, outputs='Ghana'
[2024-08-03 15:34:21] [Rank 1] totoal_tokens=79821, outputs='Duck'
[2024-08-03 15:34:21] Rank 1 Finish
[2024-08-03 15:34:26] [Rank 2] totoal_tokens=79718, outputs='decaying'
[2024-08-03 15:34:34] [Rank 2] totoal_tokens=79764, outputs='Sterling'
[2024-08-03 15:34:40] [Rank 2] totoal_tokens=79915, outputs='The Great Depression'
[2024-08-03 15:34:40] Rank 2 Finish
cat work_dirs/share_internvl/InternVL2-2B/eval_mm_niah/reasoning-text-test/temp_InternVL2-2B_reasoning-text-test/* > work_dirs/share_internvl/InternVL2-2B/eval_mm_niah/reasoning-text-test/InternVL2-2B_reasoning-text-test.jsonl
cat work_dirs/share_internvl/InternVL2-2B/eval_mm_niah/reasoning-text-test/temp_InternVL2-2B_reasoning-text-test/* > work_dirs/share_internvl/InternVL2-2B/eval_mm_niah/reasoning-text-test/InternVL2-2B_reasoning-text-test.jsonl
cat work_dirs/share_internvl/InternVL2-2B/eval_mm_niah/reasoning-text-test/temp_InternVL2-2B_reasoning-text-test/* > work_dirs/share_internvl/InternVL2-2B/eval_mm_niah/reasoning-text-test/InternVL2-2B_reasoning-text-test.jsonl
cat work_dirs/share_internvl/InternVL2-2B/eval_mm_niah/reasoning-text-test/temp_InternVL2-2B_reasoning-text-test/* > work_dirs/share_internvl/InternVL2-2B/eval_mm_niah/reasoning-text-test/InternVL2-2B_reasoning-text-test.jsonl
python eval/mm_niah/calculate_scores.py --outputs-dir work_dirs/share_internvl/InternVL2-2B/eval_mm_niah/reasoning-text-test
python eval/mm_niah/calculate_scores.py --outputs-dir work_dirs/share_internvl/InternVL2-2B/eval_mm_niah/reasoning-text-test
python eval/mm_niah/calculate_scores.py --outputs-dir work_dirs/share_internvl/InternVL2-2B/eval_mm_niah/reasoning-text-test
python eval/mm_niah/calculate_scores.py --outputs-dir work_dirs/share_internvl/InternVL2-2B/eval_mm_niah/reasoning-text-test
[Warning] Since len(res)=1 is not equal to 6, the overall score will be ignored. Please ensure that you correctly organize the directory structure.
results on test split of InternVL2-2B are save in work_dirs/share_internvl/InternVL2-2B/eval_mm_niah/reasoning-text-test/results/InternVL2-2B/scores_test.json
[Warning] Since len(res)=1 is not equal to 6, the overall score will be ignored. Please ensure that you correctly organize the directory structure.
results on test split of InternVL2-2B are save in work_dirs/share_internvl/InternVL2-2B/eval_mm_niah/reasoning-text-test/results/InternVL2-2B/scores_test.json
[Warning] Since len(res)=1 is not equal to 6, the overall score will be ignored. Please ensure that you correctly organize the directory structure.
results on test split of InternVL2-2B are save in work_dirs/share_internvl/InternVL2-2B/eval_mm_niah/reasoning-text-test/results/InternVL2-2B/scores_test.json
[Warning] Since len(res)=1 is not equal to 6, the overall score will be ignored. Please ensure that you correctly organize the directory structure.
results on test split of InternVL2-2B are save in work_dirs/share_internvl/InternVL2-2B/eval_mm_niah/reasoning-text-test/results/InternVL2-2B/scores_test.json