mansaripo committed · Commit 5de1d42 · verified · 1 Parent(s): 954e44f

Delete lm_eval/test_eval.log

Files changed (1)
  1. lm_eval/test_eval.log +0 -94
lm_eval/test_eval.log DELETED
@@ -1,94 +0,0 @@
1
- The following values were not passed to `accelerate launch` and had defaults used instead:
2
- `--num_processes` was set to a value of `2`
3
- More than one GPU was found, enabling multi-GPU training.
4
- If this was unintended please pass in `--num_processes=1`.
5
- `--num_machines` was set to a value of `1`
6
- `--mixed_precision` was set to a value of `'no'`
7
- `--dynamo_backend` was set to a value of `'no'`
8
- To avoid this warning pass in values for each of the problematic parameters or run `accelerate config`.
9
- 2026-03-19:14:15:25 INFO [_cli.run:375] Including path: ./
10
- 2026-03-19:14:15:25 INFO [_cli.run:376] Selected Tasks: ['arc_easy_mi', 'arc_challenge_mi', 'hellaswag', 'piqa']
11
- 2026-03-19:14:15:25 INFO [evaluator:211] Setting random seed to 0 | Setting numpy seed to 1234 | Setting torch manual seed to 1234 | Setting fewshot manual seed to 1234
12
- 2026-03-19:14:15:25 INFO [evaluator:236] Initializing cloverlm model, with arguments: {'pretrained': 'daslab-testing/CloverLM', 'dtype': 'bfloat16', 'quartet_2_impl': 'quartet2', 'attn_backend': 'pytorch', 'trust_remote_code': True}
13
- 2026-03-19:14:15:25 INFO [_cli.run:375] Including path: ./
14
- 2026-03-19:14:15:25 INFO [_cli.run:376] Selected Tasks: ['arc_easy_mi', 'arc_challenge_mi', 'hellaswag', 'piqa']
15
- 2026-03-19:14:15:25 INFO [evaluator:211] Setting random seed to 0 | Setting numpy seed to 1234 | Setting torch manual seed to 1234 | Setting fewshot manual seed to 1234
16
- 2026-03-19:14:15:25 INFO [evaluator:236] Initializing cloverlm model, with arguments: {'pretrained': 'daslab-testing/CloverLM', 'dtype': 'bfloat16', 'quartet_2_impl': 'quartet2', 'attn_backend': 'pytorch', 'trust_remote_code': True}
17
- 2026-03-19:14:15:26 INFO [models.huggingface:178] Using `accelerate launch` or `parallelize=True`, device 'cuda:0' will be overridden when placing model.
18
- 2026-03-19:14:15:26 INFO [models.huggingface:178] Using `accelerate launch` or `parallelize=True`, device 'cuda:0' will be overridden when placing model.
19
- 2026-03-19:14:15:26 INFO [models.huggingface:548] Model type cannot be determined. Using default model type 'causal'
20
- 2026-03-19:14:15:26 INFO [models.huggingface:548] Model type cannot be determined. Using default model type 'causal'
21
- 2026-03-19:14:15:28 INFO [models.huggingface:423] Model parallel was set to False, max memory was not set, and device map was set to {'': 'cuda:1'}
22
- 2026-03-19:14:15:28 INFO [models.huggingface:423] Model parallel was set to False, max memory was not set, and device map was set to {'': 'cuda:0'}
23
-
24
-
25
- The tied weights mapping and config for this model specifies to tie transformer.emb.weight to transformer.linear.weight, but both are present in the checkpoints, so we will NOT tie them. You should update the config with `tie_word_embeddings=False` to silence this warning
26
- The tied weights mapping and config for this model specifies to tie transformer.emb.weight to transformer.linear.weight, but both are present in the checkpoints, so we will NOT tie them. You should update the config with `tie_word_embeddings=False` to silence this warning
27
- 2026-03-19:14:15:45 INFO [tasks:700] Selected tasks:
28
- 2026-03-19:14:15:45 INFO [tasks:691] Task: piqa (.venv/lib/python3.11/site-packages/lm_eval/tasks/piqa/piqa.yaml)
29
- 2026-03-19:14:15:45 INFO [tasks:691] Task: hellaswag (.venv/lib/python3.11/site-packages/lm_eval/tasks/hellaswag/hellaswag.yaml)
30
- 2026-03-19:14:15:45 INFO [tasks:691] Task: arc_challenge_mi (arc_challenge.yaml)
31
- 2026-03-19:14:15:45 INFO [tasks:691] Task: arc_easy_mi (arc_easy_mi.yaml)
32
- 2026-03-19:14:15:45 WARNING [evaluator:333] Overwriting default num_fewshot of piqa from None to 0
33
- 2026-03-19:14:15:45 WARNING [evaluator:333] Overwriting default num_fewshot of hellaswag from None to 0
34
- 2026-03-19:14:15:45 WARNING [evaluator:333] Overwriting default num_fewshot of arc_challenge_mi from None to 0
35
- 2026-03-19:14:15:45 WARNING [evaluator:333] Overwriting default num_fewshot of arc_easy_mi from None to 0
36
- 2026-03-19:14:15:45 INFO [api.task:311] Building contexts for piqa on rank 0...
37
-
38
  0%| | 0/919 [00:00<?, ?it/s]
39
  10%|β–‰ | 91/919 [00:00<00:00, 900.62it/s]2026-03-19:14:15:45 INFO [tasks:700] Selected tasks:
40
- 2026-03-19:14:15:45 INFO [tasks:691] Task: piqa (.venv/lib/python3.11/site-packages/lm_eval/tasks/piqa/piqa.yaml)
41
- 2026-03-19:14:15:45 INFO [tasks:691] Task: hellaswag (.venv/lib/python3.11/site-packages/lm_eval/tasks/hellaswag/hellaswag.yaml)
42
- 2026-03-19:14:15:45 INFO [tasks:691] Task: arc_challenge_mi (arc_challenge.yaml)
43
- 2026-03-19:14:15:45 INFO [tasks:691] Task: arc_easy_mi (arc_easy_mi.yaml)
44
- 2026-03-19:14:15:45 WARNING [evaluator:333] Overwriting default num_fewshot of piqa from None to 0
45
- 2026-03-19:14:15:45 WARNING [evaluator:333] Overwriting default num_fewshot of hellaswag from None to 0
46
- 2026-03-19:14:15:45 WARNING [evaluator:333] Overwriting default num_fewshot of arc_challenge_mi from None to 0
47
- 2026-03-19:14:15:45 WARNING [evaluator:333] Overwriting default num_fewshot of arc_easy_mi from None to 0
48
- 2026-03-19:14:15:45 INFO [api.task:311] Building contexts for piqa on rank 1...
49
-
50
  20%|β–ˆβ–‰ | 182/919 [00:00<00:00, 905.61it/s]
51
  0%| | 0/919 [00:00<?, ?it/s]
52
  30%|β–ˆβ–ˆβ–‰ | 274/919 [00:00<00:00, 910.65it/s]
53
  10%|β–‰ | 91/919 [00:00<00:00, 907.36it/s]
54
  40%|β–ˆβ–ˆβ–ˆβ–‰ | 366/919 [00:00<00:00, 911.36it/s]
55
  20%|β–ˆβ–‰ | 183/919 [00:00<00:00, 911.26it/s]
56
  50%|β–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 458/919 [00:00<00:00, 911.45it/s]
57
  30%|β–ˆβ–ˆβ–‰ | 275/919 [00:00<00:00, 915.08it/s]
58
  60%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 550/919 [00:00<00:00, 911.49it/s]
59
  40%|β–ˆβ–ˆβ–ˆβ–‰ | 367/919 [00:00<00:00, 915.11it/s]
60
  70%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 642/919 [00:00<00:00, 912.08it/s]
61
  50%|β–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 459/919 [00:00<00:00, 916.54it/s]
62
  80%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 734/919 [00:00<00:00, 911.69it/s]
63
  60%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 551/919 [00:00<00:00, 914.63it/s]
64
  90%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 826/919 [00:00<00:00, 912.45it/s]
65
  70%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 643/919 [00:00<00:00, 914.29it/s]
66
-
67
  80%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 735/919 [00:00<00:00, 912.73it/s]
68
  90%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 827/919 [00:00<00:00, 913.03it/s]
69
- 2026-03-19:14:15:48 INFO [api.task:311] Building contexts for hellaswag on rank 0...
70
- 2026-03-19:14:15:48 INFO [api.task:311] Building contexts for hellaswag on rank 1...
71
-
72
  0%| | 0/5021 [00:00<?, ?it/s]
73
  6%|β–Œ | 278/5021 [00:00<00:01, 2773.97it/s]
74
  11%|β–ˆ | 564/5021 [00:00<00:01, 2824.23it/s]
75
  0%| | 0/5021 [00:00<?, ?it/s]
76
  17%|β–ˆβ–‹ | 852/5021 [00:00<00:01, 2845.58it/s]
77
  4%|β–Ž | 178/5021 [00:00<00:02, 1779.16it/s]
78
  23%|β–ˆβ–ˆβ–Ž | 1140/5021 [00:00<00:01, 2855.94it/s]
79
  7%|β–‹ | 362/5021 [00:00<00:02, 1813.26it/s]
80
  28%|β–ˆβ–ˆβ–Š | 1427/5021 [00:00<00:01, 2860.60it/s]
81
  11%|β–ˆ | 547/5021 [00:00<00:02, 1826.42it/s]
82
  34%|β–ˆβ–ˆβ–ˆβ– | 1714/5021 [00:00<00:01, 2863.10it/s]
83
  15%|β–ˆβ– | 730/5021 [00:00<00:02, 1826.29it/s]
84
  40%|β–ˆβ–ˆβ–ˆβ–‰ | 2001/5021 [00:00<00:01, 2863.11it/s]
85
  18%|β–ˆβ–Š | 913/5021 [00:00<00:02, 1824.48it/s]
86
  46%|β–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 2289/5021 [00:00<00:00, 2868.32it/s]
87
  22%|β–ˆβ–ˆβ– | 1096/5021 [00:00<00:02, 1819.99it/s]
88
  51%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 2577/5021 [00:00<00:00, 2871.27it/s]
89
  25%|β–ˆβ–ˆβ–Œ | 1279/5021 [00:00<00:02, 1815.56it/s]
90
  29%|β–ˆβ–ˆβ–‰ | 1461/5021 [00:00<00:01, 1802.42it/s]
91
  57%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 2865/5021 [00:01<00:01, 1693.34it/s]
92
  63%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 3149/5021 [00:01<00:00, 1928.96it/s]
93
  68%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 3433/5021 [00:01<00:00, 2135.30it/s]
94
  33%|β–ˆβ–ˆβ–ˆβ–Ž | 1642/5021 [00:01<00:03, 1001.41it/s]
95
  74%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 3718/5021 [00:01<00:00, 2308.62it/s]
96
  36%|β–ˆβ–ˆβ–ˆβ–‹ | 1823/5021 [00:01<00:02, 1159.85it/s]
97
  80%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 4004/5021 [00:01<00:00, 2449.72it/s]
98
  40%|β–ˆβ–ˆβ–ˆβ–‰ | 2002/5021 [00:01<00:02, 1297.94it/s]
99
  85%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 4290/5021 [00:01<00:00, 2558.71it/s]
100
  43%|β–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 2183/5021 [00:01<00:02, 1418.79it/s]
101
  91%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 4577/5021 [00:01<00:00, 2643.21it/s]
102
  47%|β–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 2364/5021 [00:01<00:01, 1516.12it/s]
103
  97%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹| 4864/5021 [00:01<00:00, 2705.33it/s]
104
  51%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 2544/5021 [00:01<00:01, 1591.38it/s]
105
-
106
  54%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 2725/5021 [00:01<00:01, 1650.39it/s]
107
  58%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 2905/5021 [00:01<00:01, 1692.57it/s]
108
  61%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 3085/5021 [00:01<00:01, 1722.37it/s]
109
  65%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 3266/5021 [00:02<00:01, 1745.32it/s]
110
  69%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 3447/5021 [00:02<00:00, 1762.66it/s]
111
  72%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 3627/5021 [00:02<00:00, 1772.66it/s]
112
  76%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 3807/5021 [00:02<00:00, 1779.65it/s]
113
  79%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 3987/5021 [00:02<00:00, 1781.31it/s]
114
  83%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 4168/5021 [00:02<00:00, 1787.31it/s]
115
  87%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 4349/5021 [00:02<00:00, 1792.50it/s]
116
  90%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 4529/5021 [00:02<00:00, 1790.62it/s]
117
  94%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 4709/5021 [00:02<00:00, 1790.61it/s]
118
  97%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹| 4890/5021 [00:02<00:00, 1793.65it/s]
119
- 2026-03-19:14:15:52 INFO [api.task:311] Building contexts for arc_challenge_mi on rank 1...
120
- 2026-03-19:14:15:52 INFO [api.task:311] Building contexts for arc_challenge_mi on rank 0...
121
-
122
  0%| | 0/586 [00:00<?, ?it/s]
123
  0%| | 0/586 [00:00<?, ?it/s]
124
  17%|β–ˆβ–‹ | 98/586 [00:00<00:00, 971.82it/s]
125
  11%|β–ˆ | 62/586 [00:00<00:00, 613.37it/s]
126
  34%|β–ˆβ–ˆβ–ˆβ–Ž | 197/586 [00:00<00:00, 976.00it/s]
127
  21%|β–ˆβ–ˆβ– | 125/586 [00:00<00:00, 619.71it/s]
128
  51%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 297/586 [00:00<00:00, 985.47it/s]
129
  32%|β–ˆβ–ˆβ–ˆβ– | 189/586 [00:00<00:00, 624.63it/s]
130
  68%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š | 396/586 [00:00<00:00, 986.66it/s]
131
  43%|β–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 253/586 [00:00<00:00, 627.18it/s]
132
  85%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 496/586 [00:00<00:00, 989.81it/s]
133
  54%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 317/586 [00:00<00:00, 628.58it/s]
134
-
135
  65%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 381/586 [00:00<00:00, 629.34it/s]
136
  76%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 445/586 [00:00<00:00, 630.51it/s]
137
  87%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 509/586 [00:00<00:00, 632.25it/s]
138
  98%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Š| 573/586 [00:00<00:00, 632.82it/s]
139
- 2026-03-19:14:15:53 INFO [api.task:311] Building contexts for arc_easy_mi on rank 0...
140
- 2026-03-19:14:15:53 INFO [api.task:311] Building contexts for arc_easy_mi on rank 1...
141
-
142
  0%| | 0/1188 [00:00<?, ?it/s]
143
  0%| | 0/1188 [00:00<?, ?it/s]
144
  8%|β–Š | 100/1188 [00:00<00:01, 993.05it/s]
145
  5%|β–Œ | 63/1188 [00:00<00:01, 626.46it/s]
146
  17%|β–ˆβ–‹ | 200/1188 [00:00<00:00, 991.06it/s]
147
  11%|β–ˆ | 127/1188 [00:00<00:01, 629.71it/s]
148
  25%|β–ˆβ–ˆβ–Œ | 300/1188 [00:00<00:00, 994.71it/s]
149
  16%|β–ˆβ–Œ | 191/1188 [00:00<00:01, 630.93it/s]
150
  34%|β–ˆβ–ˆβ–ˆβ–Ž | 400/1188 [00:00<00:00, 993.35it/s]
151
  21%|β–ˆβ–ˆβ– | 255/1188 [00:00<00:01, 633.00it/s]
152
  42%|β–ˆβ–ˆβ–ˆβ–ˆβ– | 500/1188 [00:00<00:00, 993.59it/s]
153
  27%|β–ˆβ–ˆβ–‹ | 319/1188 [00:00<00:01, 634.81it/s]
154
  51%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 600/1188 [00:00<00:00, 993.86it/s]
155
  32%|β–ˆβ–ˆβ–ˆβ– | 383/1188 [00:00<00:01, 636.25it/s]
156
  59%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 700/1188 [00:00<00:00, 992.08it/s]
157
  38%|β–ˆβ–ˆβ–ˆβ–Š | 447/1188 [00:00<00:01, 636.11it/s]
158
  67%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹ | 800/1188 [00:00<00:00, 988.12it/s]
159
  43%|β–ˆβ–ˆβ–ˆβ–ˆβ–Ž | 511/1188 [00:00<00:01, 634.75it/s]
160
  76%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 899/1188 [00:00<00:00, 988.66it/s]
161
  48%|β–ˆβ–ˆβ–ˆβ–ˆβ–Š | 575/1188 [00:00<00:00, 634.67it/s]
162
  84%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 999/1188 [00:01<00:00, 991.15it/s]
163
  54%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 639/1188 [00:01<00:00, 633.38it/s]
164
  93%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Ž| 1099/1188 [00:01<00:00, 993.44it/s]
165
  59%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‰ | 703/1188 [00:01<00:00, 633.82it/s]
166
-
167
  65%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ– | 768/1188 [00:01<00:00, 635.72it/s]
168
  70%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 832/1188 [00:01<00:00, 636.49it/s]
169
  75%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 896/1188 [00:01<00:00, 632.00it/s]
170
  81%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ | 960/1188 [00:01<00:00, 628.76it/s]
171
  86%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–Œ | 1023/1188 [00:01<00:00, 624.90it/s]
172
  91%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–| 1086/1188 [00:01<00:00, 625.37it/s]
173
  97%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–‹| 1149/1188 [00:01<00:00, 625.85it/s]
174
- 2026-03-19:14:15:55 INFO [evaluator:584] Running loglikelihood requests
175
- 2026-03-19:14:15:55 INFO [evaluator:584] Running loglikelihood requests
176
-
177
- Passed argument batch_size = auto:1. Detecting largest batch size
178
- Determined largest batch size: 64
179
- Determined largest batch size: 64
180
-
181
- [rank1]:W0319 14:20:19.946000 1458598 .venv/lib/python3.11/site-packages/torch/_dynamo/convert_frame.py:1676] [0/8] function: 'abs_max' (/home/matin/convert_dir/CloverLM/lm_eval/.venv/lib/python3.11/site-packages/quartet2/linear.py:147)
182
- [rank1]:W0319 14:20:19.946000 1458598 .venv/lib/python3.11/site-packages/torch/_dynamo/convert_frame.py:1676] [0/8] last reason: 0/5: tensor 'x' requires_grad mismatch. expected requires_grad=1
183
- [rank1]:W0319 14:20:19.946000 1458598 .venv/lib/python3.11/site-packages/torch/_dynamo/convert_frame.py:1676] [0/8] To log all recompilation reasons, use TORCH_LOGS="recompiles".
184
- [rank1]:W0319 14:20:19.946000 1458598 .venv/lib/python3.11/site-packages/torch/_dynamo/convert_frame.py:1676] [0/8] To diagnose recompilation issues, see https://pytorch.org/docs/main/compile/programming_model.recompilation.html
185
-
186
- [rank0]:W0319 14:20:22.844000 1458597 .venv/lib/python3.11/site-packages/torch/_dynamo/convert_frame.py:1676] [0/8] function: 'abs_max' (/home/matin/convert_dir/CloverLM/lm_eval/.venv/lib/python3.11/site-packages/quartet2/linear.py:147)
187
- [rank0]:W0319 14:20:22.844000 1458597 .venv/lib/python3.11/site-packages/torch/_dynamo/convert_frame.py:1676] [0/8] last reason: 0/5: tensor 'x' requires_grad mismatch. expected requires_grad=1
188
- [rank0]:W0319 14:20:22.844000 1458597 .venv/lib/python3.11/site-packages/torch/_dynamo/convert_frame.py:1676] [0/8] To log all recompilation reasons, use TORCH_LOGS="recompiles".
189
- [rank0]:W0319 14:20:22.844000 1458597 .venv/lib/python3.11/site-packages/torch/_dynamo/convert_frame.py:1676] [0/8] To diagnose recompilation issues, see https://pytorch.org/docs/main/compile/programming_model.recompilation.html
190
-
191
- fatal: not a git repository (or any of the parent directories): .git
192
- 2026-03-19:14:20:29 INFO [loggers.evaluation_tracker:316] Output path not provided, skipping saving results aggregated
193
- cloverlm ({'pretrained': 'daslab-testing/CloverLM', 'dtype': 'bfloat16', 'quartet_2_impl': 'quartet2', 'attn_backend': 'pytorch'}), gen_kwargs: ({}), limit: None, num_fewshot: 0, batch_size: auto (64)
- |     Tasks      |Version|Filter|n-shot|    Metric     |   |Value |   |Stderr|
- |----------------|------:|------|-----:|---------------|---|-----:|---|-----:|
- |arc_challenge_mi|      1|none  |     0|acc            |↑  |0.4642|±  |0.0146|
- |                |       |none  |     0|acc_mutual_info|↑  |0.5017|±  |0.0146|
- |                |       |none  |     0|acc_norm       |↑  |0.4940|±  |0.0146|
- |arc_easy_mi     |      1|none  |     0|acc            |↑  |0.8005|±  |0.0082|
- |                |       |none  |     0|acc_mutual_info|↑  |0.7193|±  |0.0092|
- |                |       |none  |     0|acc_norm       |↑  |0.7740|±  |0.0086|
- |hellaswag       |      1|none  |     0|acc            |↑  |0.5392|±  |0.0050|
- |                |       |none  |     0|acc_norm       |↑  |0.7169|±  |0.0045|
- |piqa            |      1|none  |     0|acc            |↑  |0.7911|±  |0.0095|
- |                |       |none  |     0|acc_norm       |↑  |0.8090|±  |0.0092|
206
-
207
- [rank0]:[W319 14:20:30.213375773 ProcessGroupNCCL.cpp:1553] Warning: WARNING: destroy_process_group() was not called before program exit, which can leak resources. For more info, please see https://pytorch.org/docs/stable/distributed.html#shutdown (function operator())
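Note on the Stderr column of the deleted log's results table: the values are consistent with the standard error of a mean of 0/1 outcomes. The sketch below is a sanity check under two assumptions that the log does not state outright: that each of the two ranks built contexts for half of the documents (so the totals are twice the per-rank counts 919, 5021, 586 and 1188), and that the sample (n - 1) form of the formula applies. `binomial_stderr` and `RESULTS` are hypothetical names used only for illustration, not lm_eval identifiers.

```python
import math

# (reported acc, assumed total examples) per task.
# The totals are an assumption: 2 ranks x the per-rank progress-bar counts in the log.
RESULTS = {
    "piqa": (0.7911, 2 * 919),
    "hellaswag": (0.5392, 2 * 5021),
    "arc_challenge_mi": (0.4642, 2 * 586),
    "arc_easy_mi": (0.8005, 2 * 1188),
}


def binomial_stderr(acc: float, n: int) -> float:
    """Standard error of the mean of n 0/1 outcomes, using the sample (n - 1) variance."""
    return math.sqrt(acc * (1.0 - acc) / (n - 1))


if __name__ == "__main__":
    for task, (acc, n) in RESULTS.items():
        print(f"{task}: acc={acc:.4f} n={n} stderr={binomial_stderr(acc, n):.4f}")
```

Under these assumptions the computed values agree with the reported `acc` stderr for all four tasks to four decimal places. The same formula can be applied to the `acc_norm` and `acc_mutual_info` rows.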