INFO 10-26 08:02:52 [__init__.py:235] Automatically detected platform cuda.
[2025-10-26 08:02:53,805] [ INFO]: --- INIT SEEDS --- (pipeline.py:249)
[2025-10-26 08:02:53,806] [ INFO]: --- LOADING TASKS --- (pipeline.py:210)
[2025-10-26 08:02:53,808] [ INFO]: Found 1 custom tasks in /mnt/public/wucanhui/lighteval/src/lighteval/tasks/extended/ifeval/main.py (registry.py:260)
[2025-10-26 08:02:53,809] [ INFO]: Found 2 custom tasks in /mnt/public/wucanhui/lighteval/src/lighteval/tasks/extended/ifbench/main.py (registry.py:260)
[2025-10-26 08:02:53,810] [ INFO]: Found 6 custom tasks in /mnt/public/wucanhui/lighteval/src/lighteval/tasks/extended/tiny_benchmarks/main.py (registry.py:260)
[2025-10-26 08:02:53,811] [ INFO]: Found 1 custom tasks in /mnt/public/wucanhui/lighteval/src/lighteval/tasks/extended/mt_bench/main.py (registry.py:260)
[2025-10-26 08:02:53,812] [ INFO]: Found 4 custom tasks in /mnt/public/wucanhui/lighteval/src/lighteval/tasks/extended/mix_eval/main.py (registry.py:260)
[2025-10-26 08:02:53,813] [ INFO]: Found 5 custom tasks in /mnt/public/wucanhui/lighteval/src/lighteval/tasks/extended/olympiade_bench/main.py (registry.py:260)
[2025-10-26 08:02:53,814] [ INFO]: Found 1 custom tasks in /mnt/public/wucanhui/lighteval/src/lighteval/tasks/extended/hle/main.py (registry.py:260)
[2025-10-26 08:02:53,815] [ INFO]: Found 23 custom tasks in /mnt/public/wucanhui/lighteval/src/lighteval/tasks/extended/lcb/main.py (registry.py:260)
[2025-10-26 08:02:53,817] [ WARNING]: Careful, the task lcb:codegeneration_v6 is using evaluation data to build the few shot examples. (lighteval_task.py:269)
[2025-10-26 08:03:00,696] [ INFO]: --- LOADING MODEL --- (pipeline.py:177)
`torch_dtype` is deprecated! Use `dtype` instead!
[2025-10-26 08:03:06,813] [ INFO]: Using max model len 32768 (config.py:1604)
[2025-10-26 08:03:07,320] [ INFO]: Chunked prefill is enabled with max_num_batched_tokens=2048. (config.py:2434)
INFO 10-26 08:03:11 [__init__.py:235] Automatically detected platform cuda.
INFO 10-26 08:03:13 [core.py:572] Waiting for init message from front-end.
INFO 10-26 08:03:13 [core.py:71] Initializing a V1 LLM engine (v0.10.0) with config: model='/mnt/public/wucanhui/outputs/Qwen3-4B-math-reasoning/checkpoint-2562', speculative_config=None, tokenizer='/mnt/public/wucanhui/outputs/Qwen3-4B-math-reasoning/checkpoint-2562', skip_tokenizer_init=False, tokenizer_mode=auto, revision=main, override_neuron_config={}, tokenizer_revision=main, trust_remote_code=False, dtype=torch.bfloat16, max_seq_len=32768, download_dir=None, load_format=LoadFormat.AUTO, tensor_parallel_size=1, pipeline_parallel_size=1, disable_custom_all_reduce=False, quantization=None, enforce_eager=True, kv_cache_dtype=auto, device_config=cuda, decoding_config=DecodingConfig(backend='auto', disable_fallback=False, disable_any_whitespace=False, disable_additional_properties=False, reasoning_backend=''), observability_config=ObservabilityConfig(show_hidden_metrics_for_version=None, otlp_traces_endpoint=None, collect_detailed_traces=None), seed=1234, served_model_name=/mnt/public/wucanhui/outputs/Qwen3-4B-math-reasoning/checkpoint-2562, num_scheduler_steps=1, multi_step_stream_outputs=True, enable_prefix_caching=True, chunked_prefill_enabled=True, use_async_output_proc=True, pooler_config=None, compilation_config={"level":0,"debug_dump_path":"","cache_dir":"","backend":"","custom_ops":[],"splitting_ops":[],"use_inductor":true,"compile_sizes":[],"inductor_compile_config":{"enable_auto_functionalized_v2":false},"inductor_passes":{},"use_cudagraph":true,"cudagraph_num_of_warmups":0,"cudagraph_capture_sizes":[],"cudagraph_copy_inputs":false,"full_cuda_graph":false,"max_capture_size":0,"local_cache_dir":null}
INFO 10-26 08:03:17 [parallel_state.py:1102] rank 0 in world size 1 is assigned as DP rank 0, PP rank 0, TP rank 0, EP rank 0
WARNING 10-26 08:03:17 [topk_topp_sampler.py:59] FlashInfer is not available. Falling back to the PyTorch-native implementation of top-p & top-k sampling. For the best performance, please install FlashInfer.
INFO 10-26 08:03:17 [gpu_model_runner.py:1843] Starting to load model /mnt/public/wucanhui/outputs/Qwen3-4B-math-reasoning/checkpoint-2562...
INFO 10-26 08:03:17 [gpu_model_runner.py:1875] Loading model from scratch...
INFO 10-26 08:03:17 [cuda.py:290] Using Flash Attention backend on V1 engine.
Loading safetensors checkpoint shards: 0% Completed | 0/2 [00:00
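Note on the "`torch_dtype` is deprecated! Use `dtype` instead!" warning above: recent transformers releases renamed the `torch_dtype` argument of `from_pretrained` to `dtype`. If this log comes from code that still passes `torch_dtype`, a minimal shim can rename the kwarg before forwarding it (the helper name `migrate_dtype_kwarg` is hypothetical, not part of any library):

```python
def migrate_dtype_kwarg(kwargs: dict) -> dict:
    """Return a copy of kwargs with the deprecated `torch_dtype`
    key renamed to `dtype`, keeping an explicit `dtype` if both
    are present (the newer key wins)."""
    out = dict(kwargs)
    if "torch_dtype" in out:
        value = out.pop("torch_dtype")
        out.setdefault("dtype", value)  # only set if `dtype` absent
    return out

# Example: kwargs destined for AutoModelForCausalLM.from_pretrained(...)
cleaned = migrate_dtype_kwarg({"torch_dtype": "bfloat16"})
# cleaned == {"dtype": "bfloat16"}
```

Passing the cleaned kwargs silences the deprecation warning without changing the loaded dtype.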