studyOverflow commited on Jan 26

Commit

7ddcafe

verified ·

1 Parent(s): 73048bf

Add files using upload-large-folder tool

Browse files

Files changed (20) hide show

wandb/run-20260124_003511-fnfy86iu/files/config.yaml +88 -0
wandb/run-20260124_003511-fnfy86iu/files/output.log +84 -0
wandb/run-20260124_003511-fnfy86iu/files/requirements.txt +189 -0
wandb/run-20260124_003511-fnfy86iu/files/wandb-metadata.json +96 -0
wandb/run-20260124_003511-fnfy86iu/files/wandb-summary.json +1 -0
wandb/run-20260124_003511-fnfy86iu/logs/debug-core.log +12 -0
wandb/run-20260124_003511-fnfy86iu/logs/debug-internal.log +14 -0
wandb/run-20260124_003511-fnfy86iu/logs/debug.log +27 -0
wandb/run-20260124_022230-0y3z9z7o/files/config.yaml +87 -0
wandb/run-20260124_022230-0y3z9z7o/files/output.log +94 -0
wandb/run-20260124_022230-0y3z9z7o/files/requirements.txt +189 -0
wandb/run-20260124_022230-0y3z9z7o/files/wandb-metadata.json +96 -0
wandb/run-20260124_022230-0y3z9z7o/files/wandb-summary.json +1 -0
wandb/run-20260124_022230-0y3z9z7o/logs/debug-core.log +12 -0
wandb/run-20260124_022230-0y3z9z7o/logs/debug-internal.log +15 -0
wandb/run-20260124_022230-0y3z9z7o/logs/debug.log +27 -0
wandb/run-20260124_105101-s3i4k862/files/wandb-metadata.json +96 -0
wandb/run-20260124_105101-s3i4k862/logs/debug-core.log +12 -0
wandb/run-20260124_105101-s3i4k862/logs/debug-internal.log +14 -0
wandb/run-20260124_105101-s3i4k862/logs/debug.log +27 -0

wandb/run-20260124_003511-fnfy86iu/files/config.yaml ADDED Viewed

	@@ -0,0 +1,88 @@

+_wandb:
+    value:
+        cli_version: 0.18.5
+        m: []
+        python_version: 3.10.19
+        t:
+            "1":
+                - 1
+                - 11
+                - 41
+                - 49
+                - 55
+                - 63
+                - 71
+                - 83
+                - 98
+            "2":
+                - 1
+                - 11
+                - 41
+                - 49
+                - 55
+                - 63
+                - 71
+                - 83
+                - 98
+            "3":
+                - 13
+                - 23
+                - 55
+            "4": 3.10.19
+            "5": 0.18.5
+            "6": 4.46.1
+            "8":
+                - 5
+            "12": 0.18.5
+            "13": linux-x86_64
+allow_tf32:
+    value: true
+logdir:
+    value: logs
+mixed_precision:
+    value: bf16
+num_checkpoint_limit:
+    value: 5
+num_epochs:
+    value: 300
+pretrained:
+    value:
+        model: ./data/StableDiffusion
+        revision: main
+prompt_fn:
+    value: imagenet_animals
+resume_from:
+    value: ""
+reward_fn:
+    value: hpsv2
+run_name:
+    value: 2026.01.24_00.34.56
+sample:
+    value:
+        batch_size: 1
+        eta: 1
+        guidance_scale: 5
+        num_batches_per_epoch: 2
+        num_steps: 50
+save_freq:
+    value: 20
+seed:
+    value: 42
+train:
+    value:
+        adam_beta1: 0.9
+        adam_beta2: 0.999
+        adam_epsilon: 1e-08
+        adam_weight_decay: 0.0001
+        adv_clip_max: 5
+        batch_size: 1
+        cfg: true
+        clip_range: 0.0001
+        gradient_accumulation_steps: 1
+        learning_rate: 1e-05
+        max_grad_norm: 1
+        num_inner_epochs: 1
+        timestep_fraction: 1
+        use_8bit_adam: false
+use_lora:
+    value: false

wandb/run-20260124_003511-fnfy86iu/files/output.log ADDED Viewed

	@@ -0,0 +1,84 @@

+I0124 00:35:12.769941 130053895333696 train_g2rpo_sd_merge.py:510]
+allow_tf32: true
+logdir: logs
+mixed_precision: bf16
+num_checkpoint_limit: 5
+num_epochs: 300
+pretrained:
+  model: ./data/StableDiffusion
+  revision: main
+prompt_fn: imagenet_animals
+prompt_fn_kwargs: {}
+resume_from: ''
+reward_fn: hpsv2
+run_name: 2026.01.24_00.34.56
+sample:
+  batch_size: 1
+  eta: 1.0
+  guidance_scale: 5.0
+  num_batches_per_epoch: 2
+  num_steps: 50
+save_freq: 20
+seed: 42
+train:
+  adam_beta1: 0.9
+  adam_beta2: 0.999
+  adam_epsilon: 1.0e-08
+  adam_weight_decay: 0.0001
+  adv_clip_max: 5
+  batch_size: 1
+  cfg: true
+  clip_range: 0.0001
+  gradient_accumulation_steps: 1
+  learning_rate: 1.0e-05
+  max_grad_norm: 1.0
+  num_inner_epochs: 1
+  timestep_fraction: 1.0
+  use_8bit_adam: false
+use_lora: false
+Loading pipeline components...: 100%|████████████████████████████████████████████████████████████████████████████████████| 7/7 [00:00<00:00,  9.10it/s]
+Traceback (most recent call last):
+  File "/data1/zsj/SceneDPO/Rebuttal/E-GRPO/scoure_code/fastvideo/train_g2rpo_sd_merge.py", line 920, in <module>
+    app.run(main)
+  File "/home/zsj/anaconda3/envs/g2rpo/lib/python3.10/site-packages/absl/app.py", line 316, in run
+    _run_main(main, args)
+  File "/home/zsj/anaconda3/envs/g2rpo/lib/python3.10/site-packages/absl/app.py", line 261, in _run_main
+    sys.exit(main(argv))
+  File "/data1/zsj/SceneDPO/Rebuttal/E-GRPO/scoure_code/fastvideo/train_g2rpo_sd_merge.py", line 597, in main
+    unet, optimizer = accelerator.prepare(unet, optimizer)
+  File "/home/zsj/anaconda3/envs/g2rpo/lib/python3.10/site-packages/accelerate/accelerator.py", line 1350, in prepare
+    result = tuple(
+  File "/home/zsj/anaconda3/envs/g2rpo/lib/python3.10/site-packages/accelerate/accelerator.py", line 1351, in <genexpr>
+    self._prepare_one(obj, first_pass=True, device_placement=d) for obj, d in zip(args, device_placement)
+  File "/home/zsj/anaconda3/envs/g2rpo/lib/python3.10/site-packages/accelerate/accelerator.py", line 1226, in _prepare_one
+    return self.prepare_model(obj, device_placement=device_placement)
+  File "/home/zsj/anaconda3/envs/g2rpo/lib/python3.10/site-packages/accelerate/accelerator.py", line 1477, in prepare_model
+    model = torch.nn.parallel.DistributedDataParallel(
+  File "/home/zsj/anaconda3/envs/g2rpo/lib/python3.10/site-packages/torch/nn/parallel/distributed.py", line 858, in __init__
+    _verify_param_shape_across_processes(self.process_group, parameters)
+  File "/home/zsj/anaconda3/envs/g2rpo/lib/python3.10/site-packages/torch/distributed/utils.py", line 281, in _verify_param_shape_across_processes
+    return dist._verify_params_across_processes(process_group, tensors, logger)
+RuntimeError: DDP expects same model across all ranks, but Rank 0 has 686 params, while rank 1 has inconsistent 0 params.
+[rank0]: Traceback (most recent call last):
+[rank0]:   File "/data1/zsj/SceneDPO/Rebuttal/E-GRPO/scoure_code/fastvideo/train_g2rpo_sd_merge.py", line 920, in <module>
+[rank0]:     app.run(main)
+[rank0]:   File "/home/zsj/anaconda3/envs/g2rpo/lib/python3.10/site-packages/absl/app.py", line 316, in run
+[rank0]:     _run_main(main, args)
+[rank0]:   File "/home/zsj/anaconda3/envs/g2rpo/lib/python3.10/site-packages/absl/app.py", line 261, in _run_main
+[rank0]:     sys.exit(main(argv))
+[rank0]:   File "/data1/zsj/SceneDPO/Rebuttal/E-GRPO/scoure_code/fastvideo/train_g2rpo_sd_merge.py", line 597, in main
+[rank0]:     unet, optimizer = accelerator.prepare(unet, optimizer)
+[rank0]:   File "/home/zsj/anaconda3/envs/g2rpo/lib/python3.10/site-packages/accelerate/accelerator.py", line 1350, in prepare
+[rank0]:     result = tuple(
+[rank0]:   File "/home/zsj/anaconda3/envs/g2rpo/lib/python3.10/site-packages/accelerate/accelerator.py", line 1351, in <genexpr>
+[rank0]:     self._prepare_one(obj, first_pass=True, device_placement=d) for obj, d in zip(args, device_placement)
+[rank0]:   File "/home/zsj/anaconda3/envs/g2rpo/lib/python3.10/site-packages/accelerate/accelerator.py", line 1226, in _prepare_one
+[rank0]:     return self.prepare_model(obj, device_placement=device_placement)
+[rank0]:   File "/home/zsj/anaconda3/envs/g2rpo/lib/python3.10/site-packages/accelerate/accelerator.py", line 1477, in prepare_model
+[rank0]:     model = torch.nn.parallel.DistributedDataParallel(
+[rank0]:   File "/home/zsj/anaconda3/envs/g2rpo/lib/python3.10/site-packages/torch/nn/parallel/distributed.py", line 858, in __init__
+[rank0]:     _verify_param_shape_across_processes(self.process_group, parameters)
+[rank0]:   File "/home/zsj/anaconda3/envs/g2rpo/lib/python3.10/site-packages/torch/distributed/utils.py", line 281, in _verify_param_shape_across_processes
+[rank0]:     return dist._verify_params_across_processes(process_group, tensors, logger)
+[rank0]: RuntimeError: DDP expects same model across all ranks, but Rank 0 has 686 params, while rank 1 has inconsistent 0 params.

wandb/run-20260124_003511-fnfy86iu/files/requirements.txt ADDED Viewed

	@@ -0,0 +1,189 @@

+scipy==1.13.0
+regex==2024.9.11
+sentencepiece==0.2.0
+six==1.16.0
+anyio==4.11.0
+nvidia-cuda-nvrtc-cu12==12.6.77
+scikit-video==1.1.11
+platformdirs==4.5.0
+mypy==1.11.1
+ruff==0.6.5
+charset-normalizer==3.4.4
+torch==2.9.0+cu126
+av==13.1.0
+pillow==10.2.0
+gpustat==1.1.1
+torchvision==0.24.0+cu126
+multidict==6.7.0
+torchmetrics==1.5.1
+aiohttp==3.13.1
+transformers==4.46.1
+decord==0.6.0
+wcwidth==0.2.14
+sphinx-lint==1.0.0
+nvidia-cuda-runtime-cu12==12.6.77
+pytz==2025.2
+codespell==2.3.0
+hpsv2==1.2.0
+mypy_extensions==1.1.0
+numpy==1.26.3
+omegaconf==2.3.0
+Markdown==3.9
+tzdata==2025.2
+pandas==2.2.3
+pytorch-lightning==2.4.0
+aiosignal==1.4.0
+aiohappyeyeballs==2.6.1
+python-dateutil==2.9.0.post0
+seaborn==0.13.2
+beautifulsoup4==4.12.3
+isort==5.13.2
+httpx==0.28.1
+certifi==2025.10.5
+ml_collections==1.1.0
+nvidia-cudnn-cu12==9.10.2.21
+hf-xet==1.2.0
+requests==2.31.0
+inflect==6.0.4
+iniconfig==2.1.0
+braceexpand==0.1.7
+h5py==3.12.1
+wandb==0.18.5
+protobuf==3.20.3
+ninja==1.13.0
+kiwisolver==1.4.9
+networkx==3.3
+packaging==25.0
+fvcore==0.1.5.post20221221
+pyparsing==3.2.5
+starlette==0.41.3
+frozenlist==1.8.0
+docker-pycreds==0.4.0
+Werkzeug==3.1.3
+MarkupSafe==2.1.5
+einops==0.8.0
+sentry-sdk==2.42.0
+PyYAML==6.0.1
+nvidia-nccl-cu12==2.27.5
+datasets==4.3.0
+polib==1.2.0
+safetensors==0.6.2
+async-timeout==5.0.1
+setproctitle==1.3.7
+clint==0.5.1
+matplotlib==3.9.2
+propcache==0.4.1
+termcolor==3.1.0
+antlr4-python3-runtime==4.9.3
+cycler==0.12.1
+fastvideo==1.2.0
+toml==0.10.2
+xxhash==3.6.0
+wheel==0.44.0
+albumentations==1.4.20
+fastapi==0.115.3
+nvidia-cufft-cu12==11.3.0.4
+yarl==1.22.0
+psutil==7.1.0
+tensorboard-data-server==0.7.2
+pydantic==2.9.2
+nvidia-nvtx-cu12==12.6.77
+portalocker==3.2.0
+triton==3.5.0
+annotated-types==0.7.0
+proglog==0.1.12
+nvidia-cusparselt-cu12==0.7.1
+yapf==0.32.0
+Jinja2==3.1.6
+types-requests==2.32.4.20250913
+lightning-utilities==0.15.2
+grpcio==1.75.1
+uvicorn==0.32.0
+typing_extensions==4.15.0
+nvidia-nvjitlink-cu12==12.6.85
+watch==0.2.7
+moviepy==1.0.3
+timm==1.0.11
+pytest-split==0.8.0
+gdown==5.2.0
+types-setuptools==80.9.0.20250822
+nvidia-cusolver-cu12==11.7.1.2
+types-PyYAML==6.0.12.20250915
+pip==25.2
+qwen-vl-utils==0.0.14
+soupsieve==2.8
+zipp==3.23.0
+flash_attn==2.8.3
+yacs==0.1.8
+diffusers==0.32.0
+pluggy==1.6.0
+opencv-python-headless==4.11.0.86
+mpmath==1.3.0
+test_tube==0.7.5
+stringzilla==4.2.1
+fonttools==4.60.1
+nvidia-ml-py==13.580.82
+parameterized==0.9.0
+loguru==0.7.3
+tabulate==0.9.0
+idna==3.6
+iopath==0.1.10
+decorator==4.4.2
+nvidia-cufile-cu12==1.11.1.6
+threadpoolctl==3.6.0
+pyarrow==21.0.0
+httpcore==1.0.9
+hydra-core==1.3.2
+multiprocess==0.70.16
+contourpy==1.3.2
+clip==1.0
+tqdm==4.66.5
+open_clip_torch==3.2.0
+accelerate==1.0.1
+gitdb==4.0.12
+importlib_metadata==8.7.0
+nvidia-cublas-cu12==12.6.4.1
+h11==0.16.0
+filelock==3.19.1
+liger_kernel==0.4.1
+click==8.3.0
+urllib3==2.2.0
+imageio-ffmpeg==0.5.1
+setuptools==80.9.0
+joblib==1.5.2
+tensorboard==2.20.0
+attrs==25.4.0
+future==1.0.0
+albucore==0.0.19
+fsspec==2025.9.0
+sympy==1.14.0
+eval_type_backport==0.2.2
+pydantic_core==2.23.4
+sniffio==1.3.1
+nvidia-nvshmem-cu12==3.3.20
+exceptiongroup==1.3.0
+smmap==5.0.2
+tomli==2.0.2
+ftfy==6.3.0
+dill==0.4.0
+pytest==7.2.0
+PySocks==1.7.1
+nvidia-curand-cu12==10.3.7.77
+tokenizers==0.20.1
+args==0.1.0
+fairscale==0.4.13
+peft==0.13.2
+webdataset==1.0.2
+huggingface-hub==0.26.1
+GitPython==3.1.45
+pytorchvideo==0.1.5
+scikit-learn==1.5.2
+bitsandbytes==0.48.1
+nvidia-cusparse-cu12==12.5.4.2
+nvidia-cuda-cupti-cu12==12.6.80
+imageio==2.36.0
+pydub==0.25.1
+image-reward==1.5
+absl-py==2.3.1
+blessed==1.22.0
+torchdiffeq==0.2.4

wandb/run-20260124_003511-fnfy86iu/files/wandb-metadata.json ADDED Viewed

	@@ -0,0 +1,96 @@

+{
+  "os":  "Linux-6.8.0-85-generic-x86_64-with-glibc2.35",
+  "python":  "3.10.19",
+  "startedAt":  "2026-01-23T16:35:11.374381Z",
+  "args":  [
+    "--config",
+    "fastvideo/config_sd/base.py",
+    "--eta_step_list",
+    "0,1,2,3,4,5,6,7",
+    "--eta_step_merge_list",
+    "1,1,1,2,2,2,3,3",
+    "--granular_list",
+    "1",
+    "--num_generations",
+    "4",
+    "--eta",
+    "1.0",
+    "--init_same_noise"
+  ],
+  "program":  "/data1/zsj/SceneDPO/Rebuttal/E-GRPO/scoure_code/fastvideo/train_g2rpo_sd_merge.py",
+  "codePath":  "fastvideo/train_g2rpo_sd_merge.py",
+  "email":  "zhangemail1428@163.com",
+  "root":  "/data1/zsj/SceneDPO/Rebuttal/E-GRPO/scoure_code",
+  "host":  "abc",
+  "username":  "zsj",
+  "executable":  "/home/zsj/anaconda3/envs/g2rpo/bin/python",
+  "codePathLocal":  "fastvideo/train_g2rpo_sd_merge.py",
+  "cpu_count":  48,
+  "cpu_count_logical":  96,
+  "gpu":  "NVIDIA RTX 5880 Ada Generation",
+  "gpu_count":  8,
+  "disk":  {
+    "/":  {
+      "total":  "1006773899264",
+      "used":  "812103774208"
+    }
+  },
+  "memory":  {
+    "total":  "540697260032"
+  },
+  "cpu":  {
+    "count":  48,
+    "countLogical":  96
+  },
+  "gpu_nvidia":  [
+    {
+      "name":  "NVIDIA RTX 5880 Ada Generation",
+      "memoryTotal":  "51527024640",
+      "cudaCores":  14080,
+      "architecture":  "Ada"
+    },
+    {
+      "name":  "NVIDIA RTX 5880 Ada Generation",
+      "memoryTotal":  "51527024640",
+      "cudaCores":  14080,
+      "architecture":  "Ada"
+    },
+    {
+      "name":  "NVIDIA RTX 5880 Ada Generation",
+      "memoryTotal":  "51527024640",
+      "cudaCores":  14080,
+      "architecture":  "Ada"
+    },
+    {
+      "name":  "NVIDIA RTX 5880 Ada Generation",
+      "memoryTotal":  "51527024640",
+      "cudaCores":  14080,
+      "architecture":  "Ada"
+    },
+    {
+      "name":  "NVIDIA RTX 5880 Ada Generation",
+      "memoryTotal":  "51527024640",
+      "cudaCores":  14080,
+      "architecture":  "Ada"
+    },
+    {
+      "name":  "NVIDIA RTX 5880 Ada Generation",
+      "memoryTotal":  "51527024640",
+      "cudaCores":  14080,
+      "architecture":  "Ada"
+    },
+    {
+      "name":  "NVIDIA RTX 5880 Ada Generation",
+      "memoryTotal":  "51527024640",
+      "cudaCores":  14080,
+      "architecture":  "Ada"
+    },
+    {
+      "name":  "NVIDIA RTX 5880 Ada Generation",
+      "memoryTotal":  "51527024640",
+      "cudaCores":  14080,
+      "architecture":  "Ada"
+    }
+  ],
+  "cudaVersion":  "12.9"
+}

wandb/run-20260124_003511-fnfy86iu/files/wandb-summary.json ADDED Viewed

	@@ -0,0 +1 @@


1	+ {"_wandb":{"runtime":666}}

wandb/run-20260124_003511-fnfy86iu/logs/debug-core.log ADDED Viewed

	@@ -0,0 +1,12 @@

+{"time":"2026-01-24T00:35:09.510417423+08:00","level":"INFO","msg":"started logging, with flags","port-filename":"/tmp/tmpmrd01299/port-583399.txt","pid":583399,"debug":false,"disable-analytics":false}
+{"time":"2026-01-24T00:35:09.510458378+08:00","level":"INFO","msg":"FeatureState","shutdownOnParentExitEnabled":false}
+{"time":"2026-01-24T00:35:09.511463258+08:00","level":"INFO","msg":"Will exit if parent process dies.","ppid":583399}
+{"time":"2026-01-24T00:35:09.511480485+08:00","level":"INFO","msg":"server is running","addr":{"IP":"127.0.0.1","Port":37279,"Zone":""}}
+{"time":"2026-01-24T00:35:09.680994997+08:00","level":"INFO","msg":"connection: ManageConnectionData: new connection created","id":"127.0.0.1:60944"}
+{"time":"2026-01-24T00:35:11.378863134+08:00","level":"INFO","msg":"handleInformInit: received","streamId":"fnfy86iu","id":"127.0.0.1:60944"}
+{"time":"2026-01-24T00:35:11.498973378+08:00","level":"INFO","msg":"handleInformInit: stream started","streamId":"fnfy86iu","id":"127.0.0.1:60944"}
+{"time":"2026-01-24T00:46:17.507921807+08:00","level":"INFO","msg":"handleInformTeardown: server teardown initiated","id":"127.0.0.1:60944"}
+{"time":"2026-01-24T00:46:17.508062689+08:00","level":"INFO","msg":"connection: Close: initiating connection closure","id":"127.0.0.1:60944"}
+{"time":"2026-01-24T00:46:17.508144622+08:00","level":"INFO","msg":"server is shutting down"}
+{"time":"2026-01-24T00:46:17.50824603+08:00","level":"INFO","msg":"connection: Close: connection successfully closed","id":"127.0.0.1:60944"}
+{"time":"2026-01-24T00:46:18.435088972+08:00","level":"INFO","msg":"Parent process exited, terminating service process."}

wandb/run-20260124_003511-fnfy86iu/logs/debug-internal.log ADDED Viewed

	@@ -0,0 +1,14 @@

+{"time":"2026-01-24T00:35:11.379241849+08:00","level":"INFO","msg":"using version","core version":"0.18.5"}
+{"time":"2026-01-24T00:35:11.379275289+08:00","level":"INFO","msg":"created symlink","path":"/data1/zsj/SceneDPO/Rebuttal/E-GRPO/scoure_code/wandb/run-20260124_003511-fnfy86iu/logs/debug-core.log"}
+{"time":"2026-01-24T00:35:11.498908689+08:00","level":"INFO","msg":"created new stream","id":"fnfy86iu"}
+{"time":"2026-01-24T00:35:11.498965971+08:00","level":"INFO","msg":"stream: started","id":"fnfy86iu"}
+{"time":"2026-01-24T00:35:11.499204509+08:00","level":"INFO","msg":"handler: started","stream_id":{"value":"fnfy86iu"}}
+{"time":"2026-01-24T00:35:11.499270739+08:00","level":"INFO","msg":"writer: Do: started","stream_id":{"value":"fnfy86iu"}}
+{"time":"2026-01-24T00:35:11.499381171+08:00","level":"INFO","msg":"sender: started","stream_id":"fnfy86iu"}
+{"time":"2026-01-24T00:35:12.616857928+08:00","level":"INFO","msg":"Starting system monitor"}
+{"time":"2026-01-24T00:46:17.508040252+08:00","level":"INFO","msg":"stream: closing","id":"fnfy86iu"}
+{"time":"2026-01-24T00:46:17.508123223+08:00","level":"INFO","msg":"Stopping system monitor"}
+{"time":"2026-01-24T00:46:17.509233475+08:00","level":"INFO","msg":"Stopped system monitor"}
+{"time":"2026-01-24T00:46:17.97992374+08:00","level":"WARN","msg":"No job ingredients found, not creating job artifact"}
+{"time":"2026-01-24T00:46:17.979956234+08:00","level":"WARN","msg":"No source type found, not creating job artifact"}
+{"time":"2026-01-24T00:46:17.979968114+08:00","level":"INFO","msg":"sender: sendDefer: no job artifact to save"}

wandb/run-20260124_003511-fnfy86iu/logs/debug.log ADDED Viewed

	@@ -0,0 +1,27 @@

+2026-01-24 00:35:11,371 INFO    MainThread:583399 [wandb_setup.py:_flush():79] Current SDK version is 0.18.5
+2026-01-24 00:35:11,371 INFO    MainThread:583399 [wandb_setup.py:_flush():79] Configure stats pid to 583399
+2026-01-24 00:35:11,371 INFO    MainThread:583399 [wandb_setup.py:_flush():79] Loading settings from /home/zsj/.config/wandb/settings
+2026-01-24 00:35:11,371 INFO    MainThread:583399 [wandb_setup.py:_flush():79] Loading settings from /data1/zsj/SceneDPO/Rebuttal/E-GRPO/scoure_code/wandb/settings
+2026-01-24 00:35:11,372 INFO    MainThread:583399 [wandb_setup.py:_flush():79] Loading settings from environment variables: {}
+2026-01-24 00:35:11,372 INFO    MainThread:583399 [wandb_setup.py:_flush():79] Applying setup settings: {'mode': None, '_disable_service': None}
+2026-01-24 00:35:11,372 INFO    MainThread:583399 [wandb_setup.py:_flush():79] Inferring run settings from compute environment: {'program_relpath': 'fastvideo/train_g2rpo_sd_merge.py', 'program_abspath': '/data1/zsj/SceneDPO/Rebuttal/E-GRPO/scoure_code/fastvideo/train_g2rpo_sd_merge.py', 'program': '/data1/zsj/SceneDPO/Rebuttal/E-GRPO/scoure_code/fastvideo/train_g2rpo_sd_merge.py'}
+2026-01-24 00:35:11,372 INFO    MainThread:583399 [wandb_setup.py:_flush():79] Applying login settings: {}
+2026-01-24 00:35:11,372 INFO    MainThread:583399 [wandb_init.py:_log_setup():534] Logging user logs to /data1/zsj/SceneDPO/Rebuttal/E-GRPO/scoure_code/wandb/run-20260124_003511-fnfy86iu/logs/debug.log
+2026-01-24 00:35:11,372 INFO    MainThread:583399 [wandb_init.py:_log_setup():535] Logging internal logs to /data1/zsj/SceneDPO/Rebuttal/E-GRPO/scoure_code/wandb/run-20260124_003511-fnfy86iu/logs/debug-internal.log
+2026-01-24 00:35:11,372 INFO    MainThread:583399 [wandb_init.py:init():621] calling init triggers
+2026-01-24 00:35:11,372 INFO    MainThread:583399 [wandb_init.py:init():628] wandb.init called with sweep_config: {}
+config: {}
+2026-01-24 00:35:11,372 INFO    MainThread:583399 [wandb_init.py:init():671] starting backend
+2026-01-24 00:35:11,372 INFO    MainThread:583399 [wandb_init.py:init():675] sending inform_init request
+2026-01-24 00:35:11,373 INFO    MainThread:583399 [backend.py:_multiprocessing_setup():104] multiprocessing start_methods=fork,spawn,forkserver, using: spawn
+2026-01-24 00:35:11,374 INFO    MainThread:583399 [wandb_init.py:init():688] backend started and connected
+2026-01-24 00:35:11,376 INFO    MainThread:583399 [wandb_init.py:init():783] updated telemetry
+2026-01-24 00:35:11,377 INFO    MainThread:583399 [wandb_init.py:init():816] communicating run to backend with 90.0 second timeout
+2026-01-24 00:35:12,610 INFO    MainThread:583399 [wandb_init.py:init():867] starting run threads in backend
+2026-01-24 00:35:12,765 INFO    MainThread:583399 [wandb_run.py:_console_start():2463] atexit reg
+2026-01-24 00:35:12,765 INFO    MainThread:583399 [wandb_run.py:_redirect():2311] redirect: wrap_raw
+2026-01-24 00:35:12,765 INFO    MainThread:583399 [wandb_run.py:_redirect():2376] Wrapping output streams.
+2026-01-24 00:35:12,765 INFO    MainThread:583399 [wandb_run.py:_redirect():2401] Redirects installed.
+2026-01-24 00:35:12,767 INFO    MainThread:583399 [wandb_init.py:init():911] run started, returning control to user process
+2026-01-24 00:35:12,767 INFO    MainThread:583399 [wandb_run.py:_config_callback():1390] config_cb None None {'allow_tf32': True, 'logdir': 'logs', 'mixed_precision': 'bf16', 'num_checkpoint_limit': 5, 'num_epochs': 300, 'pretrained': {'model': './data/StableDiffusion', 'revision': 'main'}, 'prompt_fn': 'imagenet_animals', 'prompt_fn_kwargs': {}, 'resume_from': '', 'reward_fn': 'hpsv2', 'run_name': '2026.01.24_00.34.56', 'sample': {'batch_size': 1, 'eta': 1.0, 'guidance_scale': 5.0, 'num_batches_per_epoch': 2, 'num_steps': 50}, 'save_freq': 20, 'seed': 42, 'train': {'adam_beta1': 0.9, 'adam_beta2': 0.999, 'adam_epsilon': 1e-08, 'adam_weight_decay': 0.0001, 'adv_clip_max': 5, 'batch_size': 1, 'cfg': True, 'clip_range': 0.0001, 'gradient_accumulation_steps': 1, 'learning_rate': 1e-05, 'max_grad_norm': 1.0, 'num_inner_epochs': 1, 'timestep_fraction': 1.0, 'use_8bit_adam': False}, 'use_lora': False}
+2026-01-24 00:46:17,508 WARNING MsgRouterThr:583399 [router.py:message_loop():77] message_loop has been closed

wandb/run-20260124_022230-0y3z9z7o/files/config.yaml ADDED Viewed

	@@ -0,0 +1,87 @@

+_wandb:
+    value:
+        cli_version: 0.18.5
+        m: []
+        python_version: 3.10.19
+        t:
+            "1":
+                - 1
+                - 11
+                - 41
+                - 49
+                - 55
+                - 71
+                - 83
+                - 98
+            "2":
+                - 1
+                - 11
+                - 41
+                - 49
+                - 55
+                - 63
+                - 71
+                - 83
+                - 98
+            "3":
+                - 13
+                - 23
+                - 55
+            "4": 3.10.19
+            "5": 0.18.5
+            "6": 4.46.1
+            "8":
+                - 5
+            "12": 0.18.5
+            "13": linux-x86_64
+allow_tf32:
+    value: true
+logdir:
+    value: logs
+mixed_precision:
+    value: bf16
+num_checkpoint_limit:
+    value: 5
+num_epochs:
+    value: 300
+pretrained:
+    value:
+        model: ./data/StableDiffusion
+        revision: main
+prompt_fn:
+    value: imagenet_animals
+resume_from:
+    value: ""
+reward_fn:
+    value: hpsv2
+run_name:
+    value: 2026.01.24_02.22.28
+sample:
+    value:
+        batch_size: 1
+        eta: 1
+        guidance_scale: 5
+        num_batches_per_epoch: 2
+        num_steps: 50
+save_freq:
+    value: 20
+seed:
+    value: 42
+train:
+    value:
+        adam_beta1: 0.9
+        adam_beta2: 0.999
+        adam_epsilon: 1e-08
+        adam_weight_decay: 0.0001
+        adv_clip_max: 5
+        batch_size: 1
+        cfg: true
+        clip_range: 0.0001
+        gradient_accumulation_steps: 1
+        learning_rate: 1e-05
+        max_grad_norm: 1
+        num_inner_epochs: 1
+        timestep_fraction: 1
+        use_8bit_adam: false
+use_lora:
+    value: false

wandb/run-20260124_022230-0y3z9z7o/files/output.log ADDED Viewed

	@@ -0,0 +1,94 @@

+I0124 02:22:31.450613 138092014643008 train_g2rpo_sd_merge.py:465]
+allow_tf32: true
+logdir: logs
+mixed_precision: bf16
+num_checkpoint_limit: 5
+num_epochs: 300
+pretrained:
+  model: ./data/StableDiffusion
+  revision: main
+prompt_fn: imagenet_animals
+prompt_fn_kwargs: {}
+resume_from: ''
+reward_fn: hpsv2
+run_name: 2026.01.24_02.22.28
+sample:
+  batch_size: 1
+  eta: 1.0
+  guidance_scale: 5.0
+  num_batches_per_epoch: 2
+  num_steps: 50
+save_freq: 20
+seed: 42
+train:
+  adam_beta1: 0.9
+  adam_beta2: 0.999
+  adam_epsilon: 1.0e-08
+  adam_weight_decay: 0.0001
+  adv_clip_max: 5
+  batch_size: 1
+  cfg: true
+  clip_range: 0.0001
+  gradient_accumulation_steps: 1
+  learning_rate: 1.0e-05
+  max_grad_norm: 1.0
+  num_inner_epochs: 1
+  timestep_fraction: 1.0
+  use_8bit_adam: false
+use_lora: false
+Loading pipeline components...: 100%|███████████████████████████████████████████████████████████████████████████████████████████| 7/7 [00:02<00:00,  2.47it/s]
+/home/zsj/anaconda3/envs/g2rpo/lib/python3.10/site-packages/timm/models/layers/__init__.py:48: FutureWarning: Importing from timm.models.layers is deprecated, please import via timm.layers
+  warnings.warn(f"Importing from {__name__} is deprecated, please import via timm.layers", FutureWarning)
+I0124 02:22:34.955836 138092014643008 factory.py:159] Loaded ViT-H-14 model config.
+I0124 02:22:40.351596 138092014643008 factory.py:207] Loading pretrained ViT-H-14 weights (./data/hps/open_clip_pytorch_model.bin).
+Traceback (most recent call last):
+  File "/data1/zsj/SceneDPO/Rebuttal/E-GRPO/scoure_code/fastvideo/train_g2rpo_sd_merge.py", line 930, in <module>
+    app.run(main)
+  File "/home/zsj/anaconda3/envs/g2rpo/lib/python3.10/site-packages/absl/app.py", line 316, in run
+    _run_main(main, args)
+  File "/home/zsj/anaconda3/envs/g2rpo/lib/python3.10/site-packages/absl/app.py", line 261, in _run_main
+    sys.exit(main(argv))
+  File "/data1/zsj/SceneDPO/Rebuttal/E-GRPO/scoure_code/fastvideo/train_g2rpo_sd_merge.py", line 603, in main
+    unet, optimizer = accelerator.prepare(unet, optimizer)
+  File "/home/zsj/anaconda3/envs/g2rpo/lib/python3.10/site-packages/accelerate/accelerator.py", line 1350, in prepare
+    result = tuple(
+  File "/home/zsj/anaconda3/envs/g2rpo/lib/python3.10/site-packages/accelerate/accelerator.py", line 1351, in <genexpr>
+    self._prepare_one(obj, first_pass=True, device_placement=d) for obj, d in zip(args, device_placement)
+  File "/home/zsj/anaconda3/envs/g2rpo/lib/python3.10/site-packages/accelerate/accelerator.py", line 1226, in _prepare_one
+    return self.prepare_model(obj, device_placement=device_placement)
+  File "/home/zsj/anaconda3/envs/g2rpo/lib/python3.10/site-packages/accelerate/accelerator.py", line 1477, in prepare_model
+    model = torch.nn.parallel.DistributedDataParallel(
+  File "/home/zsj/anaconda3/envs/g2rpo/lib/python3.10/site-packages/torch/nn/parallel/distributed.py", line 858, in __init__
+    _verify_param_shape_across_processes(self.process_group, parameters)
+  File "/home/zsj/anaconda3/envs/g2rpo/lib/python3.10/site-packages/torch/distributed/utils.py", line 281, in _verify_param_shape_across_processes
+    return dist._verify_params_across_processes(process_group, tensors, logger)
+torch.distributed.DistBackendError: NCCL error in: /pytorch/torch/csrc/distributed/c10d/NCCLUtils.cpp:94, invalid usage (run with NCCL_DEBUG=WARN for details), NCCL version 2.27.5
+ncclInvalidUsage: This usually reflects invalid usage of NCCL library.
+Last error:
+Duplicate GPU detected : rank 0 and rank 6 both on CUDA device 2c000
+[rank0]: Traceback (most recent call last):
+[rank0]:   File "/data1/zsj/SceneDPO/Rebuttal/E-GRPO/scoure_code/fastvideo/train_g2rpo_sd_merge.py", line 930, in <module>
+[rank0]:     app.run(main)
+[rank0]:   File "/home/zsj/anaconda3/envs/g2rpo/lib/python3.10/site-packages/absl/app.py", line 316, in run
+[rank0]:     _run_main(main, args)
+[rank0]:   File "/home/zsj/anaconda3/envs/g2rpo/lib/python3.10/site-packages/absl/app.py", line 261, in _run_main
+[rank0]:     sys.exit(main(argv))
+[rank0]:   File "/data1/zsj/SceneDPO/Rebuttal/E-GRPO/scoure_code/fastvideo/train_g2rpo_sd_merge.py", line 603, in main
+[rank0]:     unet, optimizer = accelerator.prepare(unet, optimizer)
+[rank0]:   File "/home/zsj/anaconda3/envs/g2rpo/lib/python3.10/site-packages/accelerate/accelerator.py", line 1350, in prepare
+[rank0]:     result = tuple(
+[rank0]:   File "/home/zsj/anaconda3/envs/g2rpo/lib/python3.10/site-packages/accelerate/accelerator.py", line 1351, in <genexpr>
+[rank0]:     self._prepare_one(obj, first_pass=True, device_placement=d) for obj, d in zip(args, device_placement)
+[rank0]:   File "/home/zsj/anaconda3/envs/g2rpo/lib/python3.10/site-packages/accelerate/accelerator.py", line 1226, in _prepare_one
+[rank0]:     return self.prepare_model(obj, device_placement=device_placement)
+[rank0]:   File "/home/zsj/anaconda3/envs/g2rpo/lib/python3.10/site-packages/accelerate/accelerator.py", line 1477, in prepare_model
+[rank0]:     model = torch.nn.parallel.DistributedDataParallel(
+[rank0]:   File "/home/zsj/anaconda3/envs/g2rpo/lib/python3.10/site-packages/torch/nn/parallel/distributed.py", line 858, in __init__
+[rank0]:     _verify_param_shape_across_processes(self.process_group, parameters)
+[rank0]:   File "/home/zsj/anaconda3/envs/g2rpo/lib/python3.10/site-packages/torch/distributed/utils.py", line 281, in _verify_param_shape_across_processes
+[rank0]:     return dist._verify_params_across_processes(process_group, tensors, logger)
+[rank0]: torch.distributed.DistBackendError: NCCL error in: /pytorch/torch/csrc/distributed/c10d/NCCLUtils.cpp:94, invalid usage (run with NCCL_DEBUG=WARN for details), NCCL version 2.27.5
+[rank0]: ncclInvalidUsage: This usually reflects invalid usage of NCCL library.
+[rank0]: Last error:
+[rank0]: Duplicate GPU detected : rank 0 and rank 6 both on CUDA device 2c000

wandb/run-20260124_022230-0y3z9z7o/files/requirements.txt ADDED Viewed

	@@ -0,0 +1,189 @@

+scipy==1.13.0
+regex==2024.9.11
+sentencepiece==0.2.0
+six==1.16.0
+anyio==4.11.0
+nvidia-cuda-nvrtc-cu12==12.6.77
+scikit-video==1.1.11
+platformdirs==4.5.0
+mypy==1.11.1
+ruff==0.6.5
+charset-normalizer==3.4.4
+torch==2.9.0+cu126
+av==13.1.0
+pillow==10.2.0
+gpustat==1.1.1
+torchvision==0.24.0+cu126
+multidict==6.7.0
+torchmetrics==1.5.1
+aiohttp==3.13.1
+transformers==4.46.1
+decord==0.6.0
+wcwidth==0.2.14
+sphinx-lint==1.0.0
+nvidia-cuda-runtime-cu12==12.6.77
+pytz==2025.2
+codespell==2.3.0
+hpsv2==1.2.0
+mypy_extensions==1.1.0
+numpy==1.26.3
+omegaconf==2.3.0
+Markdown==3.9
+tzdata==2025.2
+pandas==2.2.3
+pytorch-lightning==2.4.0
+aiosignal==1.4.0
+aiohappyeyeballs==2.6.1
+python-dateutil==2.9.0.post0
+seaborn==0.13.2
+beautifulsoup4==4.12.3
+isort==5.13.2
+httpx==0.28.1
+certifi==2025.10.5
+ml_collections==1.1.0
+nvidia-cudnn-cu12==9.10.2.21
+hf-xet==1.2.0
+requests==2.31.0
+inflect==6.0.4
+iniconfig==2.1.0
+braceexpand==0.1.7
+h5py==3.12.1
+wandb==0.18.5
+protobuf==3.20.3
+ninja==1.13.0
+kiwisolver==1.4.9
+networkx==3.3
+packaging==25.0
+fvcore==0.1.5.post20221221
+pyparsing==3.2.5
+starlette==0.41.3
+frozenlist==1.8.0
+docker-pycreds==0.4.0
+Werkzeug==3.1.3
+MarkupSafe==2.1.5
+einops==0.8.0
+sentry-sdk==2.42.0
+PyYAML==6.0.1
+nvidia-nccl-cu12==2.27.5
+datasets==4.3.0
+polib==1.2.0
+safetensors==0.6.2
+async-timeout==5.0.1
+setproctitle==1.3.7
+clint==0.5.1
+matplotlib==3.9.2
+propcache==0.4.1
+termcolor==3.1.0
+antlr4-python3-runtime==4.9.3
+cycler==0.12.1
+fastvideo==1.2.0
+toml==0.10.2
+xxhash==3.6.0
+wheel==0.44.0
+albumentations==1.4.20
+fastapi==0.115.3
+nvidia-cufft-cu12==11.3.0.4
+yarl==1.22.0
+psutil==7.1.0
+tensorboard-data-server==0.7.2
+pydantic==2.9.2
+nvidia-nvtx-cu12==12.6.77
+portalocker==3.2.0
+triton==3.5.0
+annotated-types==0.7.0
+proglog==0.1.12
+nvidia-cusparselt-cu12==0.7.1
+yapf==0.32.0
+Jinja2==3.1.6
+types-requests==2.32.4.20250913
+lightning-utilities==0.15.2
+grpcio==1.75.1
+uvicorn==0.32.0
+typing_extensions==4.15.0
+nvidia-nvjitlink-cu12==12.6.85
+watch==0.2.7
+moviepy==1.0.3
+timm==1.0.11
+pytest-split==0.8.0
+gdown==5.2.0
+types-setuptools==80.9.0.20250822
+nvidia-cusolver-cu12==11.7.1.2
+types-PyYAML==6.0.12.20250915
+pip==25.2
+qwen-vl-utils==0.0.14
+soupsieve==2.8
+zipp==3.23.0
+flash_attn==2.8.3
+yacs==0.1.8
+diffusers==0.32.0
+pluggy==1.6.0
+opencv-python-headless==4.11.0.86
+mpmath==1.3.0
+test_tube==0.7.5
+stringzilla==4.2.1
+fonttools==4.60.1
+nvidia-ml-py==13.580.82
+parameterized==0.9.0
+loguru==0.7.3
+tabulate==0.9.0
+idna==3.6
+iopath==0.1.10
+decorator==4.4.2
+nvidia-cufile-cu12==1.11.1.6
+threadpoolctl==3.6.0
+pyarrow==21.0.0
+httpcore==1.0.9
+hydra-core==1.3.2
+multiprocess==0.70.16
+contourpy==1.3.2
+clip==1.0
+tqdm==4.66.5
+open_clip_torch==3.2.0
+accelerate==1.0.1
+gitdb==4.0.12
+importlib_metadata==8.7.0
+nvidia-cublas-cu12==12.6.4.1
+h11==0.16.0
+filelock==3.19.1
+liger_kernel==0.4.1
+click==8.3.0
+urllib3==2.2.0
+imageio-ffmpeg==0.5.1
+setuptools==80.9.0
+joblib==1.5.2
+tensorboard==2.20.0
+attrs==25.4.0
+future==1.0.0
+albucore==0.0.19
+fsspec==2025.9.0
+sympy==1.14.0
+eval_type_backport==0.2.2
+pydantic_core==2.23.4
+sniffio==1.3.1
+nvidia-nvshmem-cu12==3.3.20
+exceptiongroup==1.3.0
+smmap==5.0.2
+tomli==2.0.2
+ftfy==6.3.0
+dill==0.4.0
+pytest==7.2.0
+PySocks==1.7.1
+nvidia-curand-cu12==10.3.7.77
+tokenizers==0.20.1
+args==0.1.0
+fairscale==0.4.13
+peft==0.13.2
+webdataset==1.0.2
+huggingface-hub==0.26.1
+GitPython==3.1.45
+pytorchvideo==0.1.5
+scikit-learn==1.5.2
+bitsandbytes==0.48.1
+nvidia-cusparse-cu12==12.5.4.2
+nvidia-cuda-cupti-cu12==12.6.80
+imageio==2.36.0
+pydub==0.25.1
+image-reward==1.5
+absl-py==2.3.1
+blessed==1.22.0
+torchdiffeq==0.2.4

wandb/run-20260124_022230-0y3z9z7o/files/wandb-metadata.json ADDED Viewed

	@@ -0,0 +1,96 @@

+{
+  "os":  "Linux-6.8.0-85-generic-x86_64-with-glibc2.35",
+  "python":  "3.10.19",
+  "startedAt":  "2026-01-23T18:22:30.277742Z",
+  "args":  [
+    "--config",
+    "fastvideo/config_sd/base.py",
+    "--eta_step_list",
+    "0,1,2,3,4,5,6,7",
+    "--eta_step_merge_list",
+    "1,1,1,2,2,2,3,3",
+    "--granular_list",
+    "1",
+    "--num_generations",
+    "4",
+    "--eta",
+    "1.0",
+    "--init_same_noise"
+  ],
+  "program":  "/data1/zsj/SceneDPO/Rebuttal/E-GRPO/scoure_code/fastvideo/train_g2rpo_sd_merge.py",
+  "codePath":  "fastvideo/train_g2rpo_sd_merge.py",
+  "email":  "zhangemail1428@163.com",
+  "root":  "/data1/zsj/SceneDPO/Rebuttal/E-GRPO/scoure_code",
+  "host":  "abc",
+  "username":  "zsj",
+  "executable":  "/home/zsj/anaconda3/envs/g2rpo/bin/python",
+  "codePathLocal":  "fastvideo/train_g2rpo_sd_merge.py",
+  "cpu_count":  48,
+  "cpu_count_logical":  96,
+  "gpu":  "NVIDIA RTX 5880 Ada Generation",
+  "gpu_count":  8,
+  "disk":  {
+    "/":  {
+      "total":  "1006773899264",
+      "used":  "813053333504"
+    }
+  },
+  "memory":  {
+    "total":  "540697260032"
+  },
+  "cpu":  {
+    "count":  48,
+    "countLogical":  96
+  },
+  "gpu_nvidia":  [
+    {
+      "name":  "NVIDIA RTX 5880 Ada Generation",
+      "memoryTotal":  "51527024640",
+      "cudaCores":  14080,
+      "architecture":  "Ada"
+    },
+    {
+      "name":  "NVIDIA RTX 5880 Ada Generation",
+      "memoryTotal":  "51527024640",
+      "cudaCores":  14080,
+      "architecture":  "Ada"
+    },
+    {
+      "name":  "NVIDIA RTX 5880 Ada Generation",
+      "memoryTotal":  "51527024640",
+      "cudaCores":  14080,
+      "architecture":  "Ada"
+    },
+    {
+      "name":  "NVIDIA RTX 5880 Ada Generation",
+      "memoryTotal":  "51527024640",
+      "cudaCores":  14080,
+      "architecture":  "Ada"
+    },
+    {
+      "name":  "NVIDIA RTX 5880 Ada Generation",
+      "memoryTotal":  "51527024640",
+      "cudaCores":  14080,
+      "architecture":  "Ada"
+    },
+    {
+      "name":  "NVIDIA RTX 5880 Ada Generation",
+      "memoryTotal":  "51527024640",
+      "cudaCores":  14080,
+      "architecture":  "Ada"
+    },
+    {
+      "name":  "NVIDIA RTX 5880 Ada Generation",
+      "memoryTotal":  "51527024640",
+      "cudaCores":  14080,
+      "architecture":  "Ada"
+    },
+    {
+      "name":  "NVIDIA RTX 5880 Ada Generation",
+      "memoryTotal":  "51527024640",
+      "cudaCores":  14080,
+      "architecture":  "Ada"
+    }
+  ],
+  "cudaVersion":  "12.9"
+}

wandb/run-20260124_022230-0y3z9z7o/files/wandb-summary.json ADDED Viewed

	@@ -0,0 +1 @@


1	+ {"_wandb":{"runtime":15}}

wandb/run-20260124_022230-0y3z9z7o/logs/debug-core.log ADDED Viewed

	@@ -0,0 +1,12 @@

+{"time":"2026-01-24T02:22:29.302091572+08:00","level":"INFO","msg":"started logging, with flags","port-filename":"/tmp/tmptjgtjg7f/port-608086.txt","pid":608086,"debug":false,"disable-analytics":false}
+{"time":"2026-01-24T02:22:29.302119596+08:00","level":"INFO","msg":"FeatureState","shutdownOnParentExitEnabled":false}
+{"time":"2026-01-24T02:22:29.30272344+08:00","level":"INFO","msg":"Will exit if parent process dies.","ppid":608086}
+{"time":"2026-01-24T02:22:29.302734848+08:00","level":"INFO","msg":"server is running","addr":{"IP":"127.0.0.1","Port":42453,"Zone":""}}
+{"time":"2026-01-24T02:22:29.492955085+08:00","level":"INFO","msg":"connection: ManageConnectionData: new connection created","id":"127.0.0.1:51798"}
+{"time":"2026-01-24T02:22:30.281666201+08:00","level":"INFO","msg":"handleInformInit: received","streamId":"0y3z9z7o","id":"127.0.0.1:51798"}
+{"time":"2026-01-24T02:22:30.394942882+08:00","level":"INFO","msg":"handleInformInit: stream started","streamId":"0y3z9z7o","id":"127.0.0.1:51798"}
+{"time":"2026-01-24T02:22:45.992004001+08:00","level":"INFO","msg":"handleInformTeardown: server teardown initiated","id":"127.0.0.1:51798"}
+{"time":"2026-01-24T02:22:45.992302576+08:00","level":"INFO","msg":"server is shutting down"}
+{"time":"2026-01-24T02:22:45.992296318+08:00","level":"INFO","msg":"connection: Close: initiating connection closure","id":"127.0.0.1:51798"}
+{"time":"2026-01-24T02:22:45.992713821+08:00","level":"INFO","msg":"connection: Close: connection successfully closed","id":"127.0.0.1:51798"}
+{"time":"2026-01-24T02:22:48.165257156+08:00","level":"INFO","msg":"Parent process exited, terminating service process."}

wandb/run-20260124_022230-0y3z9z7o/logs/debug-internal.log ADDED Viewed

	@@ -0,0 +1,15 @@

+{"time":"2026-01-24T02:22:30.281841189+08:00","level":"INFO","msg":"using version","core version":"0.18.5"}
+{"time":"2026-01-24T02:22:30.281861284+08:00","level":"INFO","msg":"created symlink","path":"/data1/zsj/SceneDPO/Rebuttal/E-GRPO/scoure_code/wandb/run-20260124_022230-0y3z9z7o/logs/debug-core.log"}
+{"time":"2026-01-24T02:22:30.394846024+08:00","level":"INFO","msg":"created new stream","id":"0y3z9z7o"}
+{"time":"2026-01-24T02:22:30.394931982+08:00","level":"INFO","msg":"stream: started","id":"0y3z9z7o"}
+{"time":"2026-01-24T02:22:30.395106768+08:00","level":"INFO","msg":"sender: started","stream_id":"0y3z9z7o"}
+{"time":"2026-01-24T02:22:30.395039138+08:00","level":"INFO","msg":"handler: started","stream_id":{"value":"0y3z9z7o"}}
+{"time":"2026-01-24T02:22:30.395033137+08:00","level":"INFO","msg":"writer: Do: started","stream_id":{"value":"0y3z9z7o"}}
+{"time":"2026-01-24T02:22:31.287570308+08:00","level":"INFO","msg":"Starting system monitor"}
+{"time":"2026-01-24T02:22:45.992135089+08:00","level":"INFO","msg":"stream: closing","id":"0y3z9z7o"}
+{"time":"2026-01-24T02:22:45.992197139+08:00","level":"INFO","msg":"Stopping system monitor"}
+{"time":"2026-01-24T02:22:45.995895301+08:00","level":"INFO","msg":"Stopped system monitor"}
+{"time":"2026-01-24T02:22:46.363069461+08:00","level":"WARN","msg":"No job ingredients found, not creating job artifact"}
+{"time":"2026-01-24T02:22:46.363103824+08:00","level":"WARN","msg":"No source type found, not creating job artifact"}
+{"time":"2026-01-24T02:22:46.363114999+08:00","level":"INFO","msg":"sender: sendDefer: no job artifact to save"}
+{"time":"2026-01-24T02:22:47.353967974+08:00","level":"INFO","msg":"fileTransfer: Close: file transfer manager closed"}

wandb/run-20260124_022230-0y3z9z7o/logs/debug.log ADDED Viewed

	@@ -0,0 +1,27 @@

+2026-01-24 02:22:30,273 INFO    MainThread:608086 [wandb_setup.py:_flush():79] Current SDK version is 0.18.5
+2026-01-24 02:22:30,274 INFO    MainThread:608086 [wandb_setup.py:_flush():79] Configure stats pid to 608086
+2026-01-24 02:22:30,274 INFO    MainThread:608086 [wandb_setup.py:_flush():79] Loading settings from /home/zsj/.config/wandb/settings
+2026-01-24 02:22:30,274 INFO    MainThread:608086 [wandb_setup.py:_flush():79] Loading settings from /data1/zsj/SceneDPO/Rebuttal/E-GRPO/scoure_code/wandb/settings
+2026-01-24 02:22:30,274 INFO    MainThread:608086 [wandb_setup.py:_flush():79] Loading settings from environment variables: {}
+2026-01-24 02:22:30,274 INFO    MainThread:608086 [wandb_setup.py:_flush():79] Applying setup settings: {'mode': None, '_disable_service': None}
+2026-01-24 02:22:30,274 INFO    MainThread:608086 [wandb_setup.py:_flush():79] Inferring run settings from compute environment: {'program_relpath': 'fastvideo/train_g2rpo_sd_merge.py', 'program_abspath': '/data1/zsj/SceneDPO/Rebuttal/E-GRPO/scoure_code/fastvideo/train_g2rpo_sd_merge.py', 'program': '/data1/zsj/SceneDPO/Rebuttal/E-GRPO/scoure_code/fastvideo/train_g2rpo_sd_merge.py'}
+2026-01-24 02:22:30,274 INFO    MainThread:608086 [wandb_setup.py:_flush():79] Applying login settings: {}
+2026-01-24 02:22:30,274 INFO    MainThread:608086 [wandb_init.py:_log_setup():534] Logging user logs to /data1/zsj/SceneDPO/Rebuttal/E-GRPO/scoure_code/wandb/run-20260124_022230-0y3z9z7o/logs/debug.log
+2026-01-24 02:22:30,274 INFO    MainThread:608086 [wandb_init.py:_log_setup():535] Logging internal logs to /data1/zsj/SceneDPO/Rebuttal/E-GRPO/scoure_code/wandb/run-20260124_022230-0y3z9z7o/logs/debug-internal.log
+2026-01-24 02:22:30,274 INFO    MainThread:608086 [wandb_init.py:init():621] calling init triggers
+2026-01-24 02:22:30,274 INFO    MainThread:608086 [wandb_init.py:init():628] wandb.init called with sweep_config: {}
+config: {}
+2026-01-24 02:22:30,274 INFO    MainThread:608086 [wandb_init.py:init():671] starting backend
+2026-01-24 02:22:30,274 INFO    MainThread:608086 [wandb_init.py:init():675] sending inform_init request
+2026-01-24 02:22:30,276 INFO    MainThread:608086 [backend.py:_multiprocessing_setup():104] multiprocessing start_methods=fork,spawn,forkserver, using: spawn
+2026-01-24 02:22:30,277 INFO    MainThread:608086 [wandb_init.py:init():688] backend started and connected
+2026-01-24 02:22:30,282 INFO    MainThread:608086 [wandb_init.py:init():783] updated telemetry
+2026-01-24 02:22:30,283 INFO    MainThread:608086 [wandb_init.py:init():816] communicating run to backend with 90.0 second timeout
+2026-01-24 02:22:31,277 INFO    MainThread:608086 [wandb_init.py:init():867] starting run threads in backend
+2026-01-24 02:22:31,446 INFO    MainThread:608086 [wandb_run.py:_console_start():2463] atexit reg
+2026-01-24 02:22:31,446 INFO    MainThread:608086 [wandb_run.py:_redirect():2311] redirect: wrap_raw
+2026-01-24 02:22:31,447 INFO    MainThread:608086 [wandb_run.py:_redirect():2376] Wrapping output streams.
+2026-01-24 02:22:31,447 INFO    MainThread:608086 [wandb_run.py:_redirect():2401] Redirects installed.
+2026-01-24 02:22:31,448 INFO    MainThread:608086 [wandb_init.py:init():911] run started, returning control to user process
+2026-01-24 02:22:31,448 INFO    MainThread:608086 [wandb_run.py:_config_callback():1390] config_cb None None {'allow_tf32': True, 'logdir': 'logs', 'mixed_precision': 'bf16', 'num_checkpoint_limit': 5, 'num_epochs': 300, 'pretrained': {'model': './data/StableDiffusion', 'revision': 'main'}, 'prompt_fn': 'imagenet_animals', 'prompt_fn_kwargs': {}, 'resume_from': '', 'reward_fn': 'hpsv2', 'run_name': '2026.01.24_02.22.28', 'sample': {'batch_size': 1, 'eta': 1.0, 'guidance_scale': 5.0, 'num_batches_per_epoch': 2, 'num_steps': 50}, 'save_freq': 20, 'seed': 42, 'train': {'adam_beta1': 0.9, 'adam_beta2': 0.999, 'adam_epsilon': 1e-08, 'adam_weight_decay': 0.0001, 'adv_clip_max': 5, 'batch_size': 1, 'cfg': True, 'clip_range': 0.0001, 'gradient_accumulation_steps': 1, 'learning_rate': 1e-05, 'max_grad_norm': 1.0, 'num_inner_epochs': 1, 'timestep_fraction': 1.0, 'use_8bit_adam': False}, 'use_lora': False}
+2026-01-24 02:22:45,992 WARNING MsgRouterThr:608086 [router.py:message_loop():77] message_loop has been closed

wandb/run-20260124_105101-s3i4k862/files/wandb-metadata.json ADDED Viewed

	@@ -0,0 +1,96 @@

+{
+  "os":  "Linux-6.8.0-85-generic-x86_64-with-glibc2.35",
+  "python":  "3.10.19",
+  "startedAt":  "2026-01-24T02:51:01.789219Z",
+  "args":  [
+    "--config",
+    "fastvideo/config_sd/base.py",
+    "--eta_step_list",
+    "0,1,2,3,4,5,6,7",
+    "--eta_step_merge_list",
+    "1,1,1,2,2,2,3,3",
+    "--granular_list",
+    "1",
+    "--num_generations",
+    "4",
+    "--eta",
+    "1.0",
+    "--init_same_noise"
+  ],
+  "program":  "/data1/zsj/SceneDPO/Rebuttal/E-GRPO/scoure_code/fastvideo/train_g2rpo_sd_merge.py",
+  "codePath":  "fastvideo/train_g2rpo_sd_merge.py",
+  "email":  "zhangemail1428@163.com",
+  "root":  "/data1/zsj/SceneDPO/Rebuttal/E-GRPO/scoure_code",
+  "host":  "abc",
+  "username":  "zsj",
+  "executable":  "/home/zsj/anaconda3/envs/g2rpo/bin/python",
+  "codePathLocal":  "fastvideo/train_g2rpo_sd_merge.py",
+  "cpu_count":  48,
+  "cpu_count_logical":  96,
+  "gpu":  "NVIDIA RTX 5880 Ada Generation",
+  "gpu_count":  8,
+  "disk":  {
+    "/":  {
+      "total":  "1006773899264",
+      "used":  "811835744256"
+    }
+  },
+  "memory":  {
+    "total":  "540697260032"
+  },
+  "cpu":  {
+    "count":  48,
+    "countLogical":  96
+  },
+  "gpu_nvidia":  [
+    {
+      "name":  "NVIDIA RTX 5880 Ada Generation",
+      "memoryTotal":  "51527024640",
+      "cudaCores":  14080,
+      "architecture":  "Ada"
+    },
+    {
+      "name":  "NVIDIA RTX 5880 Ada Generation",
+      "memoryTotal":  "51527024640",
+      "cudaCores":  14080,
+      "architecture":  "Ada"
+    },
+    {
+      "name":  "NVIDIA RTX 5880 Ada Generation",
+      "memoryTotal":  "51527024640",
+      "cudaCores":  14080,
+      "architecture":  "Ada"
+    },
+    {
+      "name":  "NVIDIA RTX 5880 Ada Generation",
+      "memoryTotal":  "51527024640",
+      "cudaCores":  14080,
+      "architecture":  "Ada"
+    },
+    {
+      "name":  "NVIDIA RTX 5880 Ada Generation",
+      "memoryTotal":  "51527024640",
+      "cudaCores":  14080,
+      "architecture":  "Ada"
+    },
+    {
+      "name":  "NVIDIA RTX 5880 Ada Generation",
+      "memoryTotal":  "51527024640",
+      "cudaCores":  14080,
+      "architecture":  "Ada"
+    },
+    {
+      "name":  "NVIDIA RTX 5880 Ada Generation",
+      "memoryTotal":  "51527024640",
+      "cudaCores":  14080,
+      "architecture":  "Ada"
+    },
+    {
+      "name":  "NVIDIA RTX 5880 Ada Generation",
+      "memoryTotal":  "51527024640",
+      "cudaCores":  14080,
+      "architecture":  "Ada"
+    }
+  ],
+  "cudaVersion":  "12.9"
+}

wandb/run-20260124_105101-s3i4k862/logs/debug-core.log ADDED Viewed

	@@ -0,0 +1,12 @@

+{"time":"2026-01-24T10:51:00.7426538+08:00","level":"INFO","msg":"started logging, with flags","port-filename":"/tmp/tmpwxlv3x4b/port-694321.txt","pid":694321,"debug":false,"disable-analytics":false}
+{"time":"2026-01-24T10:51:00.742686139+08:00","level":"INFO","msg":"FeatureState","shutdownOnParentExitEnabled":false}
+{"time":"2026-01-24T10:51:00.743491711+08:00","level":"INFO","msg":"server is running","addr":{"IP":"127.0.0.1","Port":35647,"Zone":""}}
+{"time":"2026-01-24T10:51:00.743589092+08:00","level":"INFO","msg":"Will exit if parent process dies.","ppid":694321}
+{"time":"2026-01-24T10:51:00.933222669+08:00","level":"INFO","msg":"connection: ManageConnectionData: new connection created","id":"127.0.0.1:33952"}
+{"time":"2026-01-24T10:51:01.795205328+08:00","level":"INFO","msg":"handleInformInit: received","streamId":"s3i4k862","id":"127.0.0.1:33952"}
+{"time":"2026-01-24T10:51:01.911996105+08:00","level":"INFO","msg":"handleInformInit: stream started","streamId":"s3i4k862","id":"127.0.0.1:33952"}
+{"time":"2026-01-24T11:02:19.490190123+08:00","level":"INFO","msg":"handleInformTeardown: server teardown initiated","id":"127.0.0.1:33952"}
+{"time":"2026-01-24T11:02:19.490498292+08:00","level":"INFO","msg":"server is shutting down"}
+{"time":"2026-01-24T11:02:19.490490222+08:00","level":"INFO","msg":"connection: Close: initiating connection closure","id":"127.0.0.1:33952"}
+{"time":"2026-01-24T11:02:19.491527379+08:00","level":"INFO","msg":"connection: Close: connection successfully closed","id":"127.0.0.1:33952"}
+{"time":"2026-01-24T11:02:20.129116951+08:00","level":"INFO","msg":"Parent process exited, terminating service process."}

wandb/run-20260124_105101-s3i4k862/logs/debug-internal.log ADDED Viewed

	@@ -0,0 +1,14 @@

+{"time":"2026-01-24T10:51:01.795371252+08:00","level":"INFO","msg":"using version","core version":"0.18.5"}
+{"time":"2026-01-24T10:51:01.795385102+08:00","level":"INFO","msg":"created symlink","path":"/data1/zsj/SceneDPO/Rebuttal/E-GRPO/scoure_code/wandb/run-20260124_105101-s3i4k862/logs/debug-core.log"}
+{"time":"2026-01-24T10:51:01.911927136+08:00","level":"INFO","msg":"created new stream","id":"s3i4k862"}
+{"time":"2026-01-24T10:51:01.911986864+08:00","level":"INFO","msg":"stream: started","id":"s3i4k862"}
+{"time":"2026-01-24T10:51:01.912277115+08:00","level":"INFO","msg":"sender: started","stream_id":"s3i4k862"}
+{"time":"2026-01-24T10:51:01.912165824+08:00","level":"INFO","msg":"writer: Do: started","stream_id":{"value":"s3i4k862"}}
+{"time":"2026-01-24T10:51:01.912358876+08:00","level":"INFO","msg":"handler: started","stream_id":{"value":"s3i4k862"}}
+{"time":"2026-01-24T10:51:03.472265752+08:00","level":"INFO","msg":"Starting system monitor"}
+{"time":"2026-01-24T11:02:19.490516218+08:00","level":"INFO","msg":"stream: closing","id":"s3i4k862"}
+{"time":"2026-01-24T11:02:19.490615109+08:00","level":"INFO","msg":"Stopping system monitor"}
+{"time":"2026-01-24T11:02:19.492503467+08:00","level":"INFO","msg":"Stopped system monitor"}
+{"time":"2026-01-24T11:02:19.786591052+08:00","level":"WARN","msg":"No job ingredients found, not creating job artifact"}
+{"time":"2026-01-24T11:02:19.786627546+08:00","level":"WARN","msg":"No source type found, not creating job artifact"}
+{"time":"2026-01-24T11:02:19.786641103+08:00","level":"INFO","msg":"sender: sendDefer: no job artifact to save"}

wandb/run-20260124_105101-s3i4k862/logs/debug.log ADDED Viewed

	@@ -0,0 +1,27 @@

+2026-01-24 10:51:01,786 INFO    MainThread:694321 [wandb_setup.py:_flush():79] Current SDK version is 0.18.5
+2026-01-24 10:51:01,786 INFO    MainThread:694321 [wandb_setup.py:_flush():79] Configure stats pid to 694321
+2026-01-24 10:51:01,786 INFO    MainThread:694321 [wandb_setup.py:_flush():79] Loading settings from /home/zsj/.config/wandb/settings
+2026-01-24 10:51:01,786 INFO    MainThread:694321 [wandb_setup.py:_flush():79] Loading settings from /data1/zsj/SceneDPO/Rebuttal/E-GRPO/scoure_code/wandb/settings
+2026-01-24 10:51:01,786 INFO    MainThread:694321 [wandb_setup.py:_flush():79] Loading settings from environment variables: {}
+2026-01-24 10:51:01,786 INFO    MainThread:694321 [wandb_setup.py:_flush():79] Applying setup settings: {'mode': None, '_disable_service': None}
+2026-01-24 10:51:01,786 INFO    MainThread:694321 [wandb_setup.py:_flush():79] Inferring run settings from compute environment: {'program_relpath': 'fastvideo/train_g2rpo_sd_merge.py', 'program_abspath': '/data1/zsj/SceneDPO/Rebuttal/E-GRPO/scoure_code/fastvideo/train_g2rpo_sd_merge.py', 'program': '/data1/zsj/SceneDPO/Rebuttal/E-GRPO/scoure_code/fastvideo/train_g2rpo_sd_merge.py'}
+2026-01-24 10:51:01,786 INFO    MainThread:694321 [wandb_setup.py:_flush():79] Applying login settings: {}
+2026-01-24 10:51:01,786 INFO    MainThread:694321 [wandb_init.py:_log_setup():534] Logging user logs to /data1/zsj/SceneDPO/Rebuttal/E-GRPO/scoure_code/wandb/run-20260124_105101-s3i4k862/logs/debug.log
+2026-01-24 10:51:01,786 INFO    MainThread:694321 [wandb_init.py:_log_setup():535] Logging internal logs to /data1/zsj/SceneDPO/Rebuttal/E-GRPO/scoure_code/wandb/run-20260124_105101-s3i4k862/logs/debug-internal.log
+2026-01-24 10:51:01,786 INFO    MainThread:694321 [wandb_init.py:init():621] calling init triggers
+2026-01-24 10:51:01,786 INFO    MainThread:694321 [wandb_init.py:init():628] wandb.init called with sweep_config: {}
+config: {}
+2026-01-24 10:51:01,786 INFO    MainThread:694321 [wandb_init.py:init():671] starting backend
+2026-01-24 10:51:01,786 INFO    MainThread:694321 [wandb_init.py:init():675] sending inform_init request
+2026-01-24 10:51:01,788 INFO    MainThread:694321 [backend.py:_multiprocessing_setup():104] multiprocessing start_methods=fork,spawn,forkserver, using: spawn
+2026-01-24 10:51:01,789 INFO    MainThread:694321 [wandb_init.py:init():688] backend started and connected
+2026-01-24 10:51:01,791 INFO    MainThread:694321 [wandb_init.py:init():783] updated telemetry
+2026-01-24 10:51:01,792 INFO    MainThread:694321 [wandb_init.py:init():816] communicating run to backend with 90.0 second timeout
+2026-01-24 10:51:03,461 INFO    MainThread:694321 [wandb_init.py:init():867] starting run threads in backend
+2026-01-24 10:51:03,631 INFO    MainThread:694321 [wandb_run.py:_console_start():2463] atexit reg
+2026-01-24 10:51:03,631 INFO    MainThread:694321 [wandb_run.py:_redirect():2311] redirect: wrap_raw
+2026-01-24 10:51:03,631 INFO    MainThread:694321 [wandb_run.py:_redirect():2376] Wrapping output streams.
+2026-01-24 10:51:03,631 INFO    MainThread:694321 [wandb_run.py:_redirect():2401] Redirects installed.
+2026-01-24 10:51:03,632 INFO    MainThread:694321 [wandb_init.py:init():911] run started, returning control to user process
+2026-01-24 10:51:03,633 INFO    MainThread:694321 [wandb_run.py:_config_callback():1390] config_cb None None {'allow_tf32': True, 'logdir': 'logs', 'mixed_precision': 'bf16', 'num_checkpoint_limit': 5, 'num_epochs': 300, 'pretrained': {'model': './data/StableDiffusion', 'revision': 'main'}, 'prompt_fn': 'imagenet_animals', 'prompt_fn_kwargs': {}, 'resume_from': '', 'reward_fn': 'hpsv2', 'run_name': '2026.01.24_10.51.00', 'sample': {'batch_size': 1, 'eta': 1.0, 'guidance_scale': 5.0, 'num_batches_per_epoch': 2, 'num_steps': 50}, 'save_freq': 20, 'seed': 42, 'train': {'adam_beta1': 0.9, 'adam_beta2': 0.999, 'adam_epsilon': 1e-08, 'adam_weight_decay': 0.0001, 'adv_clip_max': 5, 'batch_size': 1, 'cfg': True, 'clip_range': 0.0001, 'gradient_accumulation_steps': 1, 'learning_rate': 1e-05, 'max_grad_norm': 1.0, 'num_inner_epochs': 1, 'timestep_fraction': 1.0, 'use_8bit_adam': False}, 'use_lora': False}
+2026-01-24 11:02:19,491 WARNING MsgRouterThr:694321 [router.py:message_loop():77] message_loop has been closed