MW_litevla-ms / wandb /debug.log
ducido's picture
Initial commit
b1ce9da verified
2026-02-09 08:44:51,657 INFO MainThread:138391 [wandb_setup.py:_flush():81] Current SDK version is 0.24.2
2026-02-09 08:44:51,657 INFO MainThread:138391 [wandb_setup.py:_flush():81] Configure stats pid to 138391
2026-02-09 08:44:51,657 INFO MainThread:138391 [wandb_setup.py:_flush():81] Loading settings from environment variables
2026-02-09 08:44:51,657 INFO MainThread:138391 [wandb_init.py:setup_run_log_directory():717] Logging user logs to outputs/train/2026-02-09/08-44-30_MW_100%_scratch_litevla-ms_lastlayer/wandb/run-20260209_084451-csy2m2pr/logs/debug.log
2026-02-09 08:44:51,657 INFO MainThread:138391 [wandb_init.py:setup_run_log_directory():718] Logging internal logs to outputs/train/2026-02-09/08-44-30_MW_100%_scratch_litevla-ms_lastlayer/wandb/run-20260209_084451-csy2m2pr/logs/debug-internal.log
2026-02-09 08:44:51,657 INFO MainThread:138391 [wandb_init.py:init():844] calling init triggers
2026-02-09 08:44:51,658 INFO MainThread:138391 [wandb_init.py:init():849] wandb.init called with sweep_config: {}
config: {'dataset': {'repo_id': '.', 'root': '/pfss/mlde/workspaces/mlde_wsp_IAS_SAMMerge/VLA/duc/VLA-Humanoid-MW/metaworld_mt50', 'episodes': None, 'image_transforms': {'enable': True, 'max_num_transforms': 3, 'random_order': False, 'image_tfs': {'hue': {'weight': 1.0, 'type': 'ColorJitter', 'kwargs': {'hue': [-0.05, 0.05]}}, 'contrast': {'weight': 1.0, 'type': 'ColorJitter', 'kwargs': {'contrast': [0.8, 1.2]}}, 'sharpness': {'weight': 1.0, 'type': 'SharpnessJitter', 'kwargs': {'sharpness': [0.5, 1.5]}}, 'brightness': {'weight': 1.0, 'type': 'ColorJitter', 'kwargs': {'brightness': [0.8, 1.2]}}, 'saturation': {'weight': 1.0, 'type': 'ColorJitter', 'kwargs': {'saturation': [0.5, 1.5]}}, 'crop_resize': {'weight': 1.0, 'type': 'RandomResizedCrop', 'kwargs': {'size': [256, 256], 'ratio': [1, 1], 'scale': [0.9, 0.95]}}, 'rotate': {'weight': 1.0, 'type': 'RandomRotate', 'kwargs': {'degrees': [-5, 5]}}}, 'wrist_tfs': {'hue': {'weight': 1.0, 'type': 'ColorJitter', 'kwargs': {'hue': [-0.05, 0.05]}}, 'contrast': {'weight': 1.0, 'type': 'ColorJitter', 'kwargs': {'contrast': [0.8, 1.2]}}, 'sharpness': {'weight': 1.0, 'type': 'SharpnessJitter', 'kwargs': {'sharpness': [0.5, 1.5]}}, 'brightness': {'weight': 1.0, 'type': 'ColorJitter', 'kwargs': {'brightness': [0.8, 1.2]}}, 'saturation': {'weight': 1.0, 'type': 'ColorJitter', 'kwargs': {'saturation': [0.5, 1.5]}}}}, 'revision': None, 'use_imagenet_stats': True, 'video_backend': 'torchcodec', 'vqa_data_path': None}, 'env': None, 'policy': {'type': 'litevla-ms', 'n_obs_steps': 1, 'normalization_mapping': {'VISUAL': <NormalizationMode.IDENTITY: 'IDENTITY'>, 'STATE': <NormalizationMode.MEAN_STD: 'MEAN_STD'>, 'ACTION': <NormalizationMode.MEAN_STD: 'MEAN_STD'>}, 'input_features': {}, 'output_features': {}, 'device': 'cuda', 'use_amp': False, 'gradient_accumulation_steps': 1, 'chunk_size': 50, 'n_action_steps': 1, 'max_state_dim': 32, 'max_action_dim': 32, 'resize_imgs_with_padding': [512, 512], 'empty_cameras': 0, 'adapt_to_pi_aloha': False, 'use_delta_joint_actions_aloha': False, 'tokenizer_max_length': 48, 'num_steps': 10, 'use_cache': True, 'freeze_vision_encoder': True, 'train_expert_only': False, 'train_state_proj': True, 'optimizer_lr': 0.0001, 'optimizer_betas': [0.9, 0.95], 'optimizer_eps': 1e-08, 'optimizer_weight_decay': 1e-10, 'optimizer_grad_clip_norm': 10, 'scheduler_warmup_steps': 1000, 'scheduler_decay_steps': 100000, 'scheduler_decay_lr': 2.5e-06, 'vlm_model_name': '/pfss/mlde/workspaces/mlde_wsp_IAS_SAMMerge/VLA/duc/VLA-Humanoid-MW/SmolVLM2-500M-Video-Instruct', 'load_vlm_weights': True, 'add_image_special_tokens': False, 'attention_mode': 'cross_attn', 'prefix_length': 0, 'pad_language_to': 'max_length', 'num_expert_layers': 0, 'num_vlm_layers': 16, 'self_attn_every_n_layers': 2, 'expert_width_multiplier': 0.75, 'min_period': 0.004, 'max_period': 4.0, 'of_path': '/pfss/mlde/workspaces/mlde_wsp_IAS_SAMMerge/VLA/duc/VLA-Humanoid-MW/ori_mw_if/ori_mw_100%_of.h5'}, 'output_dir': 'outputs/train/2026-02-09/08-44-30_MW_100%_scratch_litevla-ms_lastlayer', 'job_name': 'MW_100%_scratch_litevla-ms_lastlayer', 'resume': False, 'seed': 42, 'num_workers': 8, 'batch_size': 64, 'steps': 100000, 'eval_freq': 20000, 'log_freq': 200, 'save_checkpoint': True, 'save_freq': 10000, 'use_policy_training_preset': True, 'optimizer': {'type': 'adamw', 'lr': 0.0001, 'weight_decay': 1e-10, 'grad_clip_norm': 10, 'betas': [0.9, 0.95], 'eps': 1e-08}, 'scheduler': {'type': 'cosine_decay_with_warmup', 'num_warmup_steps': 1000, 'num_decay_steps': 100000, 'peak_lr': 0.0001, 'decay_lr': 2.5e-06}, 'eval': {'n_episodes': 50, 'batch_size': 50, 'use_async_envs': False}, 'wandb': {'enable': True, 'disable_artifact': True, 'project': 'LiteVLA-MS', 'entity': 'Robotics_VLA', 'notes': None, 'run_id': None, 'mode': 'online'}, '_wandb': {}}
2026-02-09 08:44:51,658 INFO MainThread:138391 [wandb_init.py:init():892] starting backend
2026-02-09 08:44:51,963 INFO MainThread:138391 [wandb_init.py:init():895] sending inform_init request
2026-02-09 08:44:51,976 INFO MainThread:138391 [wandb_init.py:init():903] backend started and connected
2026-02-09 08:44:51,978 INFO MainThread:138391 [wandb_init.py:init():973] updated telemetry
2026-02-09 08:44:51,985 INFO MainThread:138391 [wandb_init.py:init():997] communicating run to backend with 90.0 second timeout
2026-02-09 08:44:52,532 INFO MainThread:138391 [wandb_init.py:init():1042] starting run threads in backend
2026-02-09 08:44:52,674 INFO MainThread:138391 [wandb_run.py:_console_start():2529] atexit reg
2026-02-09 08:44:52,674 INFO MainThread:138391 [wandb_run.py:_redirect():2377] redirect: wrap_raw
2026-02-09 08:44:52,674 INFO MainThread:138391 [wandb_run.py:_redirect():2446] Wrapping output streams.
2026-02-09 08:44:52,674 INFO MainThread:138391 [wandb_run.py:_redirect():2469] Redirects installed.
2026-02-09 08:44:52,685 INFO MainThread:138391 [wandb_init.py:init():1082] run started, returning control to user process
2026-02-10 03:09:33,944 INFO wandb-AsyncioManager-main:138391 [service_client.py:_forward_responses():94] Reached EOF.
2026-02-10 03:09:33,945 INFO wandb-AsyncioManager-main:138391 [mailbox.py:close():154] Closing mailbox, abandoning 1 handles.