| 2025-11-18 07:33:04 - experiment_save_merged_model - INFO - Starting merged model save process |
| 2025-11-18 07:33:04 - experiment_save_merged_model - INFO - Arguments: {'lambdas_path': '/work/gj26/b20042/LLM-AdaMerge/outputs/deepseek-7b/k-fold/task-wise/math/cross_entropy-ep3-10%dataset-lambda05/llm_adamerge_lambdas.json', 'model_config': '/work/gj26/b20042/LLM-AdaMerge/outputs/deepseek-7b/k-fold/task-wise/math/cross_entropy-ep3-10%dataset-lambda05/model_config.yaml', 'output_dir': '/work/gj26/b20042/LLM-AdaMerge/mergekit/outputs/deepseek-7b/k-fold/task-wise/math/cross_entropy-ep3-10%dataset/lambda05', 'model_name': 'merged-model', 'push_to_hub': False, 'hub_repo_id': 'lejelly/ds7b-ep3-data10-ood-math-taskwise-lambda05', 'private': False, 'device': 'cuda', 'debug': False} |
| 2025-11-18 07:33:04 - experiment_save_merged_model - INFO - Loading lambdas from /work/gj26/b20042/LLM-AdaMerge/outputs/deepseek-7b/k-fold/task-wise/math/cross_entropy-ep3-10%dataset-lambda05/llm_adamerge_lambdas.json |
| 2025-11-18 07:33:04 - experiment_save_merged_model - INFO - Auto-detected parameter-wise merge from JSON structure |
| 2025-11-18 07:33:04 - experiment_save_merged_model - INFO - Merge type: parameter_wise |
| 2025-11-18 07:33:04 - experiment_save_merged_model - INFO - [Initial] Memory Usage: |
| 2025-11-18 07:33:04 - experiment_save_merged_model - INFO - Process: 0.39 GB (0.2%) |
| 2025-11-18 07:33:04 - experiment_save_merged_model - INFO - System: 9.44 GB / 212.49 GB (9.1%) |
| 2025-11-18 07:33:04 - experiment_save_merged_model - INFO - Available: 193.24 GB |
| 2025-11-18 07:33:04 - experiment_save_merged_model - INFO - GPU 0: Allocated: 0.00 GB, Reserved: 0.00 GB, Total: 94.50 GB |
| 2025-11-18 07:33:04 - experiment_save_merged_model - INFO - Loading models |
| 2025-11-18 07:33:14 - experiment_save_merged_model - INFO - [After loading models] Memory Usage: |
| 2025-11-18 07:33:14 - experiment_save_merged_model - INFO - Process: 0.63 GB (0.3%) |
| 2025-11-18 07:33:14 - experiment_save_merged_model - INFO - System: 51.05 GB / 212.49 GB (32.0%) |
| 2025-11-18 07:33:14 - experiment_save_merged_model - INFO - Available: 144.58 GB |
| 2025-11-18 07:33:14 - experiment_save_merged_model - INFO - GPU 0: Allocated: 38.61 GB, Reserved: 40.64 GB, Total: 94.50 GB |
| 2025-11-18 07:33:14 - experiment_save_merged_model - INFO - Initializing parameter_wise AdaMerge |
| 2025-11-18 07:37:40 - experiment_save_merged_model - INFO - Loading learned lambdas |
| 2025-11-18 07:37:40 - experiment_save_merged_model - INFO - Deleting original models to free memory (task vectors already computed) |
| 2025-11-18 07:37:40 - experiment_save_merged_model - INFO - [Before deleting models] Memory Usage: |
| 2025-11-18 07:37:40 - experiment_save_merged_model - INFO - Process: 39.07 GB (18.4%) |
| 2025-11-18 07:37:40 - experiment_save_merged_model - INFO - System: 106.84 GB / 212.49 GB (58.2%) |
| 2025-11-18 07:37:40 - experiment_save_merged_model - INFO - Available: 88.79 GB |
| 2025-11-18 07:37:40 - experiment_save_merged_model - INFO - GPU 0: Allocated: 38.61 GB, Reserved: 40.64 GB, Total: 94.50 GB |
| 2025-11-18 07:37:40 - experiment_save_merged_model - INFO - Clearing model_loader references |
| 2025-11-18 07:37:40 - experiment_save_merged_model - INFO - Deleting model variables |
| 2025-11-18 07:37:40 - experiment_save_merged_model - INFO - Running garbage collection |
| 2025-11-18 07:37:40 - experiment_save_merged_model - INFO - [After deleting models and GC] Memory Usage: |
| 2025-11-18 07:37:40 - experiment_save_merged_model - INFO - Process: 39.07 GB (18.4%) |
| 2025-11-18 07:37:40 - experiment_save_merged_model - INFO - System: 66.17 GB / 212.49 GB (39.1%) |
| 2025-11-18 07:37:40 - experiment_save_merged_model - INFO - Available: 129.47 GB |
| 2025-11-18 07:37:40 - experiment_save_merged_model - INFO - GPU 0: Allocated: 0.00 GB, Reserved: 0.00 GB, Total: 94.50 GB |
| 2025-11-18 07:37:40 - experiment_save_merged_model - INFO - [After loading lambdas] Memory Usage: |
| 2025-11-18 07:37:40 - experiment_save_merged_model - INFO - Process: 39.07 GB (18.4%) |
| 2025-11-18 07:37:40 - experiment_save_merged_model - INFO - System: 66.17 GB / 212.49 GB (39.1%) |
| 2025-11-18 07:37:40 - experiment_save_merged_model - INFO - Available: 129.47 GB |
| 2025-11-18 07:37:40 - experiment_save_merged_model - INFO - GPU 0: Allocated: 0.00 GB, Reserved: 0.00 GB, Total: 94.50 GB |
| 2025-11-18 07:37:40 - experiment_save_merged_model - INFO - Creating merged model with learned lambdas |
| 2025-11-18 07:37:40 - experiment_save_merged_model - INFO - Using merge_models_for_save() |
| 2025-11-18 07:39:38 - experiment_save_merged_model - INFO - [After merging models] Memory Usage: |
| 2025-11-18 07:39:38 - experiment_save_merged_model - INFO - Process: 39.07 GB (18.4%) |
| 2025-11-18 07:39:38 - experiment_save_merged_model - INFO - System: 102.96 GB / 212.49 GB (54.8%) |
| 2025-11-18 07:39:38 - experiment_save_merged_model - INFO - Available: 95.96 GB |
| 2025-11-18 07:39:38 - experiment_save_merged_model - INFO - GPU 0: Allocated: 12.87 GB, Reserved: 53.44 GB, Total: 94.50 GB |
| 2025-11-18 07:39:38 - experiment_save_merged_model - INFO - Freeing memory from AdaMerge object (task vectors and base params no longer needed) |
| 2025-11-18 07:39:38 - experiment_save_merged_model - INFO - Deleting task vectors |
| 2025-11-18 07:39:38 - experiment_save_merged_model - INFO - Deleting base params |
| 2025-11-18 07:39:38 - experiment_save_merged_model - INFO - Deleting functional model |
| 2025-11-18 07:39:38 - experiment_save_merged_model - INFO - [After freeing AdaMerge memory] Memory Usage: |
| 2025-11-18 07:39:38 - experiment_save_merged_model - INFO - Process: 0.42 GB (0.2%) |
| 2025-11-18 07:39:38 - experiment_save_merged_model - INFO - System: 23.10 GB / 212.49 GB (17.3%) |
| 2025-11-18 07:39:38 - experiment_save_merged_model - INFO - Available: 175.82 GB |
| 2025-11-18 07:39:38 - experiment_save_merged_model - INFO - GPU 0: Allocated: 12.87 GB, Reserved: 13.05 GB, Total: 94.50 GB |
| 2025-11-18 07:39:38 - experiment_save_merged_model - INFO - Saving merged model to /work/gj26/b20042/LLM-AdaMerge/mergekit/outputs/deepseek-7b/k-fold/task-wise/math/cross_entropy-ep3-10%dataset/lambda05 |
| 2025-11-18 07:39:38 - experiment_save_merged_model - INFO - Moving merged model to CPU for saving |
| 2025-11-18 07:40:30 - experiment_save_merged_model - INFO - Successfully saved 3 safetensors files: |
| 2025-11-18 07:40:30 - experiment_save_merged_model - INFO - - model-00003-of-00003.safetensors (3674.14 MB) |
| 2025-11-18 07:40:30 - experiment_save_merged_model - INFO - - model-00002-of-00003.safetensors (4750.20 MB) |
| 2025-11-18 07:40:30 - experiment_save_merged_model - INFO - - model-00001-of-00003.safetensors (4756.17 MB) |
| 2025-11-18 07:40:30 - experiment_save_merged_model - INFO - [After saving model] Memory Usage: |
| 2025-11-18 07:40:30 - experiment_save_merged_model - INFO - Process: 13.31 GB (6.3%) |
| 2025-11-18 07:40:30 - experiment_save_merged_model - INFO - System: 23.18 GB / 212.49 GB (18.8%) |
| 2025-11-18 07:40:30 - experiment_save_merged_model - INFO - Available: 172.46 GB |
| 2025-11-18 07:40:30 - experiment_save_merged_model - INFO - GPU 0: Allocated: 0.00 GB, Reserved: 0.00 GB, Total: 94.50 GB |
| 2025-11-18 07:40:30 - experiment_save_merged_model - INFO - Saving tokenizer |
| 2025-11-18 07:40:30 - experiment_save_merged_model - INFO - Copied lambdas file to /work/gj26/b20042/LLM-AdaMerge/mergekit/outputs/deepseek-7b/k-fold/task-wise/math/cross_entropy-ep3-10%dataset/lambda05/learned_lambdas.json |
| 2025-11-18 07:40:30 - experiment_save_merged_model - INFO - Creating model card |
| 2025-11-18 07:40:30 - experiment_save_merged_model - INFO - Cleaning up models |
| 2025-11-18 07:40:31 - experiment_save_merged_model - INFO - [After cleanup] Memory Usage: |
| 2025-11-18 07:40:31 - experiment_save_merged_model - INFO - Process: 13.31 GB (6.3%) |
| 2025-11-18 07:40:31 - experiment_save_merged_model - INFO - System: 23.12 GB / 212.49 GB (18.8%) |
| 2025-11-18 07:40:31 - experiment_save_merged_model - INFO - Available: 172.51 GB |
| 2025-11-18 07:40:31 - experiment_save_merged_model - INFO - GPU 0: Allocated: 0.00 GB, Reserved: 0.00 GB, Total: 94.50 GB |
| 2025-11-18 07:40:31 - experiment_save_merged_model - INFO - Model saved successfully to /work/gj26/b20042/LLM-AdaMerge/mergekit/outputs/deepseek-7b/k-fold/task-wise/math/cross_entropy-ep3-10%dataset/lambda05 |