runtime error
Exit code: 1. Reason: atch ckpt: torch.Size([256]) vs model:torch.Size([768]) transformer.wte.weight | MISMATCH | Reinit due to size mismatch ckpt: torch.Size([50257, 256]) vs model:torch.Size([50257, 768]) transformer.h.{0, 1, 2, 3}.attn.c_attn.bias | MISMATCH | Reinit due to size mismatch ckpt: torch.Size([768]) vs model:torch.Size([2304]) transformer.ln_f.bias | MISMATCH | Reinit due to size mismatch ckpt: torch.Size([256]) vs model:torch.Size([768]) transformer.ln_f.weight | MISMATCH | Reinit due to size mismatch ckpt: torch.Size([256]) vs model:torch.Size([768]) Notes: - MISSING :those params were newly initialized because missing from the checkpoint. Consider training on your downstream task. - MISMATCH :ckpt weights were loaded, but they did not match the original empty weight shapes. Traceback (most recent call last): File "/app/app.py", line 6, in <module> model = GPT2LMHeadModel.from_pretrained("burman-ai/gpt2-chatbot") File "/usr/local/lib/python3.13/site-packages/transformers/modeling_utils.py", line 4110, in from_pretrained load_info = cls._finalize_load_state_dict(model, load_config, load_info) File "/usr/local/lib/python3.13/site-packages/transformers/modeling_utils.py", line 4284, in _finalize_load_state_dict log_state_dict_report( ~~~~~~~~~~~~~~~~~~~~~^ model=model, ^^^^^^^^^^^^ ...<7 lines>... conversion_errors=load_info.conversion_errors, ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ ) ^ File "/usr/local/lib/python3.13/site-packages/transformers/utils/loading_report.py", line 249, in log_state_dict_report raise RuntimeError( "You set `ignore_mismatched_sizes` to `False`, thus raising an error. For details look at the above report!" ) RuntimeError: You set `ignore_mismatched_sizes` to `False`, thus raising an error. For details look at the above report!
Container logs:
Fetching error logs...