marianeft
/

MedQuAD

marianeft commited on Feb 24, 2025

Commit

030c7b9

verified ·

1 Parent(s): 230bb4c

Update fine_tune_gpt2_medquad.py

Files changed (1) hide show

fine_tune_gpt2_medquad.py CHANGED Viewed

@@ -1,13 +1,8 @@
-It looks like the `marianeft/MedQuAD` model you're trying to load does not have the required files (`pytorch_model.bin`, `model.safetensors`, `tf_model.h5`, `model.ckpt` or `flax_model.msgpack`). To address this, let's use a different approach.
-Instead of saving the fine-tuned model to the `marianeft/MedQuAD` directory, let's save it to a new directory. Here’s the modified code to handle this issue:
-```python
 import pandas as pd
 from transformers import GPT2LMHeadModel, GPT2Tokenizer, Trainer, TrainingArguments
 from datasets import load_dataset
-# Load MedQuAD dataset properly
 dataset = load_dataset("marianeft/MedQuAD", split="train")
 # Load the GPT-2 model and tokenizer
@@ -49,9 +44,4 @@ trainer.train()
 # Save the model to a new directory
 model.save_pretrained("fine_tuned_medquad")
-tokenizer.save_pretrained("fine_tuned_medquad")
-```
-This code will save your fine-tuned model and tokenizer to the "fine_tuned_medquad" directory instead of "marianeft/MedQuAD". This way, you avoid trying to save the model to the original model directory which caused the issue.
-Give this a try and let me know if it works for you!

 import pandas as pd
 from transformers import GPT2LMHeadModel, GPT2Tokenizer, Trainer, TrainingArguments
 from datasets import load_dataset
+# Load MedQuAD dataset
 dataset = load_dataset("marianeft/MedQuAD", split="train")
 # Load the GPT-2 model and tokenizer
 # Save the model to a new directory
 model.save_pretrained("fine_tuned_medquad")
+tokenizer.save_pretrained("fine_tuned_medquad")