Spaces:

parth-1
/

MetaGuard-Train

Runtime error

parth-1 commited on Apr 26

Commit

6557131

verified ·

1 Parent(s): 08bdaa0

Update grpo_train.py

Files changed (1) hide show

grpo_train.py CHANGED Viewed

@@ -362,7 +362,7 @@ model, tokenizer = FastLanguageModel.from_pretrained(
     model_name="unsloth/Llama-3.1-8B-Instruct",
     load_in_4bit=USE_4BIT,
     max_seq_length=2048,
-    dtype=None,
 )
 model = FastLanguageModel.get_peft_model(

     model_name="unsloth/Llama-3.1-8B-Instruct",
     load_in_4bit=USE_4BIT,
     max_seq_length=2048,
+    dtype=torch.float16 if USE_4BIT else None,
 )
 model = FastLanguageModel.get_peft_model(