suwesh
/

llamatron-1B-peft

Text Generation

text-generation-inference

Model card Files Files and versions

suwesh commited on Aug 22, 2025

Commit

de3c8a3

·

verified ·

1 Parent(s): 781d08b

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -87,7 +87,7 @@ model = PeftModel.from_pretrained(base_model, "suwesh/llamatron-1B-peft", subfol
 <pre>Checkpoint 11000 Training and Validation losses: 1.06 | 1.09</pre>
 # Evaluation details
-We use the [nvidia/Llama-3.1-Nemotron-Nano](https://huggingface.co/nvidia/Llama-3.1-Nemotron-Nano-8B-v1) LLM as a Judge for evaluating the responses between the base llama 3.2 1b instruct and out PEFT model. The following are the judge's preference for each prompt to the two models:
 <pre>base: 122
 peft: 388
 tie: 29

 <pre>Checkpoint 11000 Training and Validation losses: 1.06 | 1.09</pre>
 # Evaluation details
+We use the [nvidia/Llama-3.1-Nemotron-Nano](https://huggingface.co/nvidia/Llama-3.1-Nemotron-Nano-8B-v1) LLM as a Judge for evaluating the responses between the base llama 3.2 1b instruct and our PEFT model. The following are the judge's preference for each prompt to the two models, we also provide the ground truth in the prompt to the judge:
 <pre>base: 122
 peft: 388
 tie: 29