## Training (Editing) Details

### Data

We use the [HH-Golden dataset](https://huggingface.co/datasets/nz/anthropic-hh-golden-rlhf), which manually improves the quality of noisy samples in the HH-RLHF dataset.

- Data format: (toxic, non-toxic) sentence pairs.
- Sample size: 500 pairs for ProFS editing (compared to 2,000 pairs used for DPO fine-tuning).
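The pairing above can be sketched in code. This is a minimal illustration, not the repository's actual preprocessing: the column names (`chosen`, `rejected`) follow the usual HH-RLHF preference schema and the toy record is invented; in practice the rows would come from loading the dataset linked above.

```python
# Sketch: turning HH-style preference rows into (toxic, non-toxic) pairs.
# ASSUMPTION: rows carry "chosen"/"rejected" fields as in HH-RLHF; the
# record below is a made-up stand-in for real dataset rows.
records = [
    {
        "chosen": "\n\nHuman: How do I greet someone?\n\nAssistant: A friendly hello works well.",
        "rejected": "\n\nHuman: How do I greet someone?\n\nAssistant: Just ignore them.",
    },
]

def to_edit_pairs(rows, n_pairs=500):
    """Map preference rows to (toxic, non-toxic) pairs: the rejected
    response is treated as the toxic side, the chosen one as non-toxic.
    Only the first n_pairs rows are kept (500 pairs for ProFS editing)."""
    return [(row["rejected"], row["chosen"]) for row in rows[:n_pairs]]

pairs = to_edit_pairs(records)
toxic, non_toxic = pairs[0]
```

With real data, swapping `records` for the loaded dataset split would yield the 500 editing pairs described above.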