Spaces:

ActiveUltraFeedback
/

README

No application file

davmel commited on Mar 11

Commit

1544dd4

verified ·

1 Parent(s): c969fb9

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -12,7 +12,7 @@ colorTo: green
 sdk_version: 6.9.0
 ---
-# Active UltraFeedback
 **Active UltraFeedback** is a scalable pipeline for generating high-quality preference datasets to align large language models (LLMs). We leverage **uncertainty quantification** and **active learning** to annotate only the most informative samples, drastically reducing costs while beating standard baselines.

 sdk_version: 6.9.0
 ---
+# This repo accompanies the paper: [ActiveUltraFeedback — arXiv:2603.09692](https://arxiv.org/abs/2603.09692).
 **Active UltraFeedback** is a scalable pipeline for generating high-quality preference datasets to align large language models (LLMs). We leverage **uncertainty quantification** and **active learning** to annotate only the most informative samples, drastically reducing costs while beating standard baselines.