davmel committed 23 days ago

Commit f816d7b (verified) · 1 parent: 2a47179

Update README.md

Files changed (1): README.md

@@ -1,3 +1,10 @@
+---
+license: mit
+title: ActiveUltraFeedback
+short_description: Sample-Efficient RLHF Preference data generation
+papers: 2603.09692
+---
 # Active UltraFeedback
 **Active UltraFeedback** is a scalable pipeline for generating high-quality preference datasets to align large language models (LLMs). We leverage **uncertainty quantification** and **active learning** to annotate only the most informative samples, drastically reducing costs while beating standard baselines.