causal-rewards/sycophancy_dpo_llama3.1_8b_ultrachat200k_iter1_new Viewer • Updated Sep 21, 2025 • 847 • 5
causal-rewards/ultrafeedback_60658_pref_dataset_original_plus_filtered_improved_degraded_attimp_threshold0p2 Viewer • Updated Jul 3, 2025 • 920k • 2
GraPE: A Generate-Plan-Edit Framework for Compositional T2I Synthesis Paper • 2412.06089 • Published Dec 8, 2024 • 4
IndicGenBench: A Multilingual Benchmark to Evaluate Generation Capabilities of LLMs on Indic Languages Paper • 2404.16816 • Published Apr 25, 2024 • 3