Viewer
• Updated
• 3.83k • 19.1k
trl-lib/documentation-images
Viewer
• Updated
• 11 • 53.7k
Viewer
• Updated
• 103k • 4.12k
• 8
trl-lib/llava-instruct-mix
Viewer
• Updated
• 228k • 1.17k
• 3
trl-lib/OpenMathReasoning
Viewer
• Updated
• 3.2M • 583
trl-lib/chatbot_arena_completions
Viewer
• Updated
• 33k • 192
• 1
Viewer
• Updated
• 83.1k • 121
• 3
trl-lib/ultrafeedback-gpt-3.5-turbo-helpfulness
Viewer
• Updated
• 16.6k • 66
• 4
trl-lib/ultrafeedback-prompt
Viewer
• Updated
• 39.8k • 230
• 9
Viewer
• Updated
• 179k • 163
• 3
Viewer
• Updated
• 130k • 1.9k
• 30
Viewer
• Updated
• 41.2k • 328
• 2
Viewer
• Updated
• 445k • 3.45k
• 12
trl-lib/lm-human-preferences-sentiment
Viewer
• Updated
• 6.26k • 15
trl-lib/lm-human-preferences-descriptiveness
Viewer
• Updated
• 6.26k • 17
• 1
trl-lib/hh-rlhf-helpful-base
Viewer
• Updated
• 46.2k • 217
• 3
Viewer
• Updated
• 51.8k • 8
trl-lib/Capybara-Preferences
Viewer
• Updated
• 15.4k • 20
Viewer
• Updated
• 16k • 4.58k
• 17
trl-lib/ultrafeedback_binarized
Viewer
• Updated
• 63.1k • 3.4k
• 23
trl-lib/capybara-preferencces-7k
Viewer
• Updated
• 7.56k • 11
Viewer
• Updated
• 15k • 473
• 9
trl-lib/ultrachat_200k_chatml
Viewer
• Updated
• 231k • 30
• 3