• Updated • 3.83k • 24.1k • 13
trl-lib/trl-download-stats
• Updated • 2.3k • 67
trl-lib/documentation-images
• Updated • 11 • 58.5k
• Updated • 103k • 3.01k • 11
trl-lib/llava-instruct-mix
• Updated • 228k • 1.33k • 4
trl-lib/OpenMathReasoning
• Updated • 3.2M • 929
trl-lib/chatbot_arena_completions
• Updated • 33k • 107 • 1
• Updated • 83.1k • 153 • 3
trl-lib/ultrafeedback-gpt-3.5-turbo-helpfulness
• Updated • 16.6k • 87 • 4
trl-lib/ultrafeedback-prompt
• Updated • 39.8k • 104 • 9
• Updated • 179k • 205 • 3
• Updated • 130k • 3.39k • 31
• Updated • 41.2k • 404 • 4
• Updated • 445k • 7.69k • 12
trl-lib/lm-human-preferences-sentiment
• Updated • 6.26k • 67
trl-lib/lm-human-preferences-descriptiveness
• Updated • 6.26k • 46 • 1
trl-lib/hh-rlhf-helpful-base
• Updated • 46.2k • 65 • 3
• Updated • 51.8k • 14
trl-lib/Capybara-Preferences
• Updated • 15.4k • 26
• Updated • 16k • 4.61k • 23
trl-lib/ultrafeedback_binarized
• Updated • 63.1k • 7.81k • 26
trl-lib/capybara-preferencces-7k
• Updated • 7.56k • 12
• Updated • 15k • 203 • 9
trl-lib/ultrachat_200k_chatml
• Updated • 231k • 28 • 3