Viewer
•
Updated
•
1.23k
•
16.1k
trl-lib/documentation-images
Viewer
•
Updated
•
9
•
59.8k
Viewer
•
Updated
•
103k
•
4.42k
•
1
trl-lib/llava-instruct-mix
Viewer
•
Updated
•
228k
•
1.11k
•
2
trl-lib/OpenMathReasoning
Viewer
•
Updated
•
3.2M
•
372
trl-lib/chatbot_arena_completions
Viewer
•
Updated
•
33k
•
273
•
1
Viewer
•
Updated
•
83.1k
•
278
•
3
trl-lib/ultrafeedback-gpt-3.5-turbo-helpfulness
Viewer
•
Updated
•
16.6k
•
109
•
3
trl-lib/ultrafeedback-prompt
Viewer
•
Updated
•
39.8k
•
335
•
9
Viewer
•
Updated
•
179k
•
723
•
3
Viewer
•
Updated
•
130k
•
2.72k
•
31
Viewer
•
Updated
•
41.2k
•
142
•
2
Viewer
•
Updated
•
445k
•
2.15k
•
10
trl-lib/lm-human-preferences-sentiment
Viewer
•
Updated
•
6.26k
•
1.1k
trl-lib/lm-human-preferences-descriptiveness
Viewer
•
Updated
•
6.26k
•
45
•
1
trl-lib/hh-rlhf-helpful-base
Viewer
•
Updated
•
46.2k
•
2.04k
•
3
Viewer
•
Updated
•
51.8k
•
10
trl-lib/Capybara-Preferences
Viewer
•
Updated
•
15.4k
•
19
Viewer
•
Updated
•
16k
•
3.47k
•
17
trl-lib/ultrafeedback_binarized
Viewer
•
Updated
•
63.1k
•
5.77k
•
21
trl-lib/capybara-preferencces-7k
Viewer
•
Updated
•
7.56k
•
22
Viewer
•
Updated
•
15k
•
99
•
9
trl-lib/ultrachat_200k_chatml
Viewer
•
Updated
•
231k
•
37
•
3