A collection of preference datasets used for training and evaluation of code reward models.
Themis
community
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
View all PapersA collection of preference model pretraining checkpoints trained on general preference datasets intended as precursors for code reward models.
-
project-themis/Themis-RM-0.6B-PMP
Text Classification β’ 0.6B β’ Updated β’ 2 -
project-themis/Themis-RM-1.7B-PMP
Text Classification β’ 2B β’ Updated β’ 3 -
project-themis/Themis-RM-4B-PMP
Text Classification β’ 4B β’ Updated -
project-themis/Themis-RM-8B-PMP
Text Classification β’ 8B β’ Updated β’ 12
A collection of strong code reward models trained on a diverse collection of code preferences.
-
project-themis/Themis-RM-0.6B
Text Classification β’ 0.6B β’ Updated β’ 194 -
project-themis/Themis-RM-1.7B
Text Classification β’ 2B β’ Updated β’ 51 -
project-themis/Themis-RM-4B
Text Classification β’ 4B β’ Updated β’ 51 -
project-themis/Themis-RM-8B
Text Classification β’ 8B β’ Updated β’ 138
A collection of preference datasets used for training and evaluation of code reward models.
A collection of strong code reward models trained on a diverse collection of code preferences.
-
project-themis/Themis-RM-0.6B
Text Classification β’ 0.6B β’ Updated β’ 194 -
project-themis/Themis-RM-1.7B
Text Classification β’ 2B β’ Updated β’ 51 -
project-themis/Themis-RM-4B
Text Classification β’ 4B β’ Updated β’ 51 -
project-themis/Themis-RM-8B
Text Classification β’ 8B β’ Updated β’ 138
A collection of preference model pretraining checkpoints trained on general preference datasets intended as precursors for code reward models.
-
project-themis/Themis-RM-0.6B-PMP
Text Classification β’ 0.6B β’ Updated β’ 2 -
project-themis/Themis-RM-1.7B-PMP
Text Classification β’ 2B β’ Updated β’ 3 -
project-themis/Themis-RM-4B-PMP
Text Classification β’ 4B β’ Updated -
project-themis/Themis-RM-8B-PMP
Text Classification β’ 8B β’ Updated β’ 12