Reinforcement Learning Human Feedback 🔥 Collecting human preferences for RL model training.
ahirtonlopes/layoutlmv2-base-uncased_finetuned_docvqa Document Question Answering • 0.2B • Updated Mar 23 • 4
ahirtonlopes/distilbert-base-uncased-finetuned-squad Question Answering • 66.4M • Updated Nov 9, 2023 • 6
ahirtonlopes/swin-tiny-patch4-window7-224-finetuned-cifar10 Image Classification • Updated Oct 5, 2023 • 11