GitBag/llama3-ultrafeedback-reasoning-iter_3-1731243878-armo-tokenized_harvard Viewer • Updated Nov 13, 2024 • 57.2k • 7
GitBag/llama3-ultrafeedback-reasoning-iter_3-1731243878-armo-tokenized Viewer • Updated Nov 13, 2024 • 57.2k • 7
GitBag/llama3-ultrafeedback-reasoning-iter_3-1731243878-armo Viewer • Updated Nov 13, 2024 • 60.8k • 7
GitBag/llama3-ultrafeedback-reasoning-iter_2-1731046941-armo-tokenized_harvard Viewer • Updated Nov 11, 2024 • 57.2k • 6
GitBag/llama3-ultrafeedback-reasoning-iter_2-1731046941-armo-tokenized Viewer • Updated Nov 11, 2024 • 57.2k • 7
GitBag/llama3-ultrafeedback-reasoning-iter_2-1731046941-armo Viewer • Updated Nov 11, 2024 • 60.8k • 7
GitBag/llama3-ultrafeedback-reasoning-iter_2-1731041913-armo-tokenized_harvard Viewer • Updated Nov 10, 2024 • 57.5k • 7
GitBag/llama3-ultrafeedback-reasoning-iter_2-1731041913-armo-tokenized Viewer • Updated Nov 10, 2024 • 57.5k • 6
GitBag/llama3-ultrafeedback-reasoning-iter_2-1731041913-armo Viewer • Updated Nov 10, 2024 • 60.8k • 7
GitBag/llama3-ultrafeedback-reasoning-armo-tokenized_harvard Viewer • Updated Nov 8, 2024 • 53.9k • 7
GitBag/rloo_ultrainteract_pair_lr_1e-8_555134_1729977727_eval Viewer • Updated Oct 27, 2024 • 500 • 4