GitBag/llama3-ultrafeedback-reasoning-iter_2-1732268914-tokenized_harvard Viewer • Updated Nov 25, 2024 • 226k • 7
GitBag/llama3-ultrafeedback-reasoning-iter_2-1732268914-tokenized Viewer • Updated Nov 24, 2024 • 226k • 7
GitBag/llama3-ultrafeedback-reasoning-iter_2-1732268914-armo Viewer • Updated Nov 24, 2024 • 60.8k • 7
GitBag/llama3-ultrafeedback-reasoning-ReRe-armo-tokenized_harvard Viewer • Updated Nov 21, 2024 • 229k • 7
GitBag/llama3-ultrafeedback-reasoning-iter_5-1731714556-armo-tokenized_harvard Viewer • Updated Nov 18, 2024 • 54.6k • 7
GitBag/llama3-ultrafeedback-reasoning-iter_5-1731714556-armo-tokenized Viewer • Updated Nov 18, 2024 • 54.6k • 7
GitBag/llama3-ultrafeedback-reasoning-iter_5-1731714556-armo Viewer • Updated Nov 18, 2024 • 60.8k • 7
GitBag/llama3-ultrafeedback-reasoning-iter_4-1731513485-armo-tokenized_harvard Viewer • Updated Nov 15, 2024 • 56.3k • 7
GitBag/llama3-ultrafeedback-reasoning-iter_4-1731513485-armo-tokenized Viewer • Updated Nov 15, 2024 • 56.3k • 7
GitBag/llama3-ultrafeedback-reasoning-iter_4-1731513485-armo Viewer • Updated Nov 15, 2024 • 60.8k • 7