BIS Reasoning 1.0: The First Large-Scale Japanese Benchmark for Belief-Inconsistent Syllogistic Reasoning Paper • 2506.06955 • Published Jun 8, 2025
deepseek-ai/DeepSeek-R1-0528 Text Generation • 685B • Updated May 29, 2025 • 1.1M • • 2.41k
nguyenthanhasia/NeuBAROCO_InconsistentSyllogisms_JA Viewer • Updated May 8, 2025 • 334 • 11 • 1
Running on CPU Upgrade 191 LLM Hallucination Leaderboard 🚀 191 View and filter LLM hallucination leaderboard